Back

Parampreet Singh

Python Developer 💻 ... • 6m

3B LLM outperforms 405B LLM 🤯 Similarly, a 7B LLM outperforms OpenAI o1 & DeepSeek-R1 🤯 🤯 LLM: llama 3 Datasets: MATH-500 & AIME-2024 This has done on research with compute optimal Test-Time Scaling (TTS). Recently, OpenAI o1 shows that Test-Time Scaling (TTS) can enhance the reasoning capabilities of LLMs by allocating additional computation at inference time, which improves LLM performance. In Simple terms: "think slowly with long Chain-of-Thought." But, By generating multiple outputs on a sample and picking the best one and training model again, which eventually leads to perform 0.5B LLM better than GPT-4o. But more computation. To make it efficient, they've used search based methods with the reward-aware Compute-optimal TTS. CC Paper: Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling 🔗 https://arxiv.org/abs/2502.06703 #Openai #LLM #GPT #GPTo1 #deepseek #llama3

1 Reply
5
Replies (1)

More like this

Recommendations from Medial

Image Description

Chirotpal Das

Building an AI eco-s... • 7m

I want a list of reasoning questions that openai o1 and/or deepseek r1 is failing to answer correctly. Quick help is much appreciated. Working on something and want to test it for reasoning capabilities.

1 Reply
5

Comet

#freelancer • 1y

Meet latest game-changer OpenAI o1 – an AI model built for deep thinking and unmatched performance - Takes extra time to think before responding for more accurate answers - Ranks in the 89th percentile in competitive coding (Codeforces) - Secure

See More
Reply
2
3

Swami Gadila

Founder of Friday AI • 2m

🚨 Open AI is an Wrapper👀🤯 Hot take, but let’s break it down logically: OpenAI is not a full-stack AI company — it’s a high-level wrapper over Azure and NVIDIA. Here’s why that matters 👇 🔹 1. Infra Backbone = Microsoft Azure Almost 90%+ of Op

See More
Reply
2
4
Image Description
Image Description

Vishu Bheda

 • 

Medial • 28d

𝗜 𝘀𝗽𝗲𝗻𝘁 𝟰+ 𝗵𝗼𝘂𝗿𝘀 𝗿𝗲𝘄𝗮𝘁𝗰𝗵𝗶𝗻𝗴 𝗞𝗮𝗿𝗽𝗮𝘁𝗵𝘆’𝘀 𝗬𝗖 𝗸𝗲𝘆𝗻𝗼𝘁𝗲. And I realized — we’ve been looking at LLMs the wrong way. They’re not just “AI models.” They’re a new kind of computer. • LLM = CPU • Context window = mem

See More
6 Replies
42
44

AI Engineer

AI Deep Explorer | f... • 5m

LLM Post-Training: A Deep Dive into Reasoning LLMs This survey paper provides an in-depth examination of post-training methodologies in Large Language Models (LLMs) focusing on improving reasoning capabilities. While LLMs achieve strong performance

See More
Reply
2
Image Description
Image Description

Harsh Dwivedi

 • 

Medial • 1m

GPT-5 Full Review & 10 Mind-Blowing Use Cases OpenAI has just launched its most awaited model yet: GPT-5. And it’s not just one step closer to AGI, but has almost entirely automated a lot of things using just simple prompts. In this video, we put t

See More
6 Replies
29
88
4
Image Description

Comet

#freelancer • 1m

GPT-5 is here.* This is the moment when AI stops being a shiny toy and becomes infrastructure. 👇 OpenAI has launched its new flagship model, and the focus is clear: reliability, power, and a change in how we interact with AI. *The Essentials o

See More
1 Reply
2
11
Image Description
Image Description

Vishu Bheda

 • 

Medial • 9m

Jensen Huang, the CEO of NVIDIA, describes how AI is advancing in three key dimensions: 1. Pre-training: This is like getting a college degree. AI models are trained on massive datasets to develop broad, general knowledge about the world. 2. Post-

See More
5 Replies
2
13

Nupur Tevatiya

India's AI Filmmakin... • 3m

🚨 SmolVLA is here — and it’s changing how we think about robotics AI. Hugging Face just released SmolVLA, a lightweight Vision-Language-Action model trained on community-shared datasets from their LeRobot platform. Despite being just 450M paramete

See More
Reply
3

Vedanshu Singh

Newbie • 3d

⚡ “AI won’t just crown new kings… it will create empires for those who sell the rails it runs on.” 👑🚂✨ Larry Ellison just turned Oracle into the picks & shovels giant of the AI boom and the market went wild. 1. The Shockwave → Oracle signs a $30

See More
Reply
1

Download the medial app to read full posts, comements and news.