Back

Parampreet Singh

Python Developer ๐Ÿ’ป ...ย โ€ขย 2m

3B LLM outperforms 405B LLM ๐Ÿคฏ Similarly, a 7B LLM outperforms OpenAI o1 & DeepSeek-R1 ๐Ÿคฏ ๐Ÿคฏ LLM: llama 3 Datasets: MATH-500 & AIME-2024 This has done on research with compute optimal Test-Time Scaling (TTS). Recently, OpenAI o1 shows that Test-Time Scaling (TTS) can enhance the reasoning capabilities of LLMs by allocating additional computation at inference time, which improves LLM performance. In Simple terms: "think slowly with long Chain-of-Thought." But, By generating multiple outputs on a sample and picking the best one and training model again, which eventually leads to perform 0.5B LLM better than GPT-4o. But more computation. To make it efficient, they've used search based methods with the reward-aware Compute-optimal TTS. CC Paper: Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling ๐Ÿ”— https://arxiv.org/abs/2502.06703 #Openai #LLM #GPT #GPTo1 #deepseek #llama3

1 replies5 likes
Replies (1)

More like this

Recommendations from Medial

Image Description

Chirotpal Das

Building an AI eco-s...ย โ€ขย 3m

I want a list of reasoning questions that openai o1 and/or deepseek r1 is failing to answer correctly. Quick help is much appreciated. Working on something and want to test it for reasoning capabilities.

1 replies5 likes

Comet

#freelancerย โ€ขย 8m

Meet latest game-changer OpenAI o1 โ€“ an AI model built for deep thinking and unmatched performance - Takes extra time to think before responding for more accurate answers - Ranks in the 89th percentile in competitive coding (Codeforces) - Secure

See More
0 replies3 likes
2

Bhoop singh Gurjar

AI Deep Explorer | f...ย โ€ขย 23d

LLM Post-Training: A Deep Dive into Reasoning LLMs This survey paper provides an in-depth examination of post-training methodologies in Large Language Models (LLMs) focusing on improving reasoning capabilities. While LLMs achieve strong performance

See More
0 replies2 likes
Image Description
Image Description

Vishu Bheda

ย โ€ขย 

Medialย โ€ขย 5m

Jensen Huang, the CEO of NVIDIA, describes how AI is advancing in three key dimensions: 1. Pre-training: This is like getting a college degree. AI models are trained on massive datasets to develop broad, general knowledge about the world. 2. Post-

See More
5 replies13 likes
2
Image Description

Vikas Acharya

ย โ€ขย 

Welbeย โ€ขย 7d

Wake up, this is the GREATEST time to build a startup in 30 years.. The words of Greg Isenberg, CEO of latecheckoutplz ๐Ÿ‘‡๐Ÿป I say this as a 36 year old who's built/sold 3 companies, been part of companies that have raised billons and seeded multipl

See More
2 replies16 likes
12

Download the medial app to read full posts, comements and news.