
A 3B LLM outperforms a 405B LLM 🤯
Similarly, a 7B LLM outperforms OpenAI o1 & DeepSeek-R1 🤯 🤯

LLM: Llama 3
Datasets: MATH-500 & AIME-2024

This comes from research on compute-optimal Test-Time Scaling (TTS). Recently, OpenAI o1 showed that Test-

Anonymous

Anonymous 1


Foundation • 2d

Interesting

