Python Developer 💻 ... • 5m
A 3B LLM outperforms a 405B LLM 🤯 Similarly, a 7B LLM outperforms OpenAI o1 & DeepSeek-R1 🤯🤯

Model family: Llama 3
Datasets: MATH-500 & AIME-2024

This comes from research on compute-optimal Test-Time Scaling (TTS). OpenAI o1 recently showed that TTS can enhance the reasoning capabilities of LLMs by allocating additional computation at inference time. In simple terms: "think slowly with a long Chain-of-Thought."

By generating multiple outputs per problem and picking the best one, even a 0.5B LLM can be pushed past GPT-4o — but at the cost of much more computation. To make this efficient, the authors use search-based methods with reward-aware, compute-optimal TTS.

Paper: Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling
🔗 https://arxiv.org/abs/2502.06703

#Openai #LLM #GPT #GPTo1 #deepseek #llama3
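If you're wondering what "generate multiple outputs and pick the best one" looks like in practice, here's a minimal best-of-N sketch in Python. The model name and the toy scoring function are placeholders, not the paper's actual setup — the paper uses a trained process reward model and search-based strategies on top of this basic idea.

```python
# Minimal best-of-N test-time scaling sketch (illustrative only).
# Assumptions: the model name below is a stand-in for any small
# instruction-tuned LLM, and score() is a toy heuristic where the
# paper would use a trained process/outcome reward model.
from transformers import pipeline

N = 8  # number of candidate answers sampled per problem

generator = pipeline(
    "text-generation",
    model="meta-llama/Llama-3.2-3B-Instruct",  # hypothetical choice
)

def score(problem: str, answer: str) -> float:
    """Placeholder reward: prefers longer, step-by-step answers.
    A real setup scores each candidate with a reward model."""
    return answer.count("\n") + ("therefore" in answer.lower())

def best_of_n(problem: str) -> str:
    # Sample N independent chains of thought at inference time...
    candidates = generator(
        problem,
        num_return_sequences=N,
        do_sample=True,
        temperature=0.8,
        max_new_tokens=512,
    )
    answers = [c["generated_text"] for c in candidates]
    # ...then keep the one the (toy) reward model likes best.
    return max(answers, key=lambda a: score(problem, a))

print(best_of_n("Solve: if 3x + 5 = 20, what is x?"))
```

The extra compute here is just N forward passes at inference time — no retraining — which is why choosing N and the search strategy per problem (the "compute-optimal" part) matters so much.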
AI Deep Explorer | f... • 3m
LLM Post-Training: A Deep Dive into Reasoning LLMs

This survey paper provides an in-depth examination of post-training methodologies in Large Language Models (LLMs), focusing on improving reasoning capabilities. While LLMs achieve strong performance …
India's AI Filmmakin... • 2m
🚨 SmolVLA is here — and it's changing how we think about robotics AI. Hugging Face just released SmolVLA, a lightweight Vision-Language-Action model trained on community-shared datasets from their LeRobot platform. Despite being just 450M parameters …