I want a list of reasoning questions that openai o1 and/or deepseek r1 is failing to answer correctly. Quick help is much appreciated.
Working on something and want to test it for reasoning capabilities.
1 replies5 likes
Aura
AI Specialist | Rese... • 8m
Revolutionizing AI with Inference-Time Scaling: OpenAI's o1 Model"
Inference-time Scaling: Focuses on improving performance during inference (when the model is used) rather than just training.
Reasoning through Search: The o1 model enhances reasonin
Anthropic has unveiled Claude 3.7 Sonnet, its most advanced AI yet and the first hybrid reasoning model. It combines rapid responses with deep, step-by-step reasoning, redefining AI problem-solving.
0 replies
Niket Raj Dwivedi
•
Medial • 1m
Most people I know are getting overly-reliant on AI for thinking/reasoning and strategy. This will have longterm negative impact on individuals as they'll become incapable of reasoning all together.
OpenAI launches o3-mini, a new AI reasoning model on Friday.
Here are the highlights -
-> More reliable: Fact-checks before responding, excelling in STEM fields like programming, math, and science.
-> Faster & cheaper: 63% lower cost than o1-min
See More
4 replies6 likes
Shivam Sharma
AI & ML engineer • 3m
I just used the DeepSeek's Deep Think (R1) model and I think it's incredible.
What do you think, have you used it till now?
Let discuss below 😊😊 👇 👇 👇
🚨 WHY GPT-4o IS A GAME-CHANGER 👇
- #2 non-reasoning model overall
- TIED for #1 in coding & hard prompts (w/ Gemini 2.5 Pro)
- BEST for coding + creative writing 🎨💻
- OUTPERFORMS Claude 3.7 & Gemini 2.0 in non-reasoning tests 🤯
- CRUSHE
See More
1 replies11 likes
Abhay Kumar
Undergraduate BCA St... • 1y
Is online internship is better or offline intership is better . reason why?