I want a list of reasoning questions that OpenAI o1 and/or DeepSeek R1 fail to answer correctly. Quick help is much appreciated.
I'm working on something and want to test its reasoning capabilities.
1 reply • 5 likes
Aura
AI Specialist | Rese... • 7m
Revolutionizing AI with Inference-Time Scaling: OpenAI's o1 Model
Inference-time Scaling: Focuses on improving performance during inference (when the model is used) rather than just training.
Reasoning through Search: The o1 model enhances reasonin
Anthropic has unveiled Claude 3.7 Sonnet, its most advanced AI yet and the first hybrid reasoning model. It combines rapid responses with deep, step-by-step reasoning, redefining AI problem-solving.
0 replies
Niket Raj Dwivedi
Medial • 12d
Most people I know are becoming overly reliant on AI for thinking, reasoning, and strategy. This will have a long-term negative impact on individuals, as they'll become incapable of reasoning altogether.
OpenAI launched o3-mini, a new AI reasoning model, on Friday.
Here are the highlights:
-> More reliable: Fact-checks before responding, excelling in STEM fields like programming, math, and science.
-> Faster & cheaper: 63% lower cost than o1-min
4 replies • 6 likes
Shivam Sharma
AI & ML engineer • 2m
I just used DeepSeek's Deep Think (R1) model and I think it's incredible.
What do you think? Have you used it yet?
Let's discuss below.
WHY GPT-4o IS A GAME-CHANGER
- #2 non-reasoning model overall
- TIED for #1 in coding & hard prompts (w/ Gemini 2.5 Pro)
- BEST for coding + creative writing
- OUTPERFORMS Claude 3.7 & Gemini 2.0 in non-reasoning tests 🤯
- CRUSHE
1 reply • 11 likes
Abhay Kumar
Undergraduate BCA St... • 11m
Is an online internship better, or an offline internship? Why?