I want a list of reasoning questions that openai o1 and/or deepseek r1 is failing to answer correctly. Quick help is much appreciated.
Working on something and want to test it for reasoning capabilities.
1 replies5 likes
Aura
AI Specialist | Rese... • 9m
Revolutionizing AI with Inference-Time Scaling: OpenAI's o1 Model"
Inference-time Scaling: Focuses on improving performance during inference (when the model is used) rather than just training.
Reasoning through Search: The o1 model enhances reasonin
Anthropic has unveiled Claude 3.7 Sonnet, its most advanced AI yet and the first hybrid reasoning model. It combines rapid responses with deep, step-by-step reasoning, redefining AI problem-solving.
0 replies
Niket Raj Dwivedi
•
Medial • 2m
Most people I know are getting overly-reliant on AI for thinking/reasoning and strategy. This will have longterm negative impact on individuals as they'll become incapable of reasoning all together.
Apple just exposed the truth behind so-called AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini:
They’re not actually reasoning — they’re just really good at memorizing patterns.
Here’s what Apple found:
0 replies18 likes
Ayush
Let's build together... • 4m
OpenAI launches o3-mini, a new AI reasoning model on Friday.
Here are the highlights -
-> More reliable: Fact-checks before responding, excelling in STEM fields like programming, math, and science.
-> Faster & cheaper: 63% lower cost than o1-min
See More
4 replies6 likes
Chamarti Sreekar
Passionate about Pos... • 1d
AI is no longer just catching up — in many areas, it’s already ahead.
According to the 2025 AI Index report,
models now outperform humans in most technical tasks.
From image classification to visual reasoning — machines are setting the pace.
In j