AI Model Performance Benchmarks:
Comparing Claude, GPT-4o, and Gemini Across Key Tasks
with Claude 3.5 Sonnet performing best overall, especially in code (93.7%) and reasoning (65.0%). Gemini 1.5 Pro excels in math (86.5% with 4-shot CoT). GPT-
DeepSeek R1 is 90% cheaper than o1, much better than o1-mini, and almost as good as o1 itself.
People using https://lumbni.tech were able to switch to DeepSeek R1 in barely a second with no downtime.
DM me to get free beta access.
5 replies • 13 likes
Chamarti Sreekar
Stealth • 4d
Just found an app that lets you compare grocery prices across different platforms!
App name: Quick Compare
OpenAI has released its new o1 AI model. The smarter but more expensive model is available only to ChatGPT Plus and Team users.
OpenAI Launches o1 Reasoning Model
#OpenAI #ChatGPTo1 #artificialintelligence
What a crazy week in AI 🤯
OpenAI dropped the o1 and o1 pro models. At the same time, Google quietly released its new model to the API, which is better than o1-preview according to benchmarks.
I'm not a Gemini fan, but Google is really working hard on this. What d
I am happy to announce that I won't be solving any Codeforces problems, as OpenAI's o1 model is probably better than me on that front
#meme #softwareengineering #openai #dsa
1 reply • 7 likes
The next billionaire
Stealth • 1m
Chinese vs. American AI race really starting to heat up 🇨🇳🇺🇸
DeepSeek's latest R1 model matches or beats OpenAI's o1 on almost everything
AND it's fully open source, unlike o1
wild times.
Credit: x/@itsolelehmann
3 replies • 8 likes
Chamarti Sreekar
Stealth • 1m
Sam Altman says the o3-mini will be worse than the o1 pro