AI Model Performance Benchmarks: 🚀
Comparing Claude, GPT-4o, and Gemini Across Key Tasks
with Claude 3.5 Sonnet performing best overall, especially in code (93.7%) and reasoning (65.0%). Gemini 1.5 Pro excels in math (86.5% with 4-shot CoT). GPT-
Startup People! How often do you use AI platforms like ChatGPT, Gemini, Claude etc to review and get a gist of your company's legalities, compliances, agreements, etc?
From my experience claued free version is better than chatgpt free version.
5 replies10 likes
Chirotpal Das
Stealth • 27d
I am 100% sure all the LLM benchmarks are, well let’s just say incomplete- they just don’t work in real world scenarios, they do good hypothetically.
We need domain and industry specific benchmarks and we need them now.
Anyone creating anything lik
See More
10 replies8 likes
Payal Manghnani
Stealth • 1d
AI as a life saver:
1. ChatGPT - thesis, essay, writing
2. Scite and perplexity - literature review
3. Consesus - latest research paper
4. Gemini - coding and technical
5. Claude AI - Analysis data, comparison data, literature review
List of conversational Ai apps in my phone, which I use in daily in my life
#1 Gemini from Google ✨
#2 perplexity ai 🔍
#3 pi ai from inflection Ai 🗣️
#4 copilot from Microsoft 📰
#5 chatgpt from openai 📞
#6 Claude from Anthropic 📁
+ Bonus tool
See More
18 replies8 likes
Atharva Pache
Stealth • 8m
What do you guys think about new updates in chatGPT and Gemini? I feel it's clear that these all big players will do whatever it takes to win the market they don't care about anything!
11 replies8 likes
Payal Manghnani
Stealth • 7m
Why Don't More People Like ChatGPT?
Hey Everyone!
I've been really interested in Al, especially cool tools like ChatGPT, Gemini ,Claude and make creative projects using ai.
They can do so many cool things!
But I recently found out that some peopl