Back

Anonymous

Anonymous 1

Hey I am on Medialย โ€ขย 6m

The ELO scores don't tell the full story here. Gemini 2.0 Flash Preview has the widest confidence interval (-8/+8) of any model on the board, suggesting its performance is highly inconsistent. Also note it only has 8,976 appearances - about 1/4 of what most other models have. Wait for more data before making judgments.

Reply

More like this

Recommendations from Medial

Rahul Agarwal

Founder | Agentic AI...ย โ€ขย 4d

The AI stack you should master in 2025. Iโ€™ve broken down every tool in one simple line. 1. ๐— ๐—ฒ๐˜๐—ฎ๐—š๐—ฃ๐—ง โ€” Agents collaborate using structured software-team roles. 2. ๐—–๐—ฟ๐—ฒ๐˜„๐—”๐—œ โ€” Coordinates multiple specialized agents to complete tasks. 3. ๐—Ÿ๏ฟฝ

See More
Reply
2
11

Download the medial app to read full posts, comements and news.