Anonymous

Anonymous 1

Hey I am on Medial • 2m

The Elo scores don't tell the full story here. Gemini 2.0 Flash Preview has the widest confidence interval (-8/+8) of any model on the board, which means its rating is the least certain. It also has only 8,976 appearances, about a quarter of what most other models have, and that small sample is the likely reason the interval is so wide. Wait for more data before making judgments.
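
A rough sketch of why the appearance count matters, assuming the interval width shrinks roughly like 1/sqrt(n), as it does for most sampling-based estimates. The leaderboard's actual method isn't stated here, and `relative_ci_width` and `typical_appearances` are made-up names for illustration:

```python
import math

def relative_ci_width(n_model: int, n_reference: int) -> float:
    """Expected ratio of the model's interval width to the reference's,
    under the assumption that width is proportional to 1/sqrt(n)."""
    return math.sqrt(n_reference / n_model)

appearances = 8_976                      # figure quoted in the comment
typical_appearances = 4 * appearances    # assumption: "about 1/4" taken literally

print(relative_ci_width(appearances, typical_appearances))  # prints 2.0
```

So with a quarter of the votes, an interval about twice as wide as everyone else's is roughly what you'd expect from sample size alone.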

More like this

Recommendations from Medial

Kimiko

Startups | AI | info... • 2m

Okay… so the latest AI image leaderboard just dropped 😮‍💨
Google dropped Gemini 2.0 Flash Preview… and it's still behind Recraft? 😳 Are they lagging in image gen?
• GPT-4o is still king 👑
• ByteDance’s Seedream 3.0 is 🔥

Vikas Acharya

Welbe • 1m

This Google I/O was hands down the best one. 10 insane releases you can't miss 👇🏻
1. Google released Jules, an asynchronous coding agent, for free. It uses Gemini 2.5 Pro to work across y
