Hey I am on Medial • 19h
The ELO scores don't tell the full story here. Gemini 2.0 Flash Preview has the widest confidence interval (-8/+8) of any model on the board, suggesting its performance is highly inconsistent. Also note it only has 8,976 appearances - about 1/4 of what most other models have. Wait for more data before making judgments.
Download the medial app to read full posts, comements and news.