Hey I am on Medial • 10m
The ELO scores don't tell the full story here. Gemini 2.0 Flash Preview has the widest confidence interval (-8/+8) of any model on the board, suggesting its performance is highly inconsistent. Also note it only has 8,976 appearances - about 1/4 of what most other models have. Wait for more data before making judgments.
Founder | Agentic AI... • 1m
Anthropic just released Claude Opus 4.6. Here’s what’s new: 1) Smarter problem solving. It tackles complex tasks efficiently and doesn’t waste compute on simple ones. 2) 1M token context window. Enough to hold roughly 10 full novels in one session
See MoreDownload the medial app to read full posts, comements and news.