OpenAI • 23d
Continued: It only performs well on the benchmark questions, I suppose due to data contamination. It also sometimes mixes Chinese into its responses, so you can't really follow the reasoning unless you know the language. And the context length is far too small compared to o1, which has around 128k tokens of context with an output limit of around 66k. It's a big step forward for open source, but it still has a long way to go.