muy emocionado de ver a dónde vamos a partir de aquí con los modelos de os
pash
pash19 jul, 09:58
I'd like to point out that for the real world tasks (not benchmarks), Kimi K2 outperforms Gemini. This is telemetry across all @cline users, showing diff edit failure rate. Notice how Kimi has about a 6% failure rate, which is significantly better than Gemini's ~ 10% error rate. Remarkably, Kimi even surpassed Claude 4 for most of this week, achieving a sub 4% failure rate!
7,46K