I like that Artificial Analysis is open about how they evaluate models and makes data public, it is a real service. However, I see folks citing their Intelligence Index as a metric without realizing it is an average of the same correlated, semi-saturated benchmarks everyone uses
16,76K