🚨GROK SCORES 26.9% ON "HUMANITY'S LAST EXAM" WITHOUT ANY TOOLS The scaling graph tells the story: more compute = better performance. Grok crushed over a quarter of the world's hardest academic benchmark using pure reasoning alone. No calculators, no external help. Just raw AI brainpower tackling 2,500 questions across every field of human knowledge. Most humans would fail this test even WITH tools. Grok's doing it blindfolded. Source: @xai @elonmusk
Mario Nawfal
Mario Nawfal10.7. klo 12.14
🚨"HUMANITY'S LAST EXAM" DROPPED: 2,500 QUESTIONS TO SEPARATE REAL AI FROM PRETENDERS X just unveiled the ultimate academic gauntlet - a benchmark so comprehensive it's meant to be the final test ever needed. Math dominates at 41%, followed by sciences and humanities. The name says it all: this is the exam to end all exams. Once AI aces this, what's left to prove? We're building the test that determines when machines officially outsmart us. Source: @xai @elonmusk
89,59K