Trendaavat aiheet
#
Bonk Eco continues to show strength amid $USELESS rally
#
Pump.fun to raise $1B token sale, traders speculating on airdrop
#
Boop.Fun leading the way with a new launchpad on Solana.

Jasper
Co-founder and CEO @Hyperbolic_Labs. ex-@avax & ex-@citsecurities. Finished Math PhD in 2yrs @UCBerkeley. Math Olympiad Gold Medalist. Highest honor @PKU1898
We might be heading into a plot twist in the OpenAI vs. DeepMind IMO saga.
Just saw a post from Joseph Myers (involved in the Math Olympiad since 1992): the IMO committee reportedly asked AI labs not to publish results until 7 days after the closing ceremony — out of respect for human contestants (see my post yesterday) and likely to allow time for proper verification of AI submissions and formats.
According to Joseph, OpenAI didn’t collaborate with the IMO to test their model, and none of the 91 official IMO coordinators were involved in grading its solutions. Meanwhile, it seems DeepMind is following the rules and patiently waiting their turn.
For context:
The IMO has 6 problems, each worth 7 points. This year’s gold cutoff is 35 points. Even a small deduction could knock OpenAI down to silver. And from my read of their writeups, some parts might raise questions — and possibly cost points.
Terence Tao also pointed out that while the problems stay the same, testing formats matter. A student who wouldn’t get a bronze under standard conditions might strike gold with a modified setup — which raises real questions about what “solving the IMO” means for AI.
Next week might get spicy. Stay tuned.


87,2K
Just got off work and tried Grok-4 on an undergrad topology problem. It took 9 minutes to think and then confidently gave a clean, plausible, but totally wrong answer 😅
Don’t think this one qualifies as “skillfully adversarial.” AI models are crushing benchmarks — but still a long way ahead for real math AGI.



Elon Musk10.7. klo 16.47
Grok 4 is at the point where it essentially never gets math/physics exam questions wrong, unless they are skillfully adversarial.
It can identify errors or ambiguities in questions, then fix the error in the question or answer each variant of an ambiguous question.
662,99K
Grok got full score on AIME 🤯 We definitely need a better math benchmark for AI now


xAI10.7. klo 12.01
Introducing Grok 4, the world's most powerful AI model. Watch the livestream now:
2,98K
The future of AI is collaborative

Yuchen Jin9.7. klo 06.09
Sam Altman was asked how he felt about Zuck and Meta poaching OpenAI’s top talent.
“Fine... good...” he said.
Behind Jony Ive–designed glasses, I couldn’t see his eyes. But I could feel the pain.
It's not hard for Zuck to poach OpenAI talent, not just because he has the money, but because open-source AI is fulfilling the original OpenAI mission.
1,44K
Johtavat
Rankkaus
Suosikit
Ketjussa trendaava
Trendaa X:ssä
Viimeisimmät suosituimmat rahoitukset
Merkittävin