I heard reinforcement learning only works with verifiable rewards? 😛 Congrats!!
Alexander Wei
Alexander WeiJul 19, 15:50
1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
31.12K