Recently Openai, Goolge reached IMO Gold Medal's with their new experimental models. But our team reached the same level with just o4-mini-high and our agent systems. And now we are opensourcing it. Especially we got insane improvements with the USAMO benchmarks. The base line was almost 0 but our agent got average 90%. Also we could prove theoreticaly the recent arxiv papers just giving the key research Idea.
84,34K