Similarly, Gemini 2.5 Pro definitely outperforms o3 in any reasoning/logic/code-related context for me.
Ernest Ryu · 20 Jul, 06:30
5. In my experience using LLMs for math research, Gemini outperforms ChatGPT. We will see if the next-gen models (which seem to be what OpenAI and GDM are using for IMO) perform at research-level math. (5/10)