All these math/stats/theory people who were working on understanding LLM generalization sort of gave up 2-3 years ago, accepted it was a mystery, and just moved on to just doing hardcore empirical work
1,33K