I'm looking for good examples of reasoning model generalization. For example, a model incentivized via RL to think for a while and solve math problems gets better at creative writing. Is this common?