if I gave you an RL brain that could try billions of actions at scale, learn from its mistakes, and adapt, but you had to pick the environment what environment would you choose?
1,12K