suppose you trained an RL agent to maximize reward across diverse environments
then if you dropped it into a new environment, the first question it’d learn to ask is “what is my reward function here?”
it might even learn to model the motives of its simulators to figure this out
“what is my goal/purpose” feels instrumentally convergent. I wonder if in some sense that’s why we seek god