*once again looks over at the Days Without a Crashout sign on my desk and grits teeth while getting back to writing up the generalized reasoning model that speaks in broken English and got IMO gold, I'm having a totally great day how are you?*
Owain Evans
Owain Evans23.7. klo 00.06
New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
8,13K