A new world wodel from Meta FAIR after all the chaos. 🌍 Meet DINO-world: a generalist video world model that predicts the future—in latent space. Trained on uncurated videos with DINOv2, it learns diverse temporal dynamics (driving, indoors, sims), beats prior models on segmentation & depth, and even grasps intuitive physics. Bonus: it can be fine-tuned for action-conditioned planning.
21,35K