Top World Model Researchers in AI (2026)

In shortThe most influential world model researchers in 2026 are Yann LeCun (JEPA), Fei-Fei Li (spatial intelligence, World Labs), and Danijar Hafner (the Dreamer line), alongside pioneers including Pieter Abbeel, Chelsea Finn, David Ha, and Kaiming He. Below we profile 44 researchers spanning JEPA self-supervised models, video generation, 3D spatial intelligence, and model-based reinforcement learning.

What is a world model?

A world model is an AI system that learns an internal, predictive representation of how an environment evolves — letting an agent imagine future states, plan actions, and reason about cause and effect without acting in the real world first. World models span video generation, interactive simulation, 3D spatial intelligence, JEPA-style joint-embedding prediction, and model-based reinforcement learning.

Why it matters. Whoever controls world models controls the substrate that future agents, robots, and simulators are trained on. The investment surface has shifted from "can we build them?" to "who owns the deployment stack?" — making the scientists driving each architecture among the most consequential people in AI.

World model researchers by subfield

No single researcher leads every sub-area — the field splits into distinct specialties. Here is who leads each:

SubfieldLeading researchersWhy it mattersNotable work
Self-supervised world models (JEPA)Yann LeCun · Mahmoud Assran · Michael Rabbat · Saining Xie · Pascal Vincent · Ishan Misra · Piotr BojanowskiLearn predictive representations from video without generating pixels.A Path Towards Autonomous Machine Intelligence (JEPA)
Video generation as world simulationBill Peebles · Tim Brooks · Sergei Tulyakov · Cristóbal Valenzuela · Agrim Gupta · Zangwei ZhengLarge-scale video models that double as simulators of the visual world.Scalable Diffusion Models with Transformers (DiT)
3D & spatial intelligenceFei-Fei Li · Justin Johnson · Ben Mildenhall · Christoph Lassner · Aleksander Holynski · Jiajun WuPersistent, navigable 3D worlds reconstructed or generated from images and text.NeRF: Representing Scenes as Neural Radiance Fields
Model-based RL & agentsDanijar Hafner · David Ha · Nicklas Hansen · Jurgis Pasukonis · Pieter Abbeel · Eloi AlonsoAgents that learn and plan inside a learned, imagined world model.Mastering Diverse Domains through World Models (DreamerV3)
Interactive & playable world modelsTim Rocktäschel · Jack Parker-Holder · Stephen SpencerGenerate controllable, playable environments from a single prompt.Genie: Generative Interactive Environments
Driving & navigation world modelsAlex Kendall · Anthony Hu · Amir Bar · Gaoyue Zhou · Yunzhu LiPredictive world models for autonomous driving and visual navigation.Learning to Drive in a Day

The researchers

PioneersField-definers · 8

Pieter Abbeel

AcademiaFrontier lab

Professor of EECS, UC Berkeley · Amazon Scholar / Head of LLM efforts in AGI org, Amazon

PhD · Stanford University — Andrew Ng

Berkeley deep-RL/robot-learning leader; co-founded Covariant; now heads Amazon's LLM/AGI robotics efforts

Pioneer of deep RL and imitation learning. Drove UniSim and Large World Model, and recent generalist robot world models (DreamDojo) trained on tens of thousands of hours of human video.

Chelsea Finn

AcademiaStartup

Associate Professor of Computer Science & EE, Stanford University · Co-Founder, Physical Intelligence

PhD · UC Berkeley — Sergey Levine and Pieter Abbeel

Co-creator of MAML meta-learning; co-founder of Physical Intelligence (robot foundation models)

Pioneer of meta-learning and visual foresight for robots. Co-author of Physical Intelligence's pi-0.6 (RL self-improvement for VLAs) and Ctrl-World / VLAW, which use video world models to evaluate and improve robot policies.

Danijar Hafner

Startup

Co-founder, Embo

PhD · University of Toronto — Advised by Jimmy Ba (Vector Institute / U. Toronto)

Creator of the Dreamer line of world-model agents (DreamerV3, Dreamer 4); RL by imagination

Creator of the Dreamer line of model-based RL. Dreamer 4 (2025) was the first agent to obtain diamonds in Minecraft purely from offline data with no environment interaction — a landmark for training agents inside imagined world models.

Kaiming He

AcademiaFrontier lab

Associate Professor of EECS (tenured), MIT · Distinguished Scientist (part-time), Google DeepMind

PhD · The Chinese University of Hong Kong — Xiaoou Tang

Lead author of ResNet (deep residual learning); also Faster R-CNN, Mask R-CNN, MAE

Author of ResNet and Masked Autoencoders (MAE), among the most-cited works in deep learning. Recent generative work (JiT, improved Mean Flows) pushes diffusion-style models toward single-step, tokenizer-free generation — core infrastructure for video world models.

Yann LeCun

StartupAcademia

Founder & Executive Chairman, AMI Labs (Advanced Machine Intelligence) · Professor, New York University

PhD · Université Pierre et Marie Curie (Sorbonne / UPMC), Paris — PhD on backpropagation; later postdoc with Geoffrey Hinton at U. Toronto

Turing Award pioneer of CNNs/deep learning; champion of JEPA world-model architectures

Turing Award winner and originator of the Joint-Embedding Predictive Architecture (JEPA), the leading self-supervised alternative to autoregressive world models. His team's 2025–26 work extended JEPA into a deployable family — VL-JEPA, Causal-JEPA, and latent-action world models learned from in-the-wild video.

Fei-Fei Li

StartupAcademia

Co-founder & CEO, World Labs · Sequoia Professor of Computer Science; Co-Director, HAI, Stanford University

PhD · California Institute of Technology (Caltech) — Advised by Pietro Perona (and Christof Koch)

Created ImageNet; co-founder/CEO of World Labs building spatial-intelligence world models (Marble)

Creator of ImageNet and the intellectual leader of "spatial intelligence." Co-founded World Labs, which raised ~$1B and shipped Marble (persistent 3D worlds from text/image), the World API, and real-time interactive world models running on a single GPU.

Tim Rocktäschel

StartupAcademia

Co-founder & CEO, Recursive (Recursive Superintelligence) · Professor of Artificial Intelligence, University College London

PhD · University College London (UCL) — Advised by Sebastian Riedel

Led Genie foundation world models & Open-Endedness at DeepMind; UCL AI professor; co-founder of Recursive

Leads interactive world-model research at Google DeepMind, including the Genie line of foundation world models that generate playable, controllable environments from a single prompt.

Leading researchersShaping the field today · 30

Mahmoud Assran

Frontier lab

Research Scientist, Meta FAIR

PhD · McGill University — Mila (Quebec AI Institute)

Lead author of I-JEPA, V-JEPA and V-JEPA 2 self-supervised world-model architectures at Meta FAIR

Lead author of I-JEPA and V-JEPA, the image and video instantiations of LeCun's self-supervised world-model paradigm at Meta AI.

Piotr Bojanowski

Frontier lab

Research Scientist & Manager, Meta (FAIR)

PhD · Ecole Normale Superieure (ENS) Paris — WILLOW team; advisors Ivan Laptev, Jean Ponce, Cordelia Schmid, Josef Sivic

FAIR Paris research lead; co-author of fastText, DINO/DINOv2, and I-JEPA self-supervised models

Aleksander Holynski

Frontier labAcademia

Research Scientist, Google DeepMind · Assistant Professor, Columbia University

PhD · University of Washington — GRAIL / UW Graphics (advisors: Steve Seitz, Brian Curless, Rick Szeliski)

Google DeepMind research scientist & Columbia professor; generative video/3D and world-model research.

Anthony Hu

Startup

Researcher, General Intuition

PhD · University of Cambridge — Machine Intelligence Lab (advisor: Roberto Cipolla)

Led GAIA generative world models for autonomous driving at Wayve; now building world models at General Intuition.

Justin Johnson

StartupAcademia

Co-Founder, World Labs · Adjunct Assistant Professor, University of Michigan

PhD · Stanford University — Stanford Vision Lab (advisor: Fei-Fei Li)

Co-founder of World Labs; pioneered real-time neural style transfer and visual reasoning.

Co-founder of World Labs and a leader in 3D generation and visual reasoning; co-author of widely used vision and neural-style-transfer methods.

Lukasz Kaiser

Frontier lab

Member of Technical Staff (Research Scientist), OpenAI

PhD · RWTH Aachen University — Mathematical Foundations of Computer Science (advisor: Erich Grädel)

Co-author of 'Attention Is All You Need' (Transformer); research lead on OpenAI's o1 reasoning models.

Christoph Lassner

Startup

Co-founder, World Labs

PhD · University of Tübingen — Max Planck Institute for Intelligent Systems / Bernstein Center for Computational Neuroscience, Tübingen

Co-founder of World Labs; computer vision/graphics researcher in 3D/4D reconstruction and neural rendering (Pulsar).

Ishan Misra

Frontier lab

Director, Research Scientist, Meta Superintelligence Labs (TBD Labs)

PhD · Carnegie Mellon University — Robotics Institute (advised by Martial Hebert / Abhinav Gupta)

Self-supervised vision researcher; co-author of I-JEPA, DINO, and Movie Gen / Emu Video generation

Jack Parker-Holder

Frontier labAcademia

Research Scientist, Open-Endedness Team, Google DeepMind · Honorary Lecturer, University College London

PhD · University of Oxford — Stephen Roberts (Machine Learning Research Group)

Co-lead of DeepMind's Genie world models (Genie 1/2/3) for generative interactive environments

Genie lead at Google DeepMind, driving interactive foundation world models that generate controllable environments.

Michael Rabbat

Startup

VP of World Models, AMI Labs

PhD · University of Wisconsin-Madison — Robert Nowak (Electrical Engineering)

Co-creator of I-JEPA/V-JEPA self-supervised world models; now VP World Models at LeCun's AMI Labs

Cristóbal Valenzuela

Startup

Co-Founder & CEO, Runway

Co-founder & CEO of Runway; generative video moving into general world models (GWM-1)

CEO of Runway, which ships commercial generative world models for video (Gen-series, GWM-1) used widely in film and media production.

Pascal Vincent

Frontier labAcademia

Research Scientist, Meta (FAIR) · Adjunct Professor (Mila founding member), Universite de Montreal / Mila

PhD · Universite de Montreal — Yoshua Bengio (Mila)

Pioneer of denoising autoencoders & denoising score matching; co-author of I-JEPA self-supervised models

Jiajun Wu

Academia

Assistant Professor of Computer Science (courtesy in Psychology), Stanford University

PhD · MIT — William T. Freeman and Joshua B. Tenenbaum

Physical scene understanding; physics-engine-integrated and neuro-symbolic visual reasoning

Leads physics-based and 3D world-model research at Stanford, including multi-scale 3D world generation (WonderZoom) and intuitive-physics models.

Saining Xie

StartupAcademia

Co-founder & Chief Science Officer, AMI Labs · Assistant Professor of Computer Science (on leave), New York University

PhD · UC San Diego — Zhuowen Tu

Co-creator of Diffusion Transformers (DiT) and ConvNeXt; co-founder/CSO of AMI Labs (JEPA world models)

Co-creator of V-JEPA and a leader in generative representation learning; recent work on flow-map distillation and representation autoencoders set new image-generation SOTA.

Sherry Yang

AcademiaFrontier lab

Assistant Professor of Computer Science, NYU Courant · Staff Research Scientist, Google DeepMind

PhD · UC Berkeley — Pieter Abbeel

Generative world models / real-world simulators (UniSim) for embodied agents

Pioneer of treating video generation as a universal world model and simulator; co-author on Gemini Robotics and Veo-based policy evaluation at Google DeepMind.

Rising starsEmerging first-authors · 8

Amir Bar

AcademiaFrontier lab

Incoming Assistant Professor, Imperial College London · Research Scientist, Meta FAIR

PhD · Tel Aviv University / UC Berkeley — Advised by Amir Globerson and Trevor Darrell

Navigation World Models (CVPR 2025 Best Paper Honorable Mention); self-supervised visual world modeling

First author of Navigation World Models (CVPR 2025 Honorable Mention), a billion-parameter conditional diffusion transformer for visual navigation prediction.

Boyuan Chen

Academia

Dickinson Family Assistant Professor, Duke University

PhD · Columbia University — Hod Lipson's Creative Machines Lab

Dickinson Family Asst. Prof. at Duke; directs General Robotics Lab; self-modeling robots & automated scientific discovery

First author of Diffusion Forcing (NeurIPS 2024), a training paradigm unifying video prediction and world models; recent work on large video planners for robot control.

Yilun Du

AcademiaFrontier lab

Assistant Professor, Harvard University · Senior Research Scientist, Google DeepMind

PhD · Massachusetts Institute of Technology — Leslie Kaelbling, Tomas Lozano-Perez & Joshua Tenenbaum (MIT EECS)

Harvard SEAS Asst. Prof & Kempner Investigator; compositional/energy-based generative world models

Pioneer of composable diffusion and energy-based world models; frequent collaborator on video-model-as-simulator robotics work.

Tsun-Hsuan Wang

StartupAcademia

Co-Founder, Genesis AI · PhD Candidate, MIT CSAIL

PhD · Massachusetts Institute of Technology — Daniela Rus (MIT CSAIL Distributed Robotics Lab)

MIT CSAIL roboticist; co-founder of Genesis AI universal robotics foundation model startup

Co-lead on the Genesis physics engine for embodied AI, reported 10–80× faster than prior simulators.

Companies building in world model

  • AMI Labs — Yann LeCun's lab; JEPA world models predicting in representation space
  • Black Forest Labs — FLUX visual models unifying perception, generation, reasoning
  • Decart — Real-time interactive world models at low latency
  • Elorian — Andrew Dai's multimodal visual reasoning / world model startup
  • General Intuition — World models from gameplay video (DIAMOND team); games and the real world
  • Genesis AI — Genesis World physics simulation + general-purpose robots
  • Genie (Google DeepMind) — Real-time interactive world model generating playable 3D environments
  • Genmo — Open video generation models (Mochi)
  • Higgsfield — Controllable video generation and motion models
  • HPC-AI Tech — Open-Sora open video generation; Colossal-AI training infrastructure
  • Luma AI — AI 3D creation and video generation (Dream Machine)
  • Moonlake AI — Interactive, physics-accurate world models from Stanford team
  • Morpheus — Real-time interactive video world models for physical AI
  • Niantic Spatial — Large geospatial model; visual positioning and 3D maps
  • Nucleus4D — Gaussian-splatting 3D capture; digital twins and spatial data
  • Odyssey — Real-time multimodal world models (Starchild-1)
  • Reka AI — Multimodal foundation models across text, image, video
  • Runway — Video generation and general world models (GWM-1)
  • SpAItial — AI-native 3D foundation models turning images into interactive worlds
  • Twelve Labs — Video-native multimodal AI for understanding and search
  • Wayve — GAIA driving world models for autonomous vehicles
  • World Labs — Fei-Fei Li's spatial intelligence lab; Marble 3D world generation

Frequently asked questions

Who is the most influential researcher in world models?

Yann LeCun (AMI Labs / NYU) is widely regarded as the most influential, having defined the JEPA self-supervised paradigm that anchors one of the two dominant approaches to world models. Fei-Fei Li (World Labs / Stanford) and Danijar Hafner (creator of the Dreamer line of model-based RL) are the other two figures most often cited as field-defining.

What is a world model in AI?

A world model is an AI system that learns a predictive internal model of an environment, enabling an agent to imagine future outcomes and plan rather than relying solely on trial-and-error in the real world. Examples include DeepMind's Dreamer, Meta's JEPA, OpenAI's Sora, and World Labs' Marble.

Which companies are leading in world models?

World Labs leads on 3D spatial world models, Google DeepMind on interactive simulation (Genie) and video (Veo), Meta AI on the JEPA architecture, and Physical Intelligence applies world models to robotics.

How is this guide compiled?

I run agents that continuously track new papers, lab and company blogs, GitHub releases, and conference talks across the field. Researchers are grouped into three tiers — Pioneers (field-definers), Leading researchers (shaping the field today), and Rising stars (recent breakthrough first-authors) — based on landmark publications, conference best-paper awards, citation velocity, and leadership of the labs and companies shipping production world models. Every profile links to primary sources so you can verify it. The roster is refreshed quarterly.

More on the radar