65. World Models
Mini-Project: World Model: Simple Grid World Simulator
An agent that uses an internal world model to simulate thousands of action sequences in imagination before committing to the best plan, then verifies the advantage over a random baseline in terms of steps to reach the goal.
Description
World Models enable agents to build internal simulations of their environment and use them to plan actions without actually executing them. The agent learns a predictive model of how the world works (state transitions, physics, consequences of actions) and uses this model to "imagine" the outcomes of different plans before committing to one.
World models are critical for sample-efficient learning: instead of trying every action in the real world, the agent can simulate thousands of plans internally and pick the best one.
Architecture Diagram
flowchart TD
A[Real Observation] --> B[World Model]
B --> C[Predicted Next State]
C --> D[Plan in Imagination]
D --> E{Best Plan Found?}
E -->|No| D
E -->|Yes| F[Execute in Real World]
F --> G[New Observation]
G --> H[Update World Model]
H --> B
style B fill:#9C27B0,color:#fff
style D fill:#2196F3,color:#fff
style F fill:#4CAF50,color:#fff
Key Models/Systems
| Model | Creator | Description |
|---|---|---|
| Dreamer v3 | Google DeepMind | World model for RL agents, learns from images |
| GAIA-1 | Wayve | World model for autonomous driving |
| UniSim | Universal simulator from real-world data | |
| Genie | Google DeepMind | Generative interactive environment model |