65. World Models

Mini-Project: World Model: Simple Grid World Simulator

An agent that uses an internal world model to simulate thousands of action sequences in imagination before committing to the best plan, then verifies the advantage over a random baseline in terms of steps to reach the goal.

View on GitHub

Description

World Models enable agents to build internal simulations of their environment and use them to plan actions without actually executing them. The agent learns a predictive model of how the world works (state transitions, physics, consequences of actions) and uses this model to "imagine" the outcomes of different plans before committing to one.

World models are critical for sample-efficient learning: instead of trying every action in the real world, the agent can simulate thousands of plans internally and pick the best one.

Architecture Diagram

flowchart TD
    A[Real Observation] --> B[World Model]
    B --> C[Predicted Next State]
    C --> D[Plan in Imagination]
    D --> E{Best Plan Found?}
    E -->|No| D
    E -->|Yes| F[Execute in Real World]
    F --> G[New Observation]
    G --> H[Update World Model]
    H --> B

    style B fill:#9C27B0,color:#fff
    style D fill:#2196F3,color:#fff
    style F fill:#4CAF50,color:#fff

Key Models/Systems

Model	Creator	Description
Dreamer v3	Google DeepMind	World model for RL agents, learns from images
GAIA-1	Wayve	World model for autonomous driving
UniSim	Google	Universal simulator from real-world data
Genie	Google DeepMind	Generative interactive environment model