Skip to content

38. Introspective MCTS (I-MCTS)

Mini-Project: Introspective MCTS

An MCTS variant that adds a self-reflection step after each simulation — comparing sibling nodes to extract insights, then feeding those insights into future expansions to progressively bias the search toward more promising paths.

View on GitHub


Description

Introspective MCTS (I-MCTS) adds self-reflection to the MCTS search process. After each simulation, the agent reflects on why certain paths succeeded or failed and uses those insights to guide future search. This introspective step biases the search toward more promising areas based on learned reasoning patterns, not just statistical value estimates.

How It Works

Standard MCTS + an introspection step after each simulation that generates a "lesson learned" and adjusts the heuristic used for selection and expansion.

Diagram

flowchart TD
    A[Select Node] --> B[Expand]
    B --> C[Simulate]
    C --> D[Reflect: Why did this succeed/fail?]
    D --> E[Update Heuristics]
    E --> F[Backpropagate]
    F --> A

    style D fill:#9C27B0,color:#fff
    style E fill:#FF9800,color:#fff