31. Agentic RAG

Mini-Project: Technical Documentation Assistant

An agentic RAG system that intelligently routes queries across vector search, keyword search, and web search sources, reformulating queries and retrying when results are insufficient.

View on GitHub

Description

Agentic RAG (Retrieval-Augmented Generation) goes beyond simple retrieve-then-generate by giving the agent the ability to decide when, what, and how to retrieve. Instead of always retrieving before answering, the agent reasons about whether retrieval is needed, formulates targeted queries, evaluates retrieved documents for relevance, and may perform multiple retrieval rounds. The agent can also choose between different retrieval sources (vector DB, web search, SQL database).

When to Use

Question-answering over large document collections
When queries require multi-step retrieval (initial search, then follow-up)
Systems with multiple knowledge sources that need intelligent routing
When retrieved documents need relevance filtering before use

Benefits

Benefit	Description
Precision	Agent crafts targeted queries instead of naive retrieval
Multi-Source	Can query different stores based on the question type
Self-Correcting	Detects poor retrievals and retries with refined queries
Efficiency	Skips retrieval when the LLM already knows the answer

Architecture Diagram

flowchart TD
    A[User Question] --> B[Agent Reasoning]
    B --> C{Retrieval Needed?}
    C -->|No| D[Answer from Knowledge]
    C -->|Yes| E[Formulate Query]
    E --> F{Which Source?}
    F -->|Vector DB| G[Semantic Search]
    F -->|Web| H[Web Search]
    F -->|SQL| I[Database Query]
    G --> J[Evaluate Relevance]
    H --> J
    I --> J
    J --> K{Sufficient?}
    K -->|No| E
    K -->|Yes| L[Generate Answer with Context]

    style A fill:#4CAF50,color:#fff
    style B fill:#2196F3,color:#fff
    style C fill:#FF9800,color:#fff
    style E fill:#9C27B0,color:#fff
    style G fill:#00BCD4,color:#fff
    style H fill:#00BCD4,color:#fff
    style I fill:#00BCD4,color:#fff
    style J fill:#E91E63,color:#fff
    style L fill:#4CAF50,color:#fff