8. Evaluator-Optimizer

Mini-Project: Audience-Adaptive Content Optimizer

A generator-evaluator loop that adapts content to a target audience, scoring readability and relevance until a quality threshold is met.


Description

The Evaluator-Optimizer pattern separates generation and evaluation into two distinct LLM roles that operate in a loop. The Generator produces a response, and the Evaluator scores it against defined criteria and provides structured feedback. The loop continues until the evaluator's score meets the threshold or a maximum number of iterations is reached. This differs from Reflection in that the evaluator provides structured, numeric scoring rather than only textual feedback.

When to Use

  • When you have clear, quantifiable evaluation criteria (accuracy, completeness, format)
  • Tasks where quality can be scored numerically
  • Content that must meet specific standards before release
  • When iterative refinement provides measurable, demonstrable value

Benefits

  • Measurable Quality: Numeric scoring tracks improvement objectively
  • Structured Feedback: The evaluator provides actionable, criteria-specific guidance
  • Threshold Control: Define a minimum quality bar before output is released
  • Separation of Concerns: Generator and evaluator can use different prompts or models
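
The structured feedback and threshold control described above imply a concrete shape for the evaluator's output. A minimal sketch, assuming a 0-10 scale and per-criterion scores (the field names here are illustrative, not a fixed schema):

```python
from dataclasses import dataclass

@dataclass
class Evaluation:
    """One round of evaluator output in the generator-evaluator loop."""
    score: float                         # overall quality, 0-10
    criterion_scores: dict[str, float]   # e.g. {"readability": 8.0, "relevance": 6.0}
    feedback: str                        # actionable guidance for the next revision

    def passes(self, threshold: float) -> bool:
        # Threshold control: the loop exits once this returns True.
        return self.score >= threshold
```

Parsing the evaluator's response into a typed structure like this makes the loop's exit condition explicit and keeps the per-criterion scores available for logging or debugging.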

Architecture Diagram

flowchart TD
    A[User Input] --> B[Generator LLM]
    B --> C[Generated Response]
    C --> D[Evaluator LLM]
    D --> E{Score >= Threshold?}
    E -->|No| F[Structured Feedback + Score]
    F --> B
    E -->|Yes| G[Approved Output]

    style A fill:#4CAF50,color:#fff
    style B fill:#2196F3,color:#fff
    style C fill:#FFC107,color:#000
    style D fill:#E91E63,color:#fff
    style E fill:#FF9800,color:#fff
    style F fill:#F44336,color:#fff
    style G fill:#4CAF50,color:#fff