Tutorials

Step-by-step guides for common tasks. Each tutorial takes 15-30 minutes.

Available Tutorials

Tutorial	What You’ll Build	Prerequisites
Bias Testing	Detect bias across model responses	First Run
Model Comparison	Compare GPT-4 vs Claude systematically	API keys
CI Integration	Add diff-gating to GitHub Actions	Git basics
Custom Probe	Build your own evaluation probe	Python basics

Tutorial Format

Each tutorial follows the same structure:

Goal - What you’ll accomplish
Prerequisites - What you need before starting
Steps - Numbered instructions with code
Verification - How to confirm it worked
Next Steps - Where to go from here

Choosing Your Tutorial

graph TD
    Q1{What do you want to do?} --> A1[Evaluate model fairness]
    Q1 --> A2[Compare models]
    Q1 --> A3[Automate testing]
    Q1 --> A4[Extend the framework]

    A1 --> T1[Bias Testing Tutorial]
    A2 --> T2[Model Comparison Tutorial]
    A3 --> T3[CI Integration Tutorial]
    A4 --> T4[Custom Probe Tutorial]

Before You Start

Make sure you’ve completed Getting Started:

insideLLMs installed
First run completed
Understand basic concepts (models, probes, harness)

Tutorials

Available Tutorials

Tutorial Format

Choosing Your Tutorial

Before You Start

Table of contents