Guides

How-to guides for specific tasks and integrations.

Available Guides

Guide	Description
Caching	Speed up runs with response caching
Rate Limiting	Handle API rate limits gracefully
Experiment Tracking	Log runs to W&B, MLflow, TensorBoard
Local Models	Run Ollama, llama.cpp, vLLM
Troubleshooting	Common issues and solutions

Guides vs Tutorials

Tutorials	Guides
Learning-oriented	Task-oriented
Step-by-step walkthroughs	Focused how-tos
Build something complete	Solve a specific problem
“How do I get started?”	“How do I do X?”

Quick Solutions

Speed Up Runs

# Enable async execution
async: true
concurrency: 10

insidellms harness config.yaml --async --concurrency 10

Reduce Costs

# Limit examples and enable caching
max_examples: 50
cache:
  enabled: true
  backend: sqlite

Run Offline

# Use DummyModel for testing
models:
  - type: dummy

Debug Failures

# Check environment
insidellms doctor

# Validate artifacts
insidellms validate ./my_run

# Verbose output
insidellms run config.yaml --verbose

Common Patterns

Development Workflow

# 1. Test with dummy model
insidellms run config.yaml --model-override dummy

# 2. Small sample with real model
insidellms run config.yaml --max-examples 10

# 3. Full run
insidellms run config.yaml

CI Workflow

# 1. Run candidate
insidellms harness config.yaml --run-dir ./candidate

# 2. Diff against baseline
insidellms diff ./baseline ./candidate --fail-on-changes

Guides

Available Guides

Guides vs Tutorials

Quick Solutions

Speed Up Runs

Reduce Costs

Run Offline

Debug Failures

Common Patterns

Development Workflow

CI Workflow

Table of contents