Tag view

#evals

Cross-subject tag search for related interview cards.

Clear

Results update as you type. Press / to jump straight into search.

Tagged with evals

1 card

Artificial Intelligence Medium Theory

What are AI evals?

Evals are repeatable tests that measure whether a model or prompt setup performs well on the behaviors you care about.

  • Use task-specific test cases
  • Track regressions over time
  • Human review may still be needed

What are AI evals?