Start with the SDK quickstart

Build your first dataset, run a task, and compute quality metrics.
About the stack: LLM Stats runs on top of ZeroEval, an evaluation library developed by the same team.

What You Can Do

1. Install and authenticate
   Follow /python-sdk/installation to install zeroeval and configure ZEROEVAL_API_KEY.

2. Complete the first eval
   Run the walkthrough in /python-sdk/quickstart.

3. Master datasets
   Learn creation, loading, versioning/subsets, and multimodal data handling.

4. Productionize eval execution
   Add scoring, repetition, and resume/reliability controls.
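Before moving from step 1 to step 2, it can help to confirm that ZEROEVAL_API_KEY is actually visible to your Python process. The following is a minimal sketch using only the standard library; it does not call the zeroeval package, and the helper name get_zeroeval_api_key is hypothetical, not part of the SDK.

```python
import os


def get_zeroeval_api_key(env=None):
    """Return the key stored in ZEROEVAL_API_KEY, or raise if it is missing.

    `env` defaults to os.environ; passing a plain dict makes the check easy to test.
    This helper is illustrative only and is not provided by the zeroeval package.
    """
    env = os.environ if env is None else env
    key = env.get("ZEROEVAL_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "ZEROEVAL_API_KEY is not set; see /python-sdk/installation"
        )
    return key
```

Calling get_zeroeval_api_key() at the top of a script fails fast with a clear message instead of surfacing an authentication error partway through a run.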

Documentation Scope

  • Getting Started: setup and first run
  • Datasets: creation, loading, versioning, subsets, multimodal
  • Evals: execution, scoring, metrics, repetitions, resume
  • Examples: end-to-end text and multimodal workflows
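
To make the Evals topics concrete, here is a library-agnostic sketch of what scoring, repetitions, and resume mean in practice. It is an illustration under stated assumptions, not the ZeroEval API: run_with_resume, the JSON checkpoint file, and the task/score callables are all hypothetical names chosen for this example.

```python
import json
import statistics
from pathlib import Path


def run_with_resume(rows, task, score, repetitions, checkpoint_path):
    """Run `task` on every row `repetitions` times and return the mean score.

    Completed (row, repetition) pairs are saved to a JSON checkpoint, so a
    rerun after an interruption skips work that already finished.
    """
    path = Path(checkpoint_path)
    # Load earlier results if a checkpoint exists; otherwise start empty.
    done = json.loads(path.read_text()) if path.exists() else {}
    for i, row in enumerate(rows):
        for r in range(repetitions):
            key = f"{i}:{r}"
            if key in done:
                continue  # already scored in a previous run
            done[key] = score(row, task(row))
            # Persist after every result so progress survives a crash.
            path.write_text(json.dumps(done))
    return statistics.mean(list(done.values())) if done else 0.0
```

Writing the checkpoint after every result trades some I/O for the guarantee that at most one task call is lost on failure; a real SDK may batch or store this differently.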