# eval

> Run behavioral evals against a Pipecat agent, individually or as a suite

Run scenario-based behavioral evals. `pipecat eval run` tests scenarios against an already-running agent; `pipecat eval suite` spawns the agents listed in a manifest and runs their scenarios concurrently. Both exit `0` when everything passes and `1` otherwise.

If `pipecat-ai[cli]` is a dependency of your project, run these commands with `uv run pipecat eval`. They're also available as `python -m pipecat.evals`.

See the [Pipecat Evals guide](/pipecat/evals/overview) for concepts, the scenario format, and manifests.

## eval run

Run one or more scenarios against an already-running agent (started with `-t eval`).

**Usage:**

```shell
pipecat eval run [OPTIONS] SCENARIOS...
```

**Arguments:**

- One or more scenario YAML files.

**Options:**

- WebSocket URL of the agent's eval transport.
- Print a line for each turn and expectation as it resolves.
- Record each scenario's conversation audio (audio-mode scenarios).
- Directory for `--audio` recordings: `/.wav`.
- Directory for cached synthesized user audio. Defaults to `/pipecat/tts`.
- Disable the user-audio cache: re-synthesize every turn (no reads or writes).
- Default per-expectation timeout in seconds, for expectations without their own `within_ms`.
- Directory for each scenario's logs: `/.eval.log` (plus `.debug.log` under `--debug`).
- Also save `.debug.log` with the harness's full per-pipeline logs.
- Cancel the agent's pipeline (exit it) after the run. By default the agent is left running so it can serve more scenarios.
- Fire the bot's `on_client_disconnect` callback when the eval client disconnects. Bots often cancel their pipeline there, so it's off by default. A scenario's `trigger_disconnect:` field opts in on its own.
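Because `pipecat eval run` exits `0` when everything passes and `1` otherwise, a wrapper script can gate on the return code. The following is a minimal sketch, not part of Pipecat: the helper names `build_eval_run_command` and `run_scenarios` are hypothetical, and it assumes the `pipecat` executable is on `PATH` and an agent is already running with `-t eval`. The only flag it uses is `-v`, as it appears in the examples on this page.

```python
import subprocess
from typing import List, Sequence


def build_eval_run_command(scenarios: Sequence[str], verbose: bool = False) -> List[str]:
    """Build the argv for `pipecat eval run` (hypothetical helper, not a Pipecat API)."""
    if not scenarios:
        raise ValueError("at least one scenario file is required")
    command = ["pipecat", "eval", "run", *scenarios]
    if verbose:
        # `-v` is the verbose flag shown in this page's examples.
        command.append("-v")
    return command


def run_scenarios(scenarios: Sequence[str], verbose: bool = False) -> bool:
    """Run the scenarios and return True only when the CLI exits with 0."""
    # check=False: a failing eval is reported through the return code,
    # so we inspect it ourselves instead of raising CalledProcessError.
    result = subprocess.run(build_eval_run_command(scenarios, verbose), check=False)
    return result.returncode == 0
```

Calling `run_scenarios(["scenarios/capital_question.yaml"])` from a CI step and exiting non-zero when it returns `False` would propagate the eval result to the surrounding pipeline.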
## eval suite

Spawn the agents in a manifest and run their scenarios concurrently. Everything except the `suite:` list can be set in the manifest or overridden on the command line (the command line wins).

**Usage:**

```shell
pipecat eval suite [OPTIONS] MANIFEST_PATH
```

**Arguments:**

- Manifest YAML listing agents and their scenarios.

**Options:**

- Only run bots whose path contains this substring.
- Only run the scenario with this name.
- Run subdirectory name under `runs_dir`. Defaults to a timestamp.
- Output base, overriding the manifest's `runs_dir`. A `/` subdirectory with `logs/` and `recordings/` is created under it. Defaults to `eval-runs`.
- Override the manifest's `bots_dir` (bot paths are relative to it).
- Override the manifest's `scenarios_dir`.
- Override the manifest's `concurrency` (how many runs execute at once).
- Override the manifest's `base_port` (default `7900`). Each run gets `base_port + index`.
- Override the manifest's `cache_dir` for cached synthesized user audio.
- Disable the user-audio cache: re-synthesize every turn (no reads or writes).
- Default per-expectation timeout in seconds, for expectations without their own `within_ms`.
- Override the manifest's spawn template. Default: `"{python} {bot} -t eval --port {port}"`.
- Override the Python interpreter used to spawn each agent.
- Record conversation audio.
- Also save `.debug.log` with the harness's full per-pipeline logs.

## Examples

```shell
# Run one scenario against a running agent
pipecat eval run scenarios/capital_question.yaml

# Run a batch of scenarios, verbosely
pipecat eval run scenarios/*.yaml -v

# Run a full suite
pipecat eval suite manifest.yaml

# Only the support agent, 8 runs at a time, named output dir
pipecat eval suite manifest.yaml -p support -c 8 -n nightly
```
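To picture how the spawn template and `base_port` options of `eval suite` fit together, here is a rough sketch of expanding the default template once per run. It rests on several assumptions that are not confirmed by this page: that placeholders are filled in the style of Python's `str.format`, that each run's port is `base_port` offset by a zero-based run index, and that runs are numbered in list order. The function name `spawn_commands` and the bot paths are hypothetical, and Pipecat's actual implementation may differ.

```python
import sys
from typing import List, Sequence

# Defaults quoted from the option descriptions above.
DEFAULT_TEMPLATE = "{python} {bot} -t eval --port {port}"
DEFAULT_BASE_PORT = 7900


def spawn_commands(
    bots: Sequence[str],
    template: str = DEFAULT_TEMPLATE,
    base_port: int = DEFAULT_BASE_PORT,
    python: str = sys.executable,
) -> List[str]:
    """Expand the spawn template for each bot (illustrative only)."""
    return [
        # Assumption: the run index starts at 0, so the first run uses base_port itself.
        template.format(python=python, bot=bot, port=base_port + index)
        for index, bot in enumerate(bots)
    ]


if __name__ == "__main__":
    # Hypothetical bot paths, for demonstration.
    for command in spawn_commands(["bots/support.py", "bots/sales.py"], python="python3"):
        print(command)
```

Under these assumptions, every concurrent run listens on its own port, which is why a custom spawn template still needs to pass `{port}` through to the agent.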