Deploying an agent
Deploy your agent to Pipecat Cloud with a single command:
Deploying to a specific region
Agents deploy to your organization’s default region, which is us-west by default. You can deploy to a specific region to reduce latency or meet data residency requirements:
Agent names are globally unique across all regions. For multi-region deployments, deploy uniquely named agents to each target region (e.g., my-agent-us-west, my-agent-us-east).
Deploying with a custom image
If you prefer to build and push images to your own container registry, you can specify an image directly:
Using a private container registry? See our Container Registry guides for setup instructions for GCP Artifact Registry, AWS ECR, and other private registries.
Deployment status
Once the deployment completes, Pipecat Cloud returns its status, or any errors that occurred during the deployment process.
Using pcc-deploy.toml
To support shareable deployment configurations, you can create a pcc-deploy.toml file in the root of your project (adjacent to where you run your CLI commands).
The CLI will automatically detect this file and use it to populate the deployment configuration. Any CLI arguments you provide will override the values in the pcc-deploy.toml file.
An example pcc-deploy.toml file might look like this:
To deploy with a custom image, set the image field (and image_credentials for private registries):
Agent Profiles
Agent profiles determine the CPU and memory resources allocated to your deployed agents. Choose the profile that best matches your agent’s requirements:
agent-1x (default): 0.5 vCPU and 1 GB of memory. Best for voice agents.
agent-2x: 1 vCPU and 2 GB of memory. Well suited for voice and video agents or voice agents requiring extra processing.
agent-3x: 1.5 vCPU and 3 GB of memory. Best for voice and video agents requiring extra processing or multiple video inputs.
Agent profiles affect resource allocation and impact your usage costs. Choose
the smallest profile that meets your agent’s performance requirements.
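As a rough aid for applying the "smallest profile that fits" advice, the sketch below encodes the three profiles listed above and picks the first one that satisfies a given requirement. The `AgentProfile` class and `smallest_profile` function are hypothetical helpers, not part of Pipecat Cloud, and the resource estimates you pass in are your own measurements.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AgentProfile:
    name: str
    vcpu: float
    memory_gb: float


# Values taken from the profile list in this guide, ordered smallest to largest.
PROFILES = [
    AgentProfile("agent-1x", 0.5, 1.0),
    AgentProfile("agent-2x", 1.0, 2.0),
    AgentProfile("agent-3x", 1.5, 3.0),
]


def smallest_profile(vcpu_needed: float, memory_gb_needed: float) -> Optional[AgentProfile]:
    """Return the smallest profile covering both requirements, or None if none does."""
    for profile in PROFILES:
        if profile.vcpu >= vcpu_needed and profile.memory_gb >= memory_gb_needed:
            return profile
    return None


# An agent measured at roughly 0.75 vCPU and 1 GB of memory.
choice = smallest_profile(0.75, 1.0)
print(choice.name if choice else "no listed profile is large enough")
```

Because the list is ordered by size, the first match is also the cheapest option that meets both requirements.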
Checking status of a deployment
You can check the status of a deployment using the CLI:
A healthy deployment reports a ready status.
Modifying a deployment
Agents are referenced as a mutable manifest in Pipecat Cloud. Pushing a new image or configuration to the same agent name updates the agent manifest, and the updated manifest is used for any new requests.
When there is no new deployment, instances are returned to the pool after a session ends and remain available to serve new sessions during a cooldown period. A new deployment changes this behavior: instances running the prior image are terminated once their sessions conclude, rather than being recycled.
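The recycling rule described above can be modeled in a few lines. This is a toy illustration of the described behavior, not Pipecat Cloud's scheduler; the `AgentPool` class, its method names, and the image tags are hypothetical, and the cooldown period is not modeled.

```python
from dataclasses import dataclass


@dataclass
class AgentPool:
    """Toy model: tracks only which image the current manifest points to."""

    current_image: str

    def deploy(self, image: str) -> None:
        # Pushing to the same agent name updates the manifest for new requests.
        self.current_image = image

    def end_session(self, instance_image: str) -> str:
        # Instances on the current image are recycled; older ones are terminated.
        if instance_image == self.current_image:
            return "returned to pool"
        return "terminated"


pool = AgentPool(current_image="my-agent:0.1")
before = pool.end_session("my-agent:0.1")   # no new deployment yet

pool.deploy("my-agent:0.2")
after = pool.end_session("my-agent:0.1")    # session started on the prior image
print(before, "/", after)
```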
Failed deployments
If a deployment fails (i.e., fails to enter a ready state), requests will be routed to any prior deployments in a ready state.
This ensures that your agent remains available to end-users even when a deployment fails.
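A minimal sketch of this fallback, assuming deployments are tracked as an ordered history: requests go to the most recent deployment that is ready. The `route_target` function, the dictionary layout, and the "failed" status label are assumptions made for illustration; only the ready state is named in this guide.

```python
from typing import Optional


def route_target(deployments: list[dict]) -> Optional[str]:
    """Pick the image that should serve requests.

    `deployments` is ordered oldest to newest; each entry has an "image"
    and a "status". The newest entry with status "ready" wins.
    """
    for deployment in reversed(deployments):
        if deployment["status"] == "ready":
            return deployment["image"]
    return None  # nothing is ready, so there is nowhere to route


history = [
    {"image": "my-agent:0.1", "status": "ready"},
    {"image": "my-agent:0.2", "status": "failed"},  # assumed status label
]
target = route_target(history)
print(target)
```

Here the failed 0.2 deployment is skipped and traffic stays on 0.1.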
Delete a deployment
If you no longer need a deployment, you can delete it using the delete command: