Optimize your agent deployments to minimize cold starts
--min-agents
parameter to 1 in order to avoid scaling from zero and let Pipecat Cloud handle the rest.
However, for applications where traffic can fluctuate, you may need to plan for additional warm capacity to ensure your agents are always ready to respond immediately. For those cases, this guide will help you understand how warm capacity works in Pipecat Cloud, when you need to plan to use reserves, and how you can optimize your plan for both performance and cost.
--min-agents
deployment setting, ensuring immediate availability regardless of current traffic.min-agents
)Reserved | Active | Warm Capacity |
---|---|---|
10 | 1 | 10 |
10 | 10 | 10 |
1 | 10 | 10 |
min-agents: 0
to minimize costs during developmentmin-agents
to cover your baseline traffic to avoid cold startsScenario | Baseline | CPS | Idle Creation Delay | Calculation | Optimal Reserved |
---|---|---|---|---|---|
High volume | 10 | 1.0 (1 call/sec) | 30s | MAX(10, 1.0 × 30) | 30 |
Medium volume | 10 | 0.5 (1 call/2sec) | 30s | MAX(10, 0.5 × 30) | 15 |
Low volume | 10 | 0.1 (1 call/10sec) | 30s | MAX(10, 0.1 × 30) | 10 |