Text-to-speech service implementation using Deepgram’s Aura API
Set your Deepgram API key in the DEEPGRAM_API_KEY environment variable.
- TextFrame - Text content to synthesize into speech
- TTSSpeakFrame - Text that should be spoken immediately
- TTSUpdateSettingsFrame - Runtime configuration updates
- LLMFullResponseStartFrame / LLMFullResponseEndFrame - LLM response boundaries
- TTSStartedFrame - Signals start of synthesis
- TTSAudioRawFrame - Generated audio data chunks (streaming)
- TTSStoppedFrame - Signals completion of synthesis
- ErrorFrame - API or processing errors

Voice Model | Description | Language |
---|---|---|
aura-2-helena-en | Natural female voice | English |
aura-2-andromeda-en | Expressive female voice | English |
aura-helios-en | Warm male voice | English |
aura-luna-en | Conversational female voice | English |
aura-stella-en | Professional female voice | English |
aura-zeus-en | Authoritative male voice | English |
Initialize DeepgramTTSService with your API key and use it in your pipeline:
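A minimal sketch of that setup follows. The import paths (`pipecat.services.deepgram.tts`, `pipecat.pipeline.pipeline`) and the `api_key` / `voice` keyword names are assumptions based on recent Pipecat releases, so check them against your installed version. The helper names (`deepgram_tts_kwargs`, `build_tts`, `build_pipeline`) are hypothetical and exist only for illustration.

```python
import os

# Default voice taken from the voice table; any listed model name could be used.
DEFAULT_VOICE = "aura-2-helena-en"


def deepgram_tts_kwargs(voice: str = DEFAULT_VOICE) -> dict:
    """Collect constructor arguments, failing early if the API key is missing."""
    api_key = os.environ.get("DEEPGRAM_API_KEY")
    if not api_key:
        raise RuntimeError("Set the DEEPGRAM_API_KEY environment variable")
    return {"api_key": api_key, "voice": voice}


def build_tts(voice: str = DEFAULT_VOICE):
    """Create the TTS service (import path and keywords are assumptions)."""
    # Imported lazily so the helper above can be used without pipecat installed.
    from pipecat.services.deepgram.tts import DeepgramTTSService

    return DeepgramTTSService(**deepgram_tts_kwargs(voice))


def build_pipeline(before: list, after: list, voice: str = DEFAULT_VOICE):
    """Place the TTS service between caller-supplied processors."""
    from pipecat.pipeline.pipeline import Pipeline

    return Pipeline([*before, build_tts(voice), *after])
```

In a typical bot the TTS service sits after the LLM and before the transport output, so a call might look like `build_pipeline([transport.input(), stt, llm], [transport.output()])`, where `transport`, `stt`, and `llm` are placeholders for processors you have already created.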
Update settings at runtime by pushing a TTSUpdateSettingsFrame for the DeepgramTTSService:
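One way this could look is sketched below. It assumes that `TTSUpdateSettingsFrame` accepts a `settings` mapping, that a `"voice"` key is honored by the service, and that the running `PipelineTask` exposes an awaitable `queue_frame` method; newer Pipecat releases may use a different settings shape. `voice_settings` and `switch_voice` are hypothetical helpers, and the voice set only mirrors the table in this document, which may not be exhaustive.

```python
# Voice models copied from the table in this document (possibly incomplete).
KNOWN_VOICES = {
    "aura-2-helena-en",
    "aura-2-andromeda-en",
    "aura-helios-en",
    "aura-luna-en",
    "aura-stella-en",
    "aura-zeus-en",
}


def voice_settings(voice: str) -> dict:
    """Build the settings mapping for a voice change, rejecting unlisted names."""
    if voice not in KNOWN_VOICES:
        raise ValueError(f"Unlisted voice model: {voice}")
    return {"voice": voice}


async def switch_voice(task, voice: str) -> None:
    """Queue a runtime settings update on a running pipeline task (assumed API)."""
    # Imported lazily so voice_settings() works without pipecat installed.
    from pipecat.frames.frames import TTSUpdateSettingsFrame

    await task.queue_frame(TTSUpdateSettingsFrame(settings=voice_settings(voice)))
```

Inside an async handler this would be called as `await switch_voice(task, "aura-luna-en")`. Drop the membership check if you rely on voices that are not in the table.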