Text-to-speech service implementation using Sarvam AI’s TTS API
SARVAM_API_KEY
.
TextFrame
- Text content to synthesize into speechTTSSpeakFrame
- Text that should be spoken immediatelyTTSUpdateSettingsFrame
- Runtime configuration updatesLLMFullResponseStartFrame
/ LLMFullResponseEndFrame
- LLM response boundariesTTSStartedFrame
- Signals start of synthesisTTSAudioRawFrame
- Generated audio data (PCM, WAV header stripped)TTSStoppedFrame
- Signals completion of synthesisErrorFrame
- API or processing errorsLanguage Code | Description | Service Code |
---|---|---|
Language.BN | Bengali | bn-IN |
Language.EN | English (India) | en-IN |
Language.GU | Gujarati | gu-IN |
Language.HI | Hindi | hi-IN |
Language.KN | Kannada | kn-IN |
Language.ML | Malayalam | ml-IN |
Language.MR | Marathi | mr-IN |
Language.OR | Odia | od-IN |
Language.PA | Punjabi | pa-IN |
Language.TA | Tamil | ta-IN |
Language.TE | Telugu | te-IN |
TTSUpdateSettingsFrame
for the SarvamTTSService
: