Text-to-speech service implementation using LMNT’s streaming API
Set your LMNT API key in the LMNT_API_KEY environment variable.
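As a minimal sketch of reading that key at startup, the helper below fails early with a clear message when the variable is missing. The function name `get_lmnt_api_key` is hypothetical and not part of any library.

```python
import os
from typing import Mapping


def get_lmnt_api_key(env: Mapping[str, str] = os.environ) -> str:
    """Return the LMNT API key, raising if it is missing or blank.

    `env` is injectable so the helper can be exercised without touching
    the real process environment.
    """
    key = env.get("LMNT_API_KEY", "").strip()
    if not key:
        raise RuntimeError("LMNT_API_KEY is not set")
    return key
```

Calling `get_lmnt_api_key()` once at startup surfaces a missing key before any connection to the service is attempted.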
Input frames:
- TextFrame: Text content to synthesize into speech
- TTSSpeakFrame: Text that should be spoken immediately
- TTSUpdateSettingsFrame: Runtime configuration updates
- LLMFullResponseStartFrame / LLMFullResponseEndFrame: LLM response boundaries

Output frames:
- TTSStartedFrame: Signals start of synthesis
- TTSAudioRawFrame: Generated audio data chunks (streaming PCM)
- TTSStoppedFrame: Signals completion of synthesis
- ErrorFrame: WebSocket or API errors

Supported languages:

Language Code | Description | Service Code |
---|---|---|
Language.DE | German | de |
Language.EN | English | en |
Language.ES | Spanish | es |
Language.FR | French | fr |
Language.HI | Hindi | hi |
Language.ID | Indonesian | id |
Language.IT | Italian | it |
Language.JA | Japanese | ja |
Language.KO | Korean | ko |
Language.NL | Dutch | nl |
Language.PL | Polish | pl |
Language.PT | Portuguese | pt |
Language.RU | Russian | ru |
Language.SV | Swedish | sv |
Language.TH | Thai | th |
Language.TR | Turkish | tr |
Language.UK | Ukrainian | uk |
Language.VI | Vietnamese | vi |
Language.ZH | Chinese | zh |
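
The table can be read as a lookup from a language enum member to the code sent to the service. The sketch below illustrates that lookup with a stand-in `Language` enum; the enum definition, the `SERVICE_CODES` name, and the `to_service_code` helper are assumptions made for illustration, not the library's own implementation.

```python
from enum import Enum
from typing import Optional

# Stand-in for the framework's Language enum (assumption). AR is included
# only to demonstrate a language that is absent from the table.
Language = Enum(
    "Language",
    "DE EN ES FR HI ID IT JA KO NL PL PT RU SV TH TR UK VI ZH AR",
)

# Service codes copied from the table.
SERVICE_CODES = {
    Language.DE: "de",
    Language.EN: "en",
    Language.ES: "es",
    Language.FR: "fr",
    Language.HI: "hi",
    Language.ID: "id",
    Language.IT: "it",
    Language.JA: "ja",
    Language.KO: "ko",
    Language.NL: "nl",
    Language.PL: "pl",
    Language.PT: "pt",
    Language.RU: "ru",
    Language.SV: "sv",
    Language.TH: "th",
    Language.TR: "tr",
    Language.UK: "uk",
    Language.VI: "vi",
    Language.ZH: "zh",
}


def to_service_code(language: Language) -> Optional[str]:
    """Return the service code for a language, or None if it is not listed."""
    return SERVICE_CODES.get(language)
```

Returning `None` for an unlisted language leaves the fallback decision (default to English, raise, or skip synthesis) to the caller.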

Commonly used languages include:
- Language.EN: English
- Language.ES: Spanish
- Language.FR: French
- Language.DE: German
- Language.ZH: Chinese
- Language.JA: Japanese

Initialize the LmntTTSService and use it in a pipeline:
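
A minimal sketch of that setup follows, assuming a recent Pipecat release where the service is importable from `pipecat.services.lmnt.tts` (older releases may expose it from a different module). The `build_pipeline` helper, its `llm` and `transport` parameters, and the `"morgan"` voice ID are placeholders chosen for illustration.

```python
import os


def build_pipeline(llm, transport):
    """Assemble a pipeline that speaks LLM output through LMNT.

    Imports are kept inside the function so this module can be loaded
    even where pipecat is not installed. `llm` and `transport` are
    assumed to be an already configured LLM service and transport.
    """
    from pipecat.pipeline.pipeline import Pipeline
    from pipecat.services.lmnt.tts import LmntTTSService  # path may vary by version

    tts = LmntTTSService(
        api_key=os.environ["LMNT_API_KEY"],
        voice_id="morgan",  # placeholder voice ID (assumption)
    )

    # TTS sits after the LLM and before the transport output, so text
    # frames from the LLM are turned into audio frames for playback.
    return Pipeline([transport.input(), llm, tts, transport.output()])
```

Placing the TTS service directly before the transport output keeps synthesized audio frames from passing through processors that only expect text.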
Update settings at runtime by pushing a TTSUpdateSettingsFrame for the LmntTTSService:
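
One way this might look is sketched below, assuming the frame accepts a `settings` mapping and that the TTS service honors a `"voice"` key; both are assumptions about the installed Pipecat version, and the `switch_voice` helper is hypothetical.

```python
async def switch_voice(task, voice_id: str) -> None:
    """Queue a settings update that changes the TTS voice mid-conversation.

    `task` is assumed to be a running PipelineTask. The import is local so
    this module loads even where pipecat is not installed.
    """
    from pipecat.frames.frames import TTSUpdateSettingsFrame

    # The "voice" key is an assumption; check the settings your version accepts.
    await task.queue_frame(TTSUpdateSettingsFrame(settings={"voice": voice_id}))
```

Because the update travels through the pipeline as a frame, it is applied in order relative to any text already queued for synthesis.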