Speech-to-text service implementation using Soniox’s WebSocket API
SonioxSTTService
is a speech-to-text (STT) service that integrates with Soniox’s WebSocket API to provide real-time transcription capabilities. It processes audio input and produces transcription frames and interim transcription frames in real time, supporting over 60 languages. Supports custom context, multiple languages in same conversation and more.
SonioxSTTService
, you need to install the Soniox dependencies:
SONIOX_API_KEY
.
TranscriptionFrame
- Final transcription resultsInterimTranscriptionFrame
- Real-time transcription updatesErrorFrame
- Connection or processing errorsLanguage.EN_GB
will be treated same as Language.EN
. See Supported Languages for a list of supported languages.
You can learn more about language hints in the Soniox documentation.
SonioxSTTService
processes your speech and has two ways of knowing when to finalize the text.
vad_force_turn_endpoint
to True
. This will disable the automatic endpoint detection and force the service to return transcription results as soon as the user stops talking.