> ## Documentation Index > Fetch the complete documentation index at: https://docs.pipecat.ai/llms.txt > Use this file to discover all available pages before exploring further. # Gladia > Speech-to-text service implementation using Gladia's API ## Overview `GladiaSTTService` provides real-time speech recognition using Gladia's WebSocket API with support for 99+ languages, custom vocabulary, translation, sentiment analysis, and advanced audio processing features for comprehensive transcription. Pipecat's API methods for Gladia STT integration Complete example with interruption handling Official Gladia documentation and features Access multilingual transcription and API keys ## Installation To use Gladia services, install the required dependency: ```bash theme={null} uv add "pipecat-ai[gladia]" ``` ## Prerequisites ### Gladia Account Setup Before using Gladia STT services, you need: 1. **Gladia Account**: Sign up at [Gladia](https://www.gladia.io/) 2. **API Key**: Generate an API key from your account dashboard 3. **Region Selection**: Choose your preferred region (EU-West or US-West) ### Required Environment Variables * `GLADIA_API_KEY`: Your Gladia API key for authentication * `GLADIA_REGION`: Your preferred region (optional, defaults to "eu-west") ## Configuration ### GladiaSTTService Gladia API key for authentication. Region used to process audio. Defaults to `"eu-west"` when `None`. Gladia API URL for session initialization. Audio encoding format. Init-only -- not part of runtime-updatable settings. Audio bit depth. Init-only -- not part of runtime-updatable settings. Number of audio channels. Init-only -- not part of runtime-updatable settings. Audio sample rate in Hz. When `None`, uses the pipeline's configured sample rate. Model to use for transcription. *Deprecated in v0.0.105. Use `settings=GladiaSTTService.Settings(...)` instead.* Additional configuration parameters. *Deprecated in v0.0.105. Use `settings=GladiaSTTService.Settings(...)` instead.* Runtime-configurable settings for the STT service. See [Settings](#settings) below. Maximum size of audio buffer in bytes (default 20MB). Whether the bot should be interrupted when Gladia VAD detects user speech. P99 latency from speech end to final transcript in seconds. Override for your deployment. See [stt-benchmark](https://github.com/pipecat-ai/stt-benchmark). ### Settings Runtime-configurable settings passed via the `settings` constructor argument using `GladiaSTTService.Settings(...)`. These can be updated mid-conversation with `STTUpdateSettingsFrame`. See [Service Settings](/pipecat/fundamentals/service-settings) for details. | Parameter | Type | Default | Description | | -------------------------------------- | -------------------------- | ------- | ------------------------------------------------------------------------------------- | | `model` | `str` | `None` | STT model identifier. *(Inherited from base STT settings.)* | | `language` | `Language \| str` | `None` | Language for speech recognition. *(Inherited from base STT settings.)* | | `language_config` | `LanguageConfig` | `None` | Detailed language configuration with code switching support. | | `custom_metadata` | `Dict[str, Any]` | `None` | Additional metadata to include with requests. | | `endpointing` | `float` | `None` | Silence duration in seconds to mark end of speech. | | `maximum_duration_without_endpointing` | `int` | `5` | Maximum utterance duration (seconds) without silence. | | `pre_processing` | `PreProcessingConfig` | `None` | Audio pre-processing options (audio enhancer, speech threshold). | | `realtime_processing` | `RealtimeProcessingConfig` | `None` | Real-time processing features (custom vocabulary, translation, NER, sentiment). | | `messages_config` | `MessagesConfig` | `None` | WebSocket message filtering options. | | `enable_vad` | `bool` | `False` | Enable Gladia VAD for end-of-utterance detection. Use without other VAD in the agent. | ## Usage ### Basic Setup ```python theme={null} from pipecat.services.gladia.stt import GladiaSTTService stt = GladiaSTTService( api_key=os.getenv("GLADIA_API_KEY"), ) ``` ### With Language Configuration ```python theme={null} from pipecat.services.gladia.stt import GladiaSTTService from pipecat.services.gladia.config import LanguageConfig stt = GladiaSTTService( api_key=os.getenv("GLADIA_API_KEY"), region="us-west", settings=GladiaSTTService.Settings( model="solaria-1", language_config=LanguageConfig( languages=["en", "es"], code_switching=True, ), ), ) ``` ### With Real-time Processing ```python theme={null} from pipecat.services.gladia.stt import GladiaSTTService from pipecat.services.gladia.config import ( RealtimeProcessingConfig, CustomVocabularyConfig, CustomVocabularyItem, TranslationConfig, ) stt = GladiaSTTService( api_key=os.getenv("GLADIA_API_KEY"), settings=GladiaSTTService.Settings( realtime_processing=RealtimeProcessingConfig( custom_vocabulary=True, custom_vocabulary_config=CustomVocabularyConfig( vocabulary=[ CustomVocabularyItem(value="Pipecat", intensity=0.8), "Gladia", ], ), translation=True, translation_config=TranslationConfig( target_languages=["fr", "de"], model="enhanced", ), ), ), ) ``` ## Notes * **Session-based connection**: Gladia uses a two-step connection process: first an HTTP POST to initialize a session, then a WebSocket connection to the returned session URL. The session URL and ID are managed automatically. * **Audio buffering**: The service buffers audio data locally and sends it when connected. If the connection drops and reconnects, buffered audio is automatically re-sent to minimize transcript gaps. * **Keepalive**: Empty audio chunks are sent periodically to keep the Gladia connection alive (keepalive interval: 5s, timeout: 20s). * **Built-in VAD**: Set `enable_vad=True` in Settings to use Gladia's server-side VAD, which emits `UserStartedSpeakingFrame` and `UserStoppedSpeakingFrame`. When using this, do not enable another VAD in your pipeline. * **Translation**: Gladia supports real-time translation to multiple target languages. Translation results are pushed as `TranslationFrame`s. The `GladiaInputParams` / `params=` pattern is deprecated as of v0.0.105. Use `Settings` / `settings=` instead. See the [Service Settings guide](/pipecat/fundamentals/service-settings) for migration details. ## Event Handlers Gladia STT supports the standard [service connection events](/api-reference/server/events/service-events): | Event | Description | | ----------------- | ---------------------------------- | | `on_connected` | Connected to Gladia WebSocket | | `on_disconnected` | Disconnected from Gladia WebSocket | ```python theme={null} @stt.event_handler("on_connected") async def on_connected(service): print("Connected to Gladia") ```