> ## Documentation Index > Fetch the complete documentation index at: https://docs.pipecat.ai/llms.txt > Use this file to discover all available pages before exploring further. # ElevenLabs > Speech-to-text service implementation using ElevenLabs' file-based transcription API ## Overview ElevenLabs provides two STT service implementations: * **`ElevenLabsSTTService`** (HTTP) -- File-based transcription using ElevenLabs' Speech-to-Text API with segmented audio processing. Uploads audio files and receives transcription results directly. * **`ElevenLabsRealtimeSTTService`** (WebSocket) -- Real-time streaming transcription with ultra-low latency, supporting both partial (interim) and committed (final) transcripts with manual or VAD-based commit strategies. Pipecat's API methods for ElevenLabs STT integration Complete example with ElevenLabs STT and TTS Official ElevenLabs STT API documentation Access API keys and speech-to-text models ## Installation To use ElevenLabs STT services, install the required dependencies: ```bash theme={null} uv add "pipecat-ai[elevenlabs]" ``` ## Prerequisites ### ElevenLabs Account Setup Before using ElevenLabs STT services, you need: 1. **ElevenLabs Account**: Sign up at [ElevenLabs Platform](https://elevenlabs.io/) 2. **API Key**: Generate an API key from your account dashboard 3. **Model Access**: Ensure access to the Scribe v2 transcription model (default: `scribe_v2`) ### Required Environment Variables * `ELEVENLABS_API_KEY`: Your ElevenLabs API key for authentication ## ElevenLabsSTTService ElevenLabs API key for authentication. An aiohttp session for HTTP requests. You must create and manage this yourself. Base URL for the ElevenLabs API. Model ID for transcription. *Deprecated in v0.0.105. Use `settings=ElevenLabsSTTService.Settings(...)` instead.* Audio sample rate in Hz. When `None`, uses the pipeline's configured sample rate. Runtime-configurable settings for the STT service. See [Settings](#settings) below. Configuration parameters for the STT service. *Deprecated in v0.0.105. Use `settings=ElevenLabsSTTService.Settings(...)` instead.* P99 latency from speech end to final transcript in seconds. Override for your deployment. ### Settings Runtime-configurable settings passed via the `settings` constructor argument using `ElevenLabsSTTService.Settings(...)`. These can be updated mid-conversation with `STTUpdateSettingsFrame`. See [Service Settings](/pipecat/fundamentals/service-settings) for details. | Parameter | Type | Default | Description | | ------------------ | ----------------- | ------------- | ------------------------------------------------------------------------ | | `model` | `str` | `None` | Model ID for transcription. *(Inherited from base STT settings.)* | | `language` | `Language \| str` | `Language.EN` | Target language for transcription. *(Inherited from base STT settings.)* | | `tag_audio_events` | `bool` | `True` | Include audio events like (laughter), (coughing) in transcription. | | `keyterms` | `list[str]` | `None` | List of key terms or phrases to bias transcription towards. | ### Usage ```python theme={null} import aiohttp from pipecat.services.elevenlabs.stt import ElevenLabsSTTService async with aiohttp.ClientSession() as session: stt = ElevenLabsSTTService( api_key=os.getenv("ELEVENLABS_API_KEY"), aiohttp_session=session, ) ``` #### With Language and Audio Events ```python theme={null} import aiohttp from pipecat.services.elevenlabs.stt import ElevenLabsSTTService from pipecat.transcriptions.language import Language async with aiohttp.ClientSession() as session: stt = ElevenLabsSTTService( api_key=os.getenv("ELEVENLABS_API_KEY"), aiohttp_session=session, settings=ElevenLabsSTTService.Settings( language=Language.ES, tag_audio_events=False, ), ) ``` ### Notes * The HTTP service uploads complete audio segments and is best for VAD-segmented transcription. * Does not have connection events since it uses per-request HTTP calls. * **Multilingual support**: ElevenLabs Scribe supports 99+ languages. The default is `Language.EN` (English). Set `language=None` in settings to enable automatic language detection, which will transcribe whatever language the user speaks. ## ElevenLabsRealtimeSTTService ElevenLabs API key for authentication. Base URL for the ElevenLabs WebSocket API. Model ID for real-time transcription. *Deprecated in v0.0.105. Use `settings=ElevenLabsRealtimeSTTService.Settings(...)` instead.* Audio sample rate in Hz. When `None`, uses the pipeline's configured sample rate. Runtime-configurable settings for the Realtime STT service. See [Settings](#settings-2) below. How to segment speech. `CommitStrategy.MANUAL` uses Pipecat's VAD to control when transcript segments are committed. `CommitStrategy.VAD` uses ElevenLabs' built-in VAD for segment boundaries. Whether to include word-level timestamps in transcripts. Whether to enable logging on ElevenLabs' side. Whether to include language detection in transcripts. Configuration parameters for the STT service. *Deprecated in v0.0.105. Use `settings=ElevenLabsRealtimeSTTService.Settings(...)` instead.* P99 latency from speech end to final transcript in seconds. Override for your deployment. ### Settings Runtime-configurable settings passed via the `settings` constructor argument using `ElevenLabsRealtimeSTTService.Settings(...)`. These can be updated mid-conversation with `STTUpdateSettingsFrame`. See [Service Settings](/pipecat/fundamentals/service-settings) for details. | Parameter | Type | Default | Description | | ---------------------------- | ----------------- | ------- | --------------------------------------------------------------------------------------- | | `model` | `str` | `None` | Model ID for transcription. *(Inherited from base STT settings.)* | | `language` | `Language \| str` | `None` | Language for speech recognition. *(Inherited from base STT settings.)* | | `keyterms` | `list[str]` | `None` | List of key terms or phrases to bias transcription towards. | | `vad_silence_threshold_secs` | `float` | `None` | Seconds of silence before VAD commits (0.3-3.0). Only used with VAD commit strategy. | | `vad_threshold` | `float` | `None` | VAD sensitivity (0.1-0.9, lower is more sensitive). Only used with VAD commit strategy. | | `min_speech_duration_ms` | `int` | `None` | Minimum speech duration for VAD (50-2000ms). Only used with VAD commit strategy. | | `min_silence_duration_ms` | `int` | `None` | Minimum silence duration for VAD (50-2000ms). Only used with VAD commit strategy. | ### Usage ```python theme={null} from pipecat.services.elevenlabs.stt import ElevenLabsRealtimeSTTService stt = ElevenLabsRealtimeSTTService( api_key=os.getenv("ELEVENLABS_API_KEY"), ) ``` #### With Timestamps and Custom Commit Strategy ```python theme={null} from pipecat.services.elevenlabs.stt import ElevenLabsRealtimeSTTService, CommitStrategy stt = ElevenLabsRealtimeSTTService( api_key=os.getenv("ELEVENLABS_API_KEY"), language_code="eng", commit_strategy=CommitStrategy.VAD, include_timestamps=True, settings=ElevenLabsRealtimeSTTService.Settings( vad_silence_threshold_secs=1.0, ), ) ``` ### Notes * **Commit strategies**: Defaults to `manual` commit strategy, where Pipecat's VAD controls when transcription segments are committed. Set `commit_strategy=CommitStrategy.VAD` to let ElevenLabs handle segment boundaries. When using `MANUAL` commit strategy, transcription frames are marked as finalized (`TranscriptionFrame.finalized=True`). * **Keepalive**: Sends silent audio chunks as keepalive to prevent idle disconnections (keepalive interval: 5s, timeout: 10s). * **Auto-reconnect**: Automatically reconnects if the WebSocket connection is closed when new audio arrives. * **Multilingual support**: ElevenLabs Scribe supports 99+ languages. The Realtime service defaults to automatic language detection (`language=None`). To restrict transcription to a specific language, set `language` in settings. ### Event Handlers Supports the standard [service connection events](/api-reference/server/events/service-events): | Event | Description | | ----------------- | --------------------------------------------------- | | `on_connected` | Connected to ElevenLabs Realtime STT WebSocket | | `on_disconnected` | Disconnected from ElevenLabs Realtime STT WebSocket | ```python theme={null} @stt.event_handler("on_connected") async def on_connected(service): print("Connected to ElevenLabs Realtime STT") ``` The `InputParams` / `params=` pattern is deprecated as of v0.0.105. Use `Settings` / `settings=` instead. See the [Service Settings guide](/pipecat/fundamentals/service-settings) for migration details.