Overview

ElevenLabsSTTService provides speech recognition through ElevenLabs’ Speech-to-Text API using segmented audio processing. The service uploads each audio segment to ElevenLabs as a file and receives the transcription directly in the response. It supports multiple languages and audio event tagging.

Installation

To use ElevenLabs STT services, install the required dependencies:
pip install "pipecat-ai[elevenlabs]"

Prerequisites

ElevenLabs Account Setup

Before using ElevenLabs STT services, you need:
  1. ElevenLabs Account: Sign up on the ElevenLabs platform
  2. API Key: Generate an API key from your account dashboard
  3. Model Access: Ensure access to the Scribe v1 transcription model
  4. HTTP Session: Configure an aiohttp session for file uploads
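
The HTTP session requirement can be sketched as follows. This is a minimal illustration, not the definitive setup: the import path pipecat.services.elevenlabs.stt and the keyword arguments api_key and aiohttp_session are assumptions about the installed Pipecat version, so check them against your release.

```python
import asyncio
import os


async def main() -> None:
    # Imported inside the function so this file can still be loaded when the
    # optional "pipecat-ai[elevenlabs]" extra has not been installed yet.
    import aiohttp

    # Assumed import path; it may differ between Pipecat versions.
    from pipecat.services.elevenlabs.stt import ElevenLabsSTTService

    # One shared session handles the file uploads. It must stay open for as
    # long as the service is in use, so everything runs inside this block.
    async with aiohttp.ClientSession() as session:
        stt = ElevenLabsSTTService(
            api_key=os.environ["ELEVENLABS_API_KEY"],
            aiohttp_session=session,  # assumed parameter name
        )
        # Add `stt` to your pipeline here (pipeline setup is not shown).


if __name__ == "__main__":
    asyncio.run(main())
```

Creating the session with async with ensures it is closed when the application shuts down, which avoids unclosed-session warnings from aiohttp.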

Required Environment Variables

  • ELEVENLABS_API_KEY: Your ElevenLabs API key for authentication
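
A small sketch of reading this variable at startup so that a missing key fails early with a clear message. The helper name load_elevenlabs_api_key is hypothetical and not part of Pipecat or the ElevenLabs SDK; it uses only the Python standard library.

```python
import os
from typing import Mapping, Optional


def load_elevenlabs_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the ElevenLabs API key, or raise if it is missing or blank.

    Hypothetical helper for illustration; `env` defaults to os.environ and
    can be replaced with a plain dict in tests.
    """
    source = os.environ if env is None else env
    key = source.get("ELEVENLABS_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "ELEVENLABS_API_KEY is not set; export it before starting the service."
        )
    return key
```

The returned string can then be passed to the service as its API key instead of reading the environment in several places.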