Skip to main content

Overview

SarvamSTTService provides real-time speech recognition using Sarvam AI’s WebSocket API, supporting Indian language transcription with Voice Activity Detection (VAD) and multiple audio formats for high-accuracy speech recognition.

Installation

To use Sarvam services, install the required dependency:
pip install "pipecat-ai[sarvam]"

Prerequisites

Sarvam AI Account Setup

Before using Sarvam STT services, you need:
  1. Sarvam AI Account: Sign up at Sarvam AI
  2. API Key: Generate an API key from your account dashboard
  3. Model Access: Access to Saarika (STT) or Saaras (STT-Translate) models

Required Environment Variables

  • SARVAM_API_KEY: Your Sarvam AI API key for authentication