NVIDIA Riva

Overview

NvidiaTTSService provides high-quality text-to-speech synthesis through NVIDIA Riva’s cloud-based AI models accessible via gRPC API. The service offers multilingual support, configurable quality settings, and streaming audio generation optimized for real-time applications.

NVIDIA Riva TTS API Reference

Pipecat’s API methods for NVIDIA Riva TTS integration

Example Implementation

Complete example with Riva NIM

NVIDIA Riva Documentation

Official NVIDIA Riva TTS documentation

NVIDIA Developer Portal

Access API keys and Riva services

Installation

To use NVIDIA Riva services, install the required dependencies:

pip install "pipecat-ai[nvidia]"

Prerequisites

NVIDIA Riva Setup

Before using Riva TTS services, you need:

NVIDIA Developer Account: Sign up at NVIDIA Developer Portal
API Key: Generate an NVIDIA API key for Riva services
Riva Access: Ensure access to NVIDIA Riva TTS services

Required Environment Variables

NVIDIA_API_KEY: Your NVIDIA API key for authentication

Configuration

NvidiaTTSService

api_key

str

required

NVIDIA API key for authentication.

server

str

default:"grpc.nvcf.nvidia.com:443"

gRPC server endpoint.

voice_id

str

default:"Magpie-Multilingual.EN-US.Aria"

Voice model identifier.

sample_rate

int

default:"None"

Audio sample rate in Hz. When None, uses the pipeline’s configured sample rate.

model_function_map

dict

Dictionary containing function_id and model_name for the TTS model.

use_ssl

bool

default:"True"

Whether to use SSL for the NVIDIA Riva server connection.

params

InputParams

default:"None"

Runtime-configurable synthesis settings. See InputParams below.

InputParams

Parameter	Type	Default	Description
`language`	`Language`	`Language.EN_US`	Language code for synthesis.
`quality`	`int`	`20`	Audio quality setting (0-100).

Usage

Basic Setup

from pipecat.services.nvidia import NvidiaTTSService

tts = NvidiaTTSService(
    api_key=os.getenv("NVIDIA_API_KEY"),
)

With Custom Voice and Quality

from pipecat.services.nvidia import NvidiaTTSService
from pipecat.transcriptions.language import Language

tts = NvidiaTTSService(
    api_key=os.getenv("NVIDIA_API_KEY"),
    voice_id="Magpie-Multilingual.EN-US.Aria",
    model_function_map={
        "function_id": "877104f7-e885-42b9-8de8-f6e4c6303969",
        "model_name": "magpie-tts-multilingual",
    },
    params=NvidiaTTSService.InputParams(
        language=Language.EN_US,
        quality=40,
    ),
)

Notes

gRPC-based: NVIDIA Riva uses gRPC (not HTTP or WebSocket) for communication with the TTS service.
Model cannot be changed after initialization: The model and function ID must be set during construction via model_function_map. Calling set_model() after initialization will log a warning and have no effect.
SSL enabled by default: The service connects to NVIDIA’s cloud endpoint with SSL. Set use_ssl=False only for local or custom Riva deployments.
Blocking gRPC calls: Audio generation uses asyncio.to_thread to avoid blocking the event loop, since the underlying Riva client uses synchronous gRPC calls.

API Reference

Services

Utilities

Frameworks

Pipeline

Overview

NVIDIA Riva TTS API Reference

Example Implementation

NVIDIA Riva Documentation

NVIDIA Developer Portal

Installation

Prerequisites

NVIDIA Riva Setup

Required Environment Variables

Configuration

NvidiaTTSService

InputParams

Usage

Basic Setup

With Custom Voice and Quality

Notes

API Reference

Services

Utilities

Frameworks

Pipeline

​Overview

NVIDIA Riva TTS API Reference

Example Implementation

NVIDIA Riva Documentation

NVIDIA Developer Portal

​Installation

​Prerequisites

​NVIDIA Riva Setup

​Required Environment Variables

​Configuration

​NvidiaTTSService

​InputParams

​Usage

​Basic Setup

​With Custom Voice and Quality

​Notes

Overview

Installation

Prerequisites

NVIDIA Riva Setup

Required Environment Variables

Configuration

NvidiaTTSService

InputParams

Usage

Basic Setup

With Custom Voice and Quality

Notes