> ## Documentation Index > Fetch the complete documentation index at: https://docs.pipecat.ai/llms.txt > Use this file to discover all available pages before exploring further. # Gemini Live Vertex AI > A real-time, multimodal conversational AI service powered by Google's Gemini via Vertex AI ## Overview `GeminiLiveVertexLLMService` enables natural, real-time conversations with Google's Gemini model through Vertex AI. It provides built-in audio transcription, voice activity detection, and context management for creating interactive AI experiences with multimodal capabilities including audio, video, and text processing. Want to start building? Check out our [Gemini Live Guide](/pipecat/features/gemini-live) for general concepts, then follow the Vertex AI-specific setup below. Pipecat's API methods for Gemini Live Vertex AI integration Complete Gemini Live Vertex AI function calling example Official Vertex AI Gemini Live API documentation Gemini Live available models ## Installation To use Gemini Live Vertex AI services, install the required dependencies: ```bash theme={null} uv add "pipecat-ai[google]" ``` ## Prerequisites ### Google Cloud Setup Before using Gemini Live Vertex AI services, you need: 1. **Google Cloud Project**: Set up a project in the [Google Cloud Console](https://console.cloud.google.com/) 2. **Vertex AI API**: Enable the Vertex AI API in your project 3. **Service Account**: Create a service account with `roles/aiplatform.user` and `roles/ml.developer` permissions 4. **Authentication**: Set up service account credentials or Application Default Credentials ### Required Environment Variables * `GOOGLE_VERTEX_TEST_CREDENTIALS`: JSON string of service account credentials (optional if using ADC) * `GOOGLE_CLOUD_PROJECT_ID`: Your Google Cloud project ID * `GOOGLE_CLOUD_LOCATION`: Vertex AI region (e.g., "us-east4") ### Key Features * **Enterprise Authentication**: Secure service account-based authentication * **Multimodal Processing**: Handle audio, video, and text inputs simultaneously * **Real-time Streaming**: Low-latency audio and video processing * **Voice Activity Detection**: Automatic speech detection and turn management * **Function Calling**: Advanced tool integration and API calling capabilities * **Context Management**: Intelligent conversation history and system instruction handling ## Configuration ### GeminiLiveVertexLLMService This service extends `GeminiLiveLLMService` with Vertex AI authentication. It accepts all the same parameters as the [Gemini Live](/api-reference/server/services/s2s/gemini-live) service, with these differences: JSON string of Google service account credentials. If not provided, falls back to `credentials_path` or Application Default Credentials (ADC). Path to a service account JSON file. Used if `credentials` is not provided. GCP region for the Vertex AI endpoint (e.g., `"us-east4"`). Google Cloud project ID. Vertex AI model identifier to use. *Deprecated in v0.0.105. Use `settings=GeminiLiveVertexLLMService.Settings(model=...)` instead.* TTS voice identifier for audio responses. *Deprecated in v0.0.105. Use `settings=GeminiLiveVertexLLMService.Settings(voice=...)` instead.* System prompt for the model. Can also be provided via the LLM context. Tools available to the model: a `ToolsSchema`, a plain list of direct functions and/or `FunctionSchema` objects, or a list of provider-native tool dicts. Can also be provided via the LLM context. Runtime-configurable generation and session settings. See the [Gemini Live InputParams](/api-reference/server/services/s2s/gemini-live#settings) for details. *Deprecated in v0.0.105. Use `settings=GeminiLiveVertexLLMService.Settings(...)` instead.* Runtime-configurable settings. See the [Gemini Live Settings](/api-reference/server/services/s2s/gemini-live#settings) for the full reference. Whether to start with audio input paused. Whether to start with video input paused. Whether to generate a response when context is first set. Set to `False` to wait for user input before the model responds. HTTP options for the Google API client. ### Settings The Vertex AI variant uses the same Settings as the base Gemini Live service. See [Gemini Live Settings](/api-reference/server/services/s2s/gemini-live#settings) for the full reference. ## Usage Pair this service with `LLMContextAggregatorPair(context, realtime_service_mode=True)`. Realtime mode keeps context-writing correct for speech-to-speech services and adapts turn handling to the service. See [Realtime (Speech-to-Speech) Services](/api-reference/server/utilities/turn-management/external-turn-management#realtime-speech-to-speech-services). ### Basic Setup with Service Account Credentials ```python theme={null} import os from pipecat.services.google.gemini_live import GeminiLiveVertexLLMService llm = GeminiLiveVertexLLMService( credentials=os.getenv("GOOGLE_VERTEX_TEST_CREDENTIALS"), project_id=os.getenv("GOOGLE_CLOUD_PROJECT_ID"), location=os.getenv("GOOGLE_CLOUD_LOCATION"), settings=GeminiLiveVertexLLMService.Settings( voice="Charon", system_instruction="You are a helpful assistant.", ), ) ``` ### With Credentials File ```python theme={null} llm = GeminiLiveVertexLLMService( credentials_path="/path/to/service-account.json", project_id="my-gcp-project", location="us-east4", settings=GeminiLiveVertexLLMService.Settings( voice="Puck", system_instruction="You are a helpful assistant.", ), ) ``` ### Using Application Default Credentials (ADC) ```python theme={null} # When running on GCP or with gcloud auth application-default login llm = GeminiLiveVertexLLMService( project_id="my-gcp-project", location="us-east4", settings=GeminiLiveVertexLLMService.Settings( system_instruction="You are a helpful assistant.", ), ) ``` ### With Settings ```python theme={null} from pipecat.services.google.gemini_live import GeminiVADParams llm = GeminiLiveVertexLLMService( credentials=os.getenv("GOOGLE_VERTEX_TEST_CREDENTIALS"), project_id=os.getenv("GOOGLE_CLOUD_PROJECT_ID"), location="us-east4", settings=GeminiLiveVertexLLMService.Settings( model="google/gemini-live-2.5-flash-native-audio", voice="Charon", system_instruction="You are a helpful assistant.", temperature=0.7, max_tokens=2048, vad=GeminiVADParams( silence_duration_ms=500, ), ), ) ``` The `InputParams` / `params=` pattern is deprecated as of v0.0.105. Use `Settings` / `settings=` instead. See the [Service Settings guide](/pipecat/fundamentals/service-settings) for migration details. ## Notes * **No `api_key` parameter**: Unlike the base `GeminiLiveLLMService`, Vertex AI uses service account credentials or ADC for authentication. Passing `api_key` will raise a `ValueError`. * **Authentication priority**: The service tries credentials in this order: (1) `credentials` JSON string, (2) `credentials_path` file, (3) Application Default Credentials (ADC). * **File API not supported**: The Gemini File API is not available through Vertex AI. Use Google Cloud Storage for file handling instead. * **Model naming**: Vertex AI uses different model identifiers (e.g., `"google/gemini-live-2.5-flash-native-audio"`) compared to the Google AI variant. * **Async tool limitation**: Vertex AI's Gemini Live endpoint does not currently support NON\_BLOCKING tool calls. Functions registered with `cancel_on_interruption=False` will log a one-time warning and fall back to synchronous behavior (the conversation pauses while the tool runs). Use `cancel_on_interruption=True` (the default) or use a non-realtime LLM service if your tool requires async semantics. * **All other features** (VAD, context compression, thinking, function calling, etc.) work identically to the base [Gemini Live](/api-reference/server/services/s2s/gemini-live) service.