Simplismart

Overview

SimplismartSTTService is a segmented speech-to-text service that POSTs WAV audio segments to the Simplismart HTTP /predict endpoint and emits TranscriptionFrames. It requires upstream VAD (a VADProcessor or transport/user-aggregator VAD) so speech segments are delimited before transcription.
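To make the segment format concrete, the sketch below wraps raw 16-bit mono PCM in an in-memory WAV container using Python's standard wave module. The helper name pcm_to_wav_bytes and the 16 kHz sample rate are assumptions chosen for illustration. The request payload that the service builds around the WAV data is defined in the source repository and is not reproduced here.

```python
import io
import wave


def pcm_to_wav_bytes(pcm: bytes, sample_rate: int = 16000, channels: int = 1) -> bytes:
    """Wrap raw 16-bit PCM samples in a WAV container (hypothetical helper)."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(pcm)
    # Closing the wave writer finalizes the header but leaves the buffer open.
    return buffer.getvalue()


# One second of silence at 16 kHz stands in for a VAD-delimited speech segment.
segment = pcm_to_wav_bytes(b"\x00\x00" * 16000)
```

A segment produced this way is a complete WAV file in memory, which is the kind of body an HTTP POST to a transcription endpoint would typically carry.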

Source Repository

Source code, examples, and issues for the Simplismart integration

Learn more about Simplismart’s AI platform

Installation

This is a community-maintained package distributed separately from pipecat-ai. It is not published to PyPI, so install it from source:

uv pip install git+https://github.com/simpli-smart/pipecat-simplismart.git

Prerequisites

Simplismart Account Setup

Before using the Simplismart STT service, you need a Simplismart account and an API key. See Simplismart to get started.

Required Environment Variables

SIMPLISMART_API_KEY: Bearer token used to authenticate requests. May be passed directly via the api_key constructor argument instead.
SIMPLISMART_STT_URL (optional): Full URL for the STT endpoint. Defaults to https://api.simplismart.live/predict.
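The fallback order described above can be sketched as a small standalone function. This is an illustration, not the package's code: resolve_simplismart_config is a hypothetical helper, and raising ValueError when no key is found is an assumption about reasonable behavior rather than something the integration is documented to do.

```python
import os

DEFAULT_STT_URL = "https://api.simplismart.live/predict"


def resolve_simplismart_config(api_key=None, base_url=None, env=None):
    """Hypothetical helper mirroring the documented fallback order."""
    env = os.environ if env is None else env
    # An explicit argument wins, then the environment variable.
    resolved_key = api_key or env.get("SIMPLISMART_API_KEY")
    if not resolved_key:
        # Assumption: fail early instead of sending an unauthenticated request.
        raise ValueError("Set SIMPLISMART_API_KEY or pass api_key explicitly.")
    # Explicit argument, then SIMPLISMART_STT_URL, then the documented default.
    resolved_url = base_url or env.get("SIMPLISMART_STT_URL") or DEFAULT_STT_URL
    return resolved_key, resolved_url
```

Calling resolve_simplismart_config() with only SIMPLISMART_API_KEY set returns that key together with the default predict URL.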

Configuration

api_key (str, default: None): Bearer token. Falls back to the SIMPLISMART_API_KEY environment variable if not provided.
base_url (str, default: None): Full URL to the predict endpoint. Falls back to the SIMPLISMART_STT_URL environment variable, then to https://api.simplismart.live/predict.
Shared session (aiohttp.ClientSession, default: None): Optional shared aiohttp session. If not provided, the service creates and owns its own session.
Sample rate (int, default: None): Input audio sample rate. Usually supplied by the pipeline StartFrame.
settings (SimplismartSTTService.Settings, default: None): Runtime-configurable STT settings. See Settings below.

Settings

Runtime-configurable settings passed via the settings constructor argument using SimplismartSTTService.Settings(...). The settings dataclass extends Pipecat’s common STTSettings (which includes model and language).

Parameter	Type	Default	Description
`vad_filter`	`bool`	`True`	Enable server-side VAD filtering when supported.
`vad_onset`	`float`	`0.5`	VAD onset threshold.
`beam_size`	`int`	`4`	Beam search size for decoding.
`temperature`	`float`	`0.0`	Decoding temperature.
`strict_hallucination_reduction`	`bool`	`True`	Ask the server to apply extra anti-hallucination logic (Whisper).

The default model is openai/whisper-large-v3-turbo and the default language is Language.EN.

See the source repository for the authoritative, up-to-date list of settings and defaults.
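As a minimal sketch, the settings in the table might be passed like this. It assumes the pipecat-simplismart package is installed, that the Settings fields match the names listed above, and that Language is imported from pipecat.transcriptions.language. The values shown simply restate the documented defaults, so verify names and defaults against the source repository before relying on them.

```python
from pipecat.transcriptions.language import Language
from pipecat_simplismart import SimplismartSTTService

# Field names are taken from the settings table above; treat them as
# assumptions until checked against the installed package version.
stt = SimplismartSTTService(
    api_key="YOUR_KEY",
    settings=SimplismartSTTService.Settings(
        model="openai/whisper-large-v3-turbo",
        language=Language.EN,
        vad_filter=True,
        beam_size=4,
        temperature=0.0,
    ),
)
```

Omitted fields keep their defaults, so in practice only the values that differ from the table need to be passed.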

Usage

Place a VADProcessor before SimplismartSTTService so VAD events reach the segmented STT layer.

from pipecat.audio.vad.silero import SileroVADAnalyzer
from pipecat.processors.audio.vad_processor import VADProcessor
from pipecat_simplismart import SimplismartSTTService

vad_processor = VADProcessor(vad_analyzer=SileroVADAnalyzer())
stt = SimplismartSTTService(
    api_key="YOUR_KEY",
    base_url="https://api.simplismart.live/predict",
)

# pipeline: ... transport.input(), vad_processor, stt, ...

The service outputs TranscriptionFrames.

Compatibility

Tested with Pipecat v1.1.0 (pipecat-ai>=0.0.86). Check the source repository for the latest tested version and changelog.
