Speech-to-Text
Speech-to-Text services receive and audio input and output transcriptions.| Service | Repository | Maintainer(s) |
|---|---|---|
| No community integrations yet |
Large Language Models
LLMs receive text or audio based input and output a streaming text response.| Service | Repository | Maintainer(s) |
|---|---|---|
| No community integrations yet |
Text-to-Speech
Text-to-Speech services receive text input and output audio streams or chunks.| Service | Repository | Maintainer(s) |
|---|---|---|
| No community integrations yet |
Video
Video services enable you to build an avatar where audio and video are synchronized.| Service | Repository | Maintainer(s) |
|---|---|---|
| Beyond Presence | https://github.com/bey-dev/pipecat-bey | bey-dev |
Telephony Serializers
Serializers convert between frames and media streams, enabling real-time communication over a websocket.| Service | Repository | Maintainer(s) |
|---|---|---|
| No community integrations yet |
Image Generation
Image generation services receive text inputs and output images.| Service | Repository | Maintainer(s) |
|---|---|---|
| No community integrations yet |
Vision
Vision services receive a streaming video input and output text describing the video input.| Service | Repository | Maintainer(s) |
|---|---|---|
| No community integrations yet |