Skip to main content

Transports

Transports exchange audio and video streams between the user and bot.
ServiceSetup
DailyTransportuv add "pipecat-ai[daily]"
FastAPIWebSocketTransportuv add "pipecat-ai[websocket]"
HeyGenTransportuv add "pipecat-ai[heygen]"
LemonSliceTransportuv add "pipecat-ai[lemonslice]"
LiveKitTransportuv add "pipecat-ai[livekit]"
SmallWebRTCTransportuv add "pipecat-ai[webrtc]"
TavusTransportuv add "pipecat-ai[tavus]"
WebSocket Transportsuv add "pipecat-ai[websocket]"
WhatsAppTransportuv add "pipecat-ai[webrtc]"

Serializers

Serializers convert between frames and media streams, enabling real-time communication over a websocket.
ServiceSetup
ExotelNo dependencies required
GenesysNo dependencies required
PlivoNo dependencies required
TelnyxNo dependencies required
TwilioNo dependencies required
VonageNo dependencies required

Speech-to-Text

Speech-to-Text services receive and audio input and output transcriptions.
ServiceSetup
AssemblyAIuv add "pipecat-ai[assemblyai]"
AWS Transcribeuv add "pipecat-ai[aws]"
Azureuv add "pipecat-ai[azure]"
Cartesiauv add "pipecat-ai[cartesia]"
Deepgramuv add "pipecat-ai[deepgram]"
ElevenLabsuv add "pipecat-ai[elevenlabs]"
Fal Wizperuv add "pipecat-ai[fal]"
Gladiauv add "pipecat-ai[gladia]"
Googleuv add "pipecat-ai[google]"
Gradiumuv add "pipecat-ai[gradium]"
Groq (Whisper)uv add "pipecat-ai[groq]"
NVIDIAuv add "pipecat-ai[nvidia]"
OpenAIuv add "pipecat-ai[openai]"
Sarvamuv add "pipecat-ai[sarvam]"
Smallestuv add "pipecat-ai[smallest]"
Sonioxuv add "pipecat-ai[soniox]"
Speechmaticsuv add "pipecat-ai[speechmatics]"
Whisperuv add "pipecat-ai[whisper]"

Large Language Models

LLMs receive text or audio based input and output a streaming text response.
ServiceSetup
Anthropicuv add "pipecat-ai[anthropic]"
AWS Bedrockuv add "pipecat-ai[aws]"
Azureuv add "pipecat-ai[azure]"
Cerebrasuv add "pipecat-ai[cerebras]"
DeepSeekuv add "pipecat-ai[deepseek]"
Fireworks AIuv add "pipecat-ai[fireworks]"
Google Geminiuv add "pipecat-ai[google]"
Google Vertex AIuv add "pipecat-ai[google]"
Grokuv add "pipecat-ai[grok]"
Groquv add "pipecat-ai[groq]"
Mistraluv add "pipecat-ai[mistral]"
Nebiusuv add "pipecat-ai[nebius]"
Novita AIuv add "pipecat-ai[novita]"
NVIDIAuv add "pipecat-ai[nvidia]"
Ollamauv add "pipecat-ai[ollama]"
OpenAIuv add "pipecat-ai[openai]"
OpenAI Responsesuv add "pipecat-ai[openai]"
OpenRouteruv add "pipecat-ai[openrouter]"
Perplexityuv add "pipecat-ai[perplexity]"
Qwenuv add "pipecat-ai[qwen]"
SambaNovauv add "pipecat-ai[sambanova]"
Sarvamuv add "pipecat-ai[sarvam]"
Together AIuv add "pipecat-ai[together]"

Text-to-Speech

Text-to-Speech services receive text input and output audio streams or chunks.
ServiceSetup
Asyncuv add "pipecat-ai[asyncai]"
AWS Pollyuv add "pipecat-ai[aws]"
Azureuv add "pipecat-ai[azure]"
Camb AIuv add "pipecat-ai[camb]"
Cartesiauv add "pipecat-ai[cartesia]"
Deepgramuv add "pipecat-ai[deepgram]"
ElevenLabsuv add "pipecat-ai[elevenlabs]"
Fishuv add "pipecat-ai[fish]"
Googleuv add "pipecat-ai[google]"
Gradiumuv add "pipecat-ai[gradium]"
Groquv add "pipecat-ai[groq]"
Humeuv add "pipecat-ai[hume]"
InworldNo dependencies required
Kokorouv add "pipecat-ai[kokoro]"
LMNTuv add "pipecat-ai[lmnt]"
MiniMaxNo dependencies required
Mistraluv add "pipecat-ai[mistral]"
Neuphonicuv add "pipecat-ai[neuphonic]"
NVIDIAuv add "pipecat-ai[nvidia]"
OpenAIuv add "pipecat-ai[openai]"
PiperNo dependencies required
ResembleAIuv add "pipecat-ai[resemble]"
Rimeuv add "pipecat-ai[rime]"
SarvamNo dependencies required
Smallest AIuv add "pipecat-ai[smallest]"
Speechmaticsuv add "pipecat-ai[speechmatics]"
xAIuv add "pipecat-ai[xai]"
XTTSuv add "pipecat-ai[xtts]"

Speech-to-Speech

Speech-to-Speech services are multi-modal LLM services that take in audio, video, or text and output audio or text.
ServiceSetup
AWS Nova Sonicuv add "pipecat-ai[aws-nova-sonic]"
Gemini Liveuv add "pipecat-ai[google]"
Gemini Live Vertex AIuv add "pipecat-ai[google]"
Grok Voice Agentuv add "pipecat-ai[grok]"
Inworld Realtimeuv add "pipecat-ai[inworld]"
OpenAI Realtimeuv add "pipecat-ai[openai]"
Ultravoxuv add "pipecat-ai[ultravox]"

Image Generation

Image generation services receive text inputs and output images.
ServiceSetup
Azureuv add "pipecat-ai[azure]"
faluv add "pipecat-ai[fal]"
Googleuv add "pipecat-ai[google]"
OpenAIuv add "pipecat-ai[openai]"

Video

Video services enable you to build an avatar where audio and video are synchronized.
ServiceSetup
HeyGenuv add "pipecat-ai[heygen]"
Simliuv add "pipecat-ai[simli]"
Tavusuv add "pipecat-ai[tavus]"

Memory

Memory services can be used to store and retrieve conversations.
ServiceSetup
mem0uv add "pipecat-ai[mem0]"

Vision

Vision services receive a streaming video input and output text describing the video input.
ServiceSetup
Moondreamuv add "pipecat-ai[moondream]"

Analytics & Monitoring

Analytics services help you better understand how your service operates.
ServiceSetup
Sentryuv add "pipecat-ai[sentry]"