Category
Voice agents & realtime AI — phone, support, and live conversation stacks
Vapi, Retell, Bland, LiveKit Agents, Cartesia, Hume, Deepgram, Pipecat, and more: build AI that can pick up the phone.
Voice agents exploded in 2025: chain **STT + LLM + TTS + telephony/WebRTC** to run front-desk, pre-sales, scheduling, and follow-ups. Four things decide the build: **end-to-end tail latency** (target < 700ms), **barge-in and turn-taking**, **regional numbering and compliance** (TCPA, two-party consent), and **hosted vs self-built**. On TTS, judge realism and voice-clone licensing; on STT, stress-test noise and accents.
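To make the latency target concrete, here is a minimal sketch of how per-stage timings could be rolled up into an end-to-end tail figure and compared against the 700 ms target. The stage names, the sample numbers, and the choice of p95 as the "tail" are all assumptions for illustration; substitute measurements from your own call logs.

```python
import math

def percentile(samples, pct):
    """Nearest-rank percentile: smallest sample with at least pct% of values at or below it."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]

def end_to_end_ms(turn):
    """Sum the per-stage latencies (ms) for one conversational turn."""
    return sum(turn.values())

# Hypothetical per-turn measurements in milliseconds (not real benchmark data).
turns = [
    {"stt": 120, "llm": 250, "tts": 110, "transport": 60},
    {"stt": 130, "llm": 310, "tts": 120, "transport": 70},
    {"stt": 110, "llm": 240, "tts": 100, "transport": 55},
    {"stt": 150, "llm": 520, "tts": 140, "transport": 90},
    {"stt": 125, "llm": 260, "tts": 115, "transport": 65},
]

TARGET_MS = 700  # target quoted in the text; p95 as the tail metric is an assumption

totals = [end_to_end_ms(t) for t in turns]
p50 = percentile(totals, 50)
p95 = percentile(totals, 95)
meets_target = p95 < TARGET_MS

print(f"p50={p50} ms, p95={p95} ms, meets target: {meets_target}")
```

In this made-up sample the median turn is comfortably under budget, but a single slow LLM response pushes the tail over it, which is why the tail rather than the average is the number to watch.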
Search and supplementary notes
Vapi vs Retell vs Bland
Vapi and Retell are dev-first and let you swap STT/LLM/TTS. Bland is more batteries-included. Record a real script and run 50 calls through each before committing.
How do you get low latency with barge-in?
You need voice activity detection (VAD) plus full-duplex audio, with streaming at every stage: STT, LLM, and TTS. LiveKit Agents and Pipecat are popular plumbing; Cartesia and ElevenLabs are common picks on the TTS side, where low latency is what makes conversations feel natural.
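The barge-in idea can be sketched with a toy energy-based VAD: while the agent is speaking, a short run of loud caller frames triggers cancellation of playback. This is only an illustration under stated assumptions. The class name `BargeInDetector`, the RMS threshold, and the three-frame debounce are hypothetical, and production stacks typically use a trained VAD model plus echo cancellation rather than a raw energy threshold.

```python
import math
import struct

def frame_rms(pcm_bytes):
    """RMS amplitude of one frame of 16-bit little-endian mono PCM."""
    count = len(pcm_bytes) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack("<%dh" % count, pcm_bytes[:count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)

class BargeInDetector:
    """Signals an interruption when the caller talks over agent playback."""

    def __init__(self, rms_threshold=500.0, min_speech_frames=3):
        self.rms_threshold = rms_threshold          # assumed value; tune per line and codec
        self.min_speech_frames = min_speech_frames  # debounce against clicks and coughs
        self.speech_run = 0                         # consecutive frames above threshold

    def feed(self, pcm_bytes, agent_speaking):
        """Return True if playback should be cancelled after this frame."""
        if frame_rms(pcm_bytes) >= self.rms_threshold:
            self.speech_run += 1
        else:
            self.speech_run = 0
        return agent_speaking and self.speech_run >= self.min_speech_frames

def make_frame(amplitude, n=160):
    """Synthetic constant-amplitude frame (160 samples = 20 ms at 8 kHz)."""
    return struct.pack("<%dh" % n, *([amplitude] * n))

detector = BargeInDetector()
frames = [make_frame(0), make_frame(0), make_frame(2000),
          make_frame(2000), make_frame(2000), make_frame(0)]
decisions = [detector.feed(f, agent_speaking=True) for f in frames]
print(decisions)
```

When `feed` returns True, the surrounding pipeline would stop TTS playback, discard any queued audio, and hand the turn back to streaming STT. The debounce trades a few tens of milliseconds of reaction time for fewer false interruptions.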
Compliance and disclosure duties?
Many jurisdictions require consent before recording or transcription, and some require it from every party on the call. Outbound phone work also involves numbering rules, permitted calling hours, holiday restrictions, and do-not-call lists, so get local counsel.
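As a rough illustration of how these checks might be wired in front of an outbound dialer, the sketch below gates each call attempt on a do-not-call list, a calling window, a holiday set, and whether the call flow includes a recording disclosure. The function name `may_dial`, the 08:00 to 21:00 window, and all sample data are assumptions for illustration, not legal guidance; the actual rules depend on jurisdiction.

```python
from datetime import date, datetime, time, timedelta, timezone

# Assumed policy values for illustration only; real windows vary by jurisdiction.
CALL_WINDOW_START = time(8, 0)
CALL_WINDOW_END = time(21, 0)

def may_dial(number, callee_local_now, do_not_call,
             recording_disclosure_enabled, holidays=frozenset()):
    """Return (allowed, reason) for one outbound call attempt.

    callee_local_now must already be expressed in the callee's local time.
    """
    if number in do_not_call:
        return False, "number is on the do-not-call list"
    if callee_local_now.date() in holidays:
        return False, "holiday in the callee's region"
    if not (CALL_WINDOW_START <= callee_local_now.time() < CALL_WINDOW_END):
        return False, "outside the permitted calling window"
    if not recording_disclosure_enabled:
        return False, "call flow lacks a recording disclosure/consent step"
    return True, "ok"

# Hypothetical data: a fixed UTC-5 offset stands in for the callee's time zone.
CALLEE_TZ = timezone(timedelta(hours=-5))
dnc = {"+15550100"}
morning = datetime(2025, 3, 4, 10, 30, tzinfo=CALLEE_TZ)
late = datetime(2025, 3, 4, 21, 30, tzinfo=CALLEE_TZ)

print(may_dial("+15550101", morning, dnc, True))
print(may_dial("+15550100", morning, dnc, True))
print(may_dial("+15550101", late, dnc, True))
```

Returning a reason alongside the decision makes it easier to log why a call was skipped, which is useful when counsel or an auditor asks how the dialer enforces its rules.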
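Tools in this category
Descriptions and official sites are as given on each tool's detail page; you can browse across entries in the same category.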
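Vapi: developer-first voice agent platform with your choice of STT/LLM/TTS and rentable phone numbers; billed per minute, suited to inbound/outbound calling and scheduling bots.
Retell: voice agent platform that emphasizes low latency and natural interruption handling, with visual flow orchestration and phone number rental, aimed at inbound/outbound call flows.
Bland AI: a widely used AI product. For features, pricing, supported regions, data handling, and the latest models, refer to the official site.
LiveKit Agents: a widely used AI product. For features, pricing, supported regions, data handling, and the latest models, refer to the official site.
Cartesia: low-latency TTS (Sonic series) with a streaming API and custom voices; a common choice for voice agents and audio products that want a natural feel.
Hume AI: a widely used AI product. For features, pricing, supported regions, data handling, and the latest models, refer to the official site.
Deepgram: a widely used AI product. For features, pricing, supported regions, data handling, and the latest models, refer to the official site.
Pipecat: a widely used AI product. For features, pricing, supported regions, data handling, and the latest models, refer to the official site.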