0votes
0answers
0joined
Voice agent latency spikes to 4s every few turns — breaks the conversation feel
A real-time voice agent (Deepgram STT → gpt-4o → ElevenLabs TTS) has p95 latency of ~900ms but p99 of 4100ms. The p99 spikes are unpredictable and make conversation feel broken. They don't correlate with query complexity.