voice agent · production

Soniq

Voice agents that book the table, not the meeting after.

Sub-second turn latency. Multi-tenant by default. Production voice infrastructure for businesses that answer the phone: restaurants, clinics, salons, trades. Ship a branded agent the same day.

view on github see how it works

~450ms v2v·multi-tenant·7 presets·livekit sip

soniq

Deepgram

Cartesia

LiveKit

Supabase

Gemini

OpenAI

Groq

Hono

Next.js

Python

latency budget

450ms
voice to voice.
end to end.

Sub-second turn latency is what separates an agent that feels like a person from an agent that feels like a phone tree. Every component of the pipeline is streaming. Nothing waits for completion before passing to the next stage.

→Streaming STT, not record-then-transcribe.
→Token streaming LLM, first token in ~200ms.
→Streaming TTS, ~40ms first audio byte.
→Barge-in detection so users can interrupt naturally.

voice-to-voice budget · streaminglive · per turn

STT

LLM

TOOL

TTS

STTDeepgram Nova-3150ms

LLMOpenAI200ms

TOOLcalendar / faq60ms

TTSCartesia Sonic-340ms

total

~450ms

sub-second turn latency

capabilities

Production voice. Not a demo.

Six things that decide whether a voice agent survives contact with real users on a real phone network.

Multi-tenant

Database-level tenant isolation. Ship a white-label voice agent the same day.

Each tenant runs on a custom system prompt, dedicated phone number, industry preset, and isolated data scope. White-label ready: your logo, your domain, your billing surface. Supabase row-level security enforces the boundary at the data layer, not just the application.

rls · per-tenant prompts · custom numbers · domain mapping

Multi-provider LLM

Gemini, OpenAI, Groq, with fallback.

Provider goes down, traffic reroutes in under 200ms. No conversation interruption. Track quality and latency per provider, route by tenant or by task.

200ms failover · per-tenant routing

7 industry presets

Domain-trained agents out of the box.

Hotels, restaurants, medical, dental, salons, and auto service. Each preset includes prompt, escalation rules, and tool inventory tuned for the vertical.

instant deploy · preset library

Live observability

Supervisor dashboard. Live transcript. Mid-call intervention.

Watch the call as it happens. See the LLM's reasoning, every tool call, sentiment trajectory. Take over mid-conversation if the agent gets stuck.

live transcript · sentiment · takeover

Tool execution

Calendar, FAQ, routing, voicemail, built in.

Google Calendar, Outlook, and Calendly sync. FAQ retrieval over a per-tenant knowledge base. Smart routing using full conversation context. Voicemail transcription with sentiment.

calendar · faq · routing · transcripts

Phone-tuned audio

Built for the production phone network, not a browser demo.

Deepgram Nova-3 streaming STT tuned for phone codecs. Cartesia Sonic-3 streaming TTS with ~40ms first-byte. LiveKit handles SIP trunking and barge-in detection. VAD calibrated for noisy environments. The audio path that decides whether the call sounds like a person or a phone tree.

deepgram nova-3 · cartesia sonic-3 · livekit sip · barge-in

architecture

Three services. One pipeline.

An edge API orchestrates the call. A Python worker runs the voice pipeline on LiveKit. A multi-tenant dashboard observes everything in real time.

caller

PSTN · SIP trunk

livekit

WebRTC · barge-in · trunking

tenant

prompt · number · industry

Python · LiveKit worker

soniq-agent

·VAD + barge-in detection
·Pipes audio frames to STT
·Streams LLM tokens to TTS
·Runs LLM tool calls

Hono · TypeScript

soniq-api

·Pipeline orchestration
·Per-tenant prompt + tools
·Scheduled jobs · webhooks
·Business logic + CRM

Next.js 15 · React 19

soniq-dashboard

·Live call monitoring
·Tenant config · presets
·Deals · contacts · tasks
·Mid-call takeover surface

Deepgram Nova-3

streaming STT · ~150ms

OpenAI tool-calling

LLM · ~200ms TTFT

Cartesia Sonic-3

streaming TTS · ~40ms

supabasePostgres · realtime channels · row-level security per tenant

industry presets

7 verticals. Same-day deploy.

Each preset includes a domain-tuned prompt, escalation rules, tool inventory, and a baseline knowledge base. Pick a preset, set a phone number, ship.

HotelsMotelsRestaurantsMedicalDentalSalonsAuto Service

stack

Best of breed at every layer.

Deepgram Nova-3

Streaming STT

Cartesia Sonic-3

Streaming TTS

Gemini / OpenAI / Groq

LLM with fallback

LiveKit

SIP + WebRTC

Hono

Edge API

Next.js 15

Dashboard