xAI Grok Voice Agent for After-Hours Customer Support

Deploy an AI receptionist powered by xAI Grok that answers calls, qualifies leads, and routes urgent issues, all without 24/7 staffing.

typescript nextjs xai grok voice-agent livekit

The problem

Small businesses lose potential customers when calls go unanswered after hours. Hiring 24/7 staff is cost-prohibitive, and basic voicemail cannot qualify a lead or flag an urgent issue in real time.

Built from

Intro

This tutorial walks you through building an AI-powered voice receptionist that answers after-hours calls using xAI Grok, LiveKit, Deepgram, and Cartesia. In about 30 minutes, you will build a backend that receives LiveKit agent-dispatch webhooks, classifies caller intent with keyword routing, enforces per-call AI spend budgets, caches common responses to cut costs, and escalates urgent issues via Twilio SMS. Every stage is instrumented with Langfuse tracing.
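To make "keyword routing" concrete, here is a minimal sketch of how a transcript could be mapped to an intent label. It is not the tutorial's confidence router: the `classifyIntent` function, the `lead` and `general` labels, and the keyword lists are all hypothetical (only the `escalation` label appears in the tutorial's code).

```typescript
// Hypothetical labels; only "escalation" appears in the tutorial's own code.
type IntentLabel = "escalation" | "lead" | "general";

interface IntentResult {
  label: IntentLabel;
  // Fraction of the winning label's keywords found in the transcript (0..1).
  confidence: number;
}

// Illustrative keyword lists, not taken from the tutorial.
const KEYWORDS: Record<Exclude<IntentLabel, "general">, string[]> = {
  escalation: ["emergency", "urgent", "flood", "leak", "outage"],
  lead: ["quote", "pricing", "estimate", "appointment", "schedule"],
};

export function classifyIntent(transcript: string): IntentResult {
  // Lowercase and split into words so matching is case-insensitive.
  const words = new Set(transcript.toLowerCase().match(/[a-z']+/g) ?? []);

  // Fall back to "general" when no keyword list matches at all.
  let best: IntentResult = { label: "general", confidence: 0 };

  for (const label of Object.keys(KEYWORDS) as Array<keyof typeof KEYWORDS>) {
    const list = KEYWORDS[label];
    const hits = list.filter((keyword) => words.has(keyword)).length;
    const confidence = hits / list.length;
    if (confidence > best.confidence) {
      best = { label, confidence };
    }
  }
  return best;
}
```

A transcript such as "There is an urgent leak in my basement" matches two of the five escalation keywords, so this sketch returns the `escalation` label with a confidence of 0.4. A production router would likely weight keywords and handle phrases, which this version does not attempt.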

Prerequisites

Node.js 22+ and pnpm 10 installed
An xAI API key (from the xAI console)
A LiveKit Cloud account with API key and secret (from LiveKit Cloud)
A Deepgram API key (from the Deepgram console)
A Cartesia API key (from Cartesia)
A Twilio account with an SMS-capable phone number, Account SID, and Auth Token (from the Twilio Console)
A Langfuse account with public and secret keys (from Langfuse)
An OpenAI API key (needed by the LLM cache’s OpenAIEmbedder for semantic embeddings)
Familiarity with TypeScript and Next.js App Router basics
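The services in this tutorial read credentials with a `?? ""` fallback, so a missing key only surfaces later as a failed API call. A small startup check can catch that earlier. The sketch below is an assumption, not part of the tutorial's code: `findMissingEnv` and `assertEnv` are hypothetical helpers, and the list contains only the variable names that appear in the tutorial's service code. The names for the xAI, Deepgram, Cartesia, and Langfuse keys are not shown in this section, so they are left out.

```typescript
// Variable names that the tutorial's services read from the environment.
const REQUIRED_ENV = [
  "LIVEKIT_API_KEY",
  "LIVEKIT_API_SECRET",
  "TWILIO_ACCOUNT_SID",
  "TWILIO_AUTH_TOKEN",
  "TWILIO_PHONE_NUMBER",
  "OPENAI_API_KEY",
] as const;

type Env = Record<string, string | undefined>;

// Returns the names that are unset or blank in the given environment object.
export function findMissingEnv(
  env: Env,
  required: readonly string[] = REQUIRED_ENV,
): string[] {
  return required.filter((name) => (env[name] ?? "").trim() === "");
}

// Throws a single error that lists every missing variable.
export function assertEnv(env: Env): void {
  const missing = findMissingEnv(env);
  if (missing.length > 0) {
    throw new Error("Missing environment variables: " + missing.join(", "));
  }
}
```

In a Node.js process you would call `assertEnv(process.env)` once at startup. Taking the environment as a parameter keeps the helper easy to unit test.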

Step 1: Scaffold the Next.js project

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.


174 kB · 99 tests · 100.0% coverage · vitest passing

SHA-256: 112122acae26cc0aa51730aa75ec3d5046fde7d77c2f1e69e6942746781aa797


// src/services/budget-engine.service.ts
import { BudgetController } from "@reaatech/agent-budget-engine";
import { BudgetScope } from "@reaatech/agent-budget-types";
import { SpendStore } from "@reaatech/agent-budget-spend-tracker";
import { createPricingProvider } from "../lib/pricing-provider.js";
import type { BudgetRecord } from "../lib/types.js";

export type CallBudgetManager = {
  controller: BudgetController;
  preFlightCheck: (
    sessionId: string,
    estimatedCost: number,
    modelId: string,
    tools?: string[],
  ) => { allowed: boolean; action: string; suggestedModel?: string; disabledTools?: string[] };
  recordSpend: (
    sessionId: string,
    requestId: string,
    cost: number,
    inputTokens: number,
    outputTokens: number,
    modelId: string,
  ) => void;
  getBudgetState: (sessionId: string) => BudgetRecord;
};

type HardStopEvent = { scopeType: string; scopeKey: string };
type ThresholdBreachEvent = { threshold: number; scopeType: string; scopeKey: string };

export function createBudgetController(): CallBudgetManager {
  const controller = new BudgetController({
    spendTracker: new SpendStore(),
    pricing: createPricingProvider(),
  });

  controller.on("hard-stop", (event: HardStopEvent) => {
    console.error("Budget hard-stop for " + event.scopeType + ":" + event.scopeKey, event);
  });

  controller.on("threshold-breach", (event: ThresholdBreachEvent) => {
    if (event.threshold >= 0.9) {
      console.warn(
        "Budget threshold breached: " + String(event.threshold * 100) + "% for " +
          event.scopeType + ":" + event.scopeKey,
      );
    }
  });

  return {
    controller,

    preFlightCheck(sessionId: string, estimatedCost: number, modelId: string, tools: string[] = []) {
      return controller.check({
        scopeType: BudgetScope.User,
        scopeKey: sessionId,
        estimatedCost,
        modelId,
        tools,
      });
    },

    recordSpend(
      sessionId: string,
      requestId: string,
      cost: number,
      inputTokens: number,
      outputTokens: number,
      modelId: string,
    ) {
      controller.record({
        requestId,
        scopeType: BudgetScope.User,
        scopeKey: sessionId,
        cost,
        inputTokens,
        outputTokens,
        modelId,
        provider: "xai",
        timestamp: new Date(),
      });
    },

    getBudgetState(sessionId: string) {
      const st = controller.getState(BudgetScope.User, sessionId);
      if (st === undefined) {
        return { scopeType: "User", scopeKey: sessionId, spent: 0, limit: 0, state: "unknown" };
      }
      return {
        scopeType: "User",
        scopeKey: sessionId,
        spent: st.spent,
        limit: st.remaining + st.spent,
        state: st.state,
      };
    },
  };
}
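The budget manager above exposes a pre-flight check and a spend recorder, which suggests a check, call, record sequence around every model request. The sketch below shows one way that sequence might look. `runWithinBudget`, `BudgetGate`, and `ModelCall` are hypothetical names introduced for illustration; `BudgetGate` is a structural subset of `CallBudgetManager` so the example stays self-contained.

```typescript
// Structural subset of CallBudgetManager, redeclared so this sketch stands alone.
interface BudgetGate {
  preFlightCheck(
    sessionId: string,
    estimatedCost: number,
    modelId: string,
  ): { allowed: boolean; action: string };
  recordSpend(
    sessionId: string,
    requestId: string,
    cost: number,
    inputTokens: number,
    outputTokens: number,
    modelId: string,
  ): void;
}

// What the caller-supplied model call is assumed to report back.
interface ModelCall<T> {
  result: T;
  cost: number;
  inputTokens: number;
  outputTokens: number;
}

type BudgetedOutcome<T> = { ok: true; result: T } | { ok: false; action: string };

export async function runWithinBudget<T>(
  gate: BudgetGate,
  sessionId: string,
  requestId: string,
  modelId: string,
  estimatedCost: number,
  call: () => Promise<ModelCall<T>>,
): Promise<BudgetedOutcome<T>> {
  // 1. Ask the budget gate before spending anything.
  const verdict = gate.preFlightCheck(sessionId, estimatedCost, modelId);
  if (!verdict.allowed) {
    return { ok: false, action: verdict.action };
  }

  // 2. Run the model call only when the check passes.
  const { result, cost, inputTokens, outputTokens } = await call();

  // 3. Record the actual spend, which may differ from the estimate.
  gate.recordSpend(sessionId, requestId, cost, inputTokens, outputTokens, modelId);
  return { ok: true, result };
}
```

Because `CallBudgetManager` has the same two methods, the object returned by `createBudgetController()` could be passed as the `gate` argument. When the check is denied, the caller decides what to do with the returned `action`, for example ending the call politely or switching to a cached reply.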

// src/services/llm-cache.service.ts
import { CacheEngine, InMemoryAdapter, OpenAIEmbedder } from "@reaatech/llm-cache";
import type { CacheResult } from "@reaatech/llm-cache";

export type LlmCacheManager = {
  cache: CacheEngine;
  getCachedResponse: (prompt: string, model: string, useCase?: string) => Promise<CacheResult>;
  storeResponse: (
    prompt: string,
    response: object,
    model: string,
    inputTokens: number,
    outputTokens: number,
  ) => Promise<void>;
  invalidateCache: (useCase?: string) => Promise<void>;
};

export function createLlmCache(): LlmCacheManager {
  const cache = new CacheEngine({
    storage: new InMemoryAdapter(),
    vectorStorage: new InMemoryAdapter(),
    embedder: new OpenAIEmbedder({
      provider: "openai",
      model: "text-embedding-3-small",
      dimensions: 1536,
      apiKey: process.env.OPENAI_API_KEY ?? "",
    }),
    config: {
      storage: { adapter: "memory" },
      vectorStorage: { adapter: "memory" },
      embedding: {
        provider: "openai",
        model: "text-embedding-3-small",
        dimensions: 1536,
        batchSize: 100,
        maxRetries: 3,
      },
      similarity: { threshold: 0.85, metric: "cosine" as const, maxResults: 5 },
      ttl: {
        default: 3600,
        factual: 1800,
        creative: 7200,
        analytical: 3600,
        sensitive: 600,
        byUseCase: {},
      },
      segmentation: { enabled: true, defaultUseCase: "voice-agent" },
      cost: { enabled: true, currency: "USD" },
      observability: { metrics: true, tracing: false, logging: "info" },
    },
  });

  return {
    cache,

    async getCachedResponse(prompt: string, model: string, useCase = "voice-agent") {
      return cache.get(prompt, { model, useCase });
    },

    async storeResponse(
      prompt: string,
      response: object,
      model: string,
      inputTokens: number,
      outputTokens: number,
    ) {
      await cache.set(prompt, response, { model }, { tokens: { prompt: inputTokens, completion: outputTokens } });
    },

    async invalidateCache(useCase?: string) {
      await cache.invalidate(useCase ? { useCase } : { olderThan: new Date() });
    },
  };
}
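The cache configuration above sets `similarity.metric` to `"cosine"` with a `threshold` of 0.85. As an illustration of what that setting means, the sketch below computes cosine similarity between two embedding vectors and compares it to the threshold. This is not the cache library's internal code: `cosineSimilarity` and `isCacheHit` are hypothetical helpers, and the assumption that a hit means "similarity at or above the threshold" is mine.

```typescript
// Cosine similarity: dot(a, b) / (|a| * |b|). Ranges from -1 to 1.
export function cosineSimilarity(a: readonly number[], b: readonly number[]): number {
  if (a.length !== b.length || a.length === 0) {
    throw new Error("vectors must have the same non-zero length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    const x = a[i] ?? 0;
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  }
  // A zero vector has no direction, so treat it as dissimilar to everything.
  if (normA === 0 || normB === 0) {
    return 0;
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Assumed hit rule: a cached prompt matches when similarity >= threshold.
export function isCacheHit(
  queryEmbedding: readonly number[],
  cachedEmbedding: readonly number[],
  threshold = 0.85,
): boolean {
  return cosineSimilarity(queryEmbedding, cachedEmbedding) >= threshold;
}
```

With two-dimensional toy vectors, `[1, 0]` and `[0.9, 0.1]` are similar enough to count as a hit, while `[1, 0]` and `[1, 1]` (similarity about 0.71) are not. Raising the threshold trades cache hit rate for fewer mismatched answers, which matters for a voice agent where a wrong cached reply is spoken aloud.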

// src/services/agent-handoff.service.ts
import { createHandoffConfig, TypedEventEmitter, withRetry } from "@reaatech/agent-handoff";
import twilio from "twilio";
import type { HandoffRequest } from "../lib/types.js";

type HandoffEvents = {
  "handoff:started": { sessionId: string; channel: string };
  "handoff:completed": { sessionId: string; sid: string };
  "handoff:failed": { sessionId: string; error: string };
};

export type AgentHandoffService = {
  executeHandoff: (request: HandoffRequest) => Promise<{ channel: string; messageSid: string }>;
  emitter: TypedEventEmitter<HandoffEvents>;
};

export function createAgentHandoffService(): AgentHandoffService {
  const emitter = new TypedEventEmitter<HandoffEvents>();
  const twilioClient = twilio(
    process.env.TWILIO_ACCOUNT_SID ?? "",
    process.env.TWILIO_AUTH_TOKEN ?? "",
  );

  createHandoffConfig({
    routing: { minConfidenceThreshold: 0.6 },
  });

  async function sendEscalationSms(to: string, body: string): Promise<string> {
    const message = await withRetry(
      () =>
        twilioClient.messages.create({
          body,
          to,
          from: process.env.TWILIO_PHONE_NUMBER ?? "",
        }),
      {
        maxRetries: 3,
        backoff: "exponential" as const,
        baseDelayMs: 500,
        maxDelayMs: 5000,
        shouldRetry: () => true,
      },
    );
    return message.sid;
  }

  return {
    emitter,

    async executeHandoff(request: HandoffRequest) {
      const channel = request.intent.label === "escalation" ? "sms" : "callback";
      const body =
        "[" + request.intent.label + "] Caller: " + request.callerPhone + "\n\n" + request.transcriptSummary;

      emitter.emit("handoff:started", { sessionId: request.callerPhone, channel });
      try {
        const sid = await sendEscalationSms(request.callerPhone, body);
        emitter.emit("handoff:completed", { sessionId: request.callerPhone, sid });
        return { channel, messageSid: sid };
      } catch (error) {
        emitter.emit("handoff:failed", { sessionId: request.callerPhone, error: String(error) });
        throw error;
      }
    },
  };
}
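The retry options passed to `withRetry` above (`maxRetries: 3`, exponential backoff, `baseDelayMs: 500`, `maxDelayMs: 5000`) imply a schedule of waits between attempts. The sketch below works out that schedule under a common convention: the delay doubles on each retry and is capped at the maximum, with no jitter. Whether the handoff library uses exactly this formula is an assumption; `backoffDelayMs` and `backoffSchedule` are hypothetical helpers.

```typescript
// Delay before retry number `attempt` (1-based), assuming plain doubling with a cap.
export function backoffDelayMs(
  attempt: number,
  baseDelayMs = 500,
  maxDelayMs = 5000,
): number {
  if (!Number.isInteger(attempt) || attempt < 1) {
    throw new RangeError("attempt must be a positive integer");
  }
  const uncapped = baseDelayMs * 2 ** (attempt - 1);
  return Math.min(uncapped, maxDelayMs);
}

// Full list of waits for a given retry count, using the same defaults.
export function backoffSchedule(
  maxRetries = 3,
  baseDelayMs = 500,
  maxDelayMs = 5000,
): number[] {
  return Array.from({ length: maxRetries }, (_, index) =>
    backoffDelayMs(index + 1, baseDelayMs, maxDelayMs),
  );
}
```

Under these assumptions the three retries wait 500 ms, 1000 ms, and 2000 ms, so a failing SMS send gives up after roughly 3.5 seconds of waiting plus request time. The 5000 ms cap only comes into play from the fifth retry onward, which the configured `maxRetries` never reaches.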

// app/api/webhook/voice/route.ts
import { type NextRequest, NextResponse } from "next/server";
import crypto from "node:crypto";
import { WebhookReceiver } from "livekit-server-sdk";
import { createGrokClient } from "../../../../src/lib/grok.js";
import { createIntentRouter } from "../../../../src/services/confidence-router.service.js";
import { createBudgetController } from "../../../../src/services/budget-engine.service.js";
import { createLlmCache } from "../../../../src/services/llm-cache.service.js";
import { createAgentHandoffService } from "../../../../src/services/agent-handoff.service.js";
import { VoiceAgentOrchestrator } from "../../../../src/services/voice-agent.service.js";

function safeEvent(event: object): { evtType: string; agentName: string; metadata: string | undefined } {
  const record = event as Record<string, unknown>;
  return {
    evtType: typeof record.event === "string" ? record.event : "",
    agentName: typeof record.agentName === "string" ? record.agentName : "",
    metadata: typeof record.metadata === "string" ? record.metadata : undefined,
  };
}

export async function POST(req: NextRequest) {
  try {
    const rawBody = await req.text();
    const receiver = new WebhookReceiver(
      process.env.LIVEKIT_API_KEY ?? "",
      process.env.LIVEKIT_API_SECRET ?? "",
    );
    const event = await receiver.receive(rawBody, req.headers.get("Authorization") ?? "");
    const parsed = safeEvent(event);

    if (parsed.evtType !== "agent_dispatch") {
      return NextResponse.json({ handled: false }, { status: 200 });
    }

    const sessionId = crypto.randomUUID();
    let callerPhone = "unknown";
    if (parsed.metadata !== undefined) {
      try {
        const meta = JSON.parse(parsed.metadata) as Record<string, unknown>;
        callerPhone = typeof meta.callerPhone === "string" ? meta.callerPhone : "unknown";
      } catch {
        /* ignore */
      }
    }

    const grok = createGrokClient();
    const intentRouter = createIntentRouter();
    const budgetManager = createBudgetController();
    const cacheManager = createLlmCache();
    const handoffService = createAgentHandoffService();

    const agent = new VoiceAgentOrchestrator(
      grok,
      intentRouter,
      budgetManager,
      cacheManager,
      handoffService,
      sessionId,
      callerPhone,
    );
    agent.handleIncomingCall();

    return NextResponse.json({ sessionId, status: "accepted", agentName: parsed.agentName }, { status: 200 });
  } catch (error) {
    const message = error instanceof Error ? error.message : String(error);
    return NextResponse.json({ error: "dispatch failed", detail: message }, { status: 500 });
  }
}
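The route above parses the dispatch metadata inline and falls back to `"unknown"` when the JSON is malformed or the field is missing. Pulling that logic into a pure function makes it straightforward to unit test. The sketch below is one possible extraction; `parseCallerPhone` is a hypothetical name, and unlike the inline version it also treats a blank string as unknown.

```typescript
// Extracts callerPhone from a JSON metadata string, or returns "unknown".
export function parseCallerPhone(metadata: string | undefined): string {
  if (metadata === undefined) {
    return "unknown";
  }
  try {
    const meta: unknown = JSON.parse(metadata);
    // JSON.parse can return null, arrays, or primitives, so narrow before reading.
    if (typeof meta === "object" && meta !== null) {
      const phone = (meta as Record<string, unknown>).callerPhone;
      if (typeof phone === "string" && phone.trim() !== "") {
        return phone;
      }
    }
  } catch {
    // Malformed JSON falls through to the default below.
  }
  return "unknown";
}
```

Inside the route, the `let callerPhone` block could then shrink to `const callerPhone = parseCallerPhone(parsed.metadata);`. The phone number `+15550100` used when trying this out is a placeholder, not a value from the tutorial.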

Intro

Prerequisites

Step 1: Scaffold the Next.js project

Step 2: Configure environment variables

Step 3: Define shared types with Zod

Step 4: Create the xAI Grok client

Step 5: Build the pricing provider

Step 6: Set up Langfuse observability

Step 7: Wire the confidence router service

Step 8: Wire the budget engine service

Step 9: Wire the LLM cache service

Step 10: Wire the agent handoff service with Twilio

Step 11: Build the VoiceAgentOrchestrator

Step 12: Create the LiveKit webhook route

Step 13: Create the Twilio status callback route

Step 14: Create the barrel export

Step 15: Run the tests

Next steps