Mistral AI Voice Agent for After-Hours Customer Support

A voice AI receptionist that handles after-hours calls, answers FAQs, and books appointments using Mistral's LLM.

The problem

SMBs lose business when customers call outside business hours. They can't afford 24/7 staff, and simple IVRs frustrate callers.

Intro

This tutorial walks you through building a voice AI receptionist that answers after-hours phone calls, understands natural speech, responds intelligently using Mistral AI, and books appointments on Google Calendar. You’ll wire up Twilio telephony, Deepgram speech-to-text, ElevenLabs text-to-speech, session continuity with agent memory, and observability via Langfuse — orchestrated by a Fastify server with Next.js App Router webhook endpoints.
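As a rough orientation, one conversational turn in this architecture can be sketched as three awaited stages: speech-to-text, the LLM reply, and text-to-speech. The `TurnDeps` interface and `handleTurn` function below are hypothetical names used only for illustration; the actual providers are wired up in the later steps.

```typescript
// Hypothetical provider interfaces, for illustration only. The tutorial's real
// providers wrap Deepgram (STT), Mistral (LLM), and ElevenLabs (TTS).
interface TurnDeps {
  transcribe(audio: Uint8Array): Promise<string>;
  reply(text: string, sessionId: string): Promise<string>;
  synthesize(text: string): Promise<Uint8Array>;
}

interface TurnResult {
  transcript: string;
  replyText: string;
  audio: Uint8Array;
}

// One conversational turn: caller audio in, agent audio out.
export async function handleTurn(
  sessionId: string,
  callerAudio: Uint8Array,
  deps: TurnDeps,
): Promise<TurnResult> {
  const transcript = await deps.transcribe(callerAudio);
  if (transcript.trim() === "") {
    // Nothing intelligible was heard, so skip the LLM and TTS calls.
    return { transcript, replyText: "", audio: new Uint8Array(0) };
  }
  const replyText = await deps.reply(transcript, sessionId);
  const audio = await deps.synthesize(replyText);
  return { transcript, replyText, audio };
}
```

Passing the providers in as a parameter keeps the turn logic testable with stubs, without any network access.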

Prerequisites

Node.js 22+ and pnpm 10 installed
A Twilio account with a phone number that has Media Streams enabled
Deepgram API key (Nova-2 model)
ElevenLabs API key (Turbo v2.5 model)
Mistral AI API key
Google Cloud service account with Calendar API enabled (for appointment booking)
OpenAI API key (optional — used by the agent memory module for embeddings and fact extraction)
Langfuse account (optional — for observability)
Familiarity with TypeScript and async/await patterns
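Since several of these keys are required and others are optional, it can help to validate them once at startup. The following is a minimal sketch, not the tutorial's own configuration module: apart from `MISTRAL_API_KEY`, which appears in the code further down, every environment variable name here is an assumption, and only a subset of the prerequisites is covered.

```typescript
// Sketch of startup validation. Variable names other than MISTRAL_API_KEY
// are assumptions made for this example.
export interface AppConfig {
  mistralApiKey: string;
  deepgramApiKey: string;
  elevenLabsApiKey: string;
  openAiApiKey: string | undefined; // optional: agent memory
  langfuseSecretKey: string | undefined; // optional: observability
}

type Env = Record<string, string | undefined>;

// Returns the trimmed value, or records the name as missing.
function required(env: Env, name: string, missing: string[]): string {
  const value = env[name]?.trim();
  if (!value) {
    missing.push(name);
    return "";
  }
  return value;
}

// Returns the trimmed value, or undefined when unset or blank.
function optional(env: Env, name: string): string | undefined {
  const value = env[name]?.trim();
  return value ? value : undefined;
}

export function loadConfig(env: Env): AppConfig {
  const missing: string[] = [];
  const config: AppConfig = {
    mistralApiKey: required(env, "MISTRAL_API_KEY", missing),
    deepgramApiKey: required(env, "DEEPGRAM_API_KEY", missing),
    elevenLabsApiKey: required(env, "ELEVENLABS_API_KEY", missing),
    openAiApiKey: optional(env, "OPENAI_API_KEY"),
    langfuseSecretKey: optional(env, "LANGFUSE_SECRET_KEY"),
  };
  if (missing.length > 0) {
    // Report every missing key at once instead of failing one at a time.
    throw new Error(`Missing required environment variables: ${missing.join(", ")}`);
  }
  return config;
}
```

In a Node.js process this would typically be called once as `loadConfig(process.env)`, so a misconfigured deployment fails at boot rather than in the middle of a call.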

Step 1: Set up the project scaffold

Create the project directory and initialize it with pnpm. A Fastify server handles the voice agent runtime, while Next.js App Router routes serve as Twilio webhook endpoints.

terminal

mkdir mistral-voice-agent
cd mistral-voice-agent
pnpm init
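For orientation on how the two pieces named in this step relate: a Twilio voice webhook usually answers an incoming call with TwiML, and a `<Connect><Stream>` element tells Twilio to open a bidirectional Media Streams WebSocket to the voice runtime. The helper below is a sketch under that assumption. `buildStreamTwiml` and the example URL are hypothetical and are not taken from the tutorial's code.

```typescript
// Escape characters that are not allowed inside an XML attribute value.
function escapeXml(value: string): string {
  return value
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Build the TwiML that a voice webhook could return so that Twilio streams
// call audio to the given WebSocket URL (hypothetical helper).
export function buildStreamTwiml(streamUrl: string): string {
  if (!streamUrl.startsWith("wss://")) {
    throw new Error("Media Streams expect a secure wss:// URL");
  }
  return [
    '<?xml version="1.0" encoding="UTF-8"?>',
    "<Response>",
    "  <Connect>",
    `    <Stream url="${escapeXml(streamUrl)}" />`,
    "  </Connect>",
    "</Response>",
  ].join("\n");
}
```

A webhook route would return this string with a `text/xml` content type, for example `buildStreamTwiml("wss://example.com/media")`, where the host and path are placeholders.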

Install all dependencies:

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

186 kB·110 tests·95.8% coverage·vitest passing

SHA-256: 7d134504c76364b8c0d790a7c9908b27c3c5abe40912294ec75332dd83085daf

Step 6: Set up session continuity

import { SessionManager } from "@reaatech/session-continuity";
import { MemoryAdapter } from "@reaatech/session-continuity-storage-memory";
import type { Message } from "@reaatech/session-continuity";
import { agentMemory } from "./agent-memory.js";

// Rough token estimate: whitespace-separated words multiplied by 1.3.
export class SimpleTokenCounter {
  model = "simple-estimate";
  tokenizer = "whitespace";

  count(content: string): number {
    return Math.ceil(content.split(/\s+/).length * 1.3);
  }

  countMessages(messages: Message[]): number {
    return messages.reduce((sum: number, msg: Message) => {
      const content: string = typeof msg.content === "string" ? msg.content : "";
      return sum + this.count(content);
    }, 0);
  }

  calculateMessageTokens(content: string): number {
    return this.count(content);
  }
}

export const sessionManager = new SessionManager({
  storage: new MemoryAdapter(),
  tokenCounter: new SimpleTokenCounter(),
  tokenBudget: {
    maxTokens: 4096,
    reserveTokens: 500,
    overflowStrategy: "compress",
  },
  compression: {
    strategy: "sliding_window",
    targetTokens: 3500,
  },
  sessionTTL: 3600,
});

sessionManager.on("session:created", (payload) => {
  console.log("Session created:", payload.sessionId);
});

sessionManager.on("session:ended", (payload) => {
  console.log("Session ended:", payload.sessionId);
});

sessionManager.on("message:added", (payload) => {
  // Fire-and-forget: memory extraction must not block the call flow.
  void (async () => {
    try {
      const messages = await sessionManager.getMessages(payload.sessionId, { limit: 2 });
      if (messages.length >= 2) {
        const conversation = messages.map((m) => ({
          speaker: m.role as "user" | "agent",
          content: typeof m.content === "string" ? m.content : "",
          timestamp: m.createdAt,
        }));
        const stored = await agentMemory.extractAndStore(conversation);
        console.log(`Stored ${String(stored.length)} memories from session ${payload.sessionId}`);
      }
    } catch (error) {
      // Log instead of letting the failure surface as an unhandled rejection.
      console.error("Memory extraction failed:", error);
    }
  })();
});

export async function getConversationContext(sessionId: string) {
  return sessionManager.getConversationContext(sessionId);
}
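To make the `sliding_window` setting more concrete, here is a self-contained sketch of what a sliding-window trim to a token target could look like. It illustrates the general technique and makes no claim about how `@reaatech/session-continuity` implements it. `ChatTurn`, `estimateTokens`, and `slidingWindow` are hypothetical names, and unlike `SimpleTokenCounter` the estimator here returns 0 for empty text.

```typescript
interface ChatTurn {
  role: "user" | "assistant" | "system";
  content: string;
}

// Same words-times-1.3 heuristic as the counter above, but empty text costs 0.
function estimateTokens(content: string): number {
  const words = content.trim().split(/\s+/).filter((word) => word.length > 0);
  return Math.ceil(words.length * 1.3);
}

// Keep the newest turns whose combined estimate fits within targetTokens.
// Older turns are dropped first; the original order is preserved.
export function slidingWindow(turns: ChatTurn[], targetTokens: number): ChatTurn[] {
  const kept: ChatTurn[] = [];
  let used = 0;
  for (const turn of [...turns].reverse()) {
    const cost = estimateTokens(turn.content);
    if (used + cost > targetTokens) {
      break; // this turn and everything older falls outside the window
    }
    kept.unshift(turn);
    used += cost;
  }
  return kept;
}
```

With a target of 3500 tokens, as configured above, a window like this would only start dropping turns on fairly long calls.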

Step 10: Integrate Mistral AI for conversation

import { Mistral } from "@mistralai/mistralai";
import { sessionManager } from "./session-store.js";

export const mistral = new Mistral({
  apiKey: process.env.MISTRAL_API_KEY ?? "",
});

// Generate the agent's reply from the session history plus the new utterance.
export async function generateResponse(
  conversationText: string,
  sessionId: string,
  memoryContext: string,
): Promise<string> {
  try {
    const context = await sessionManager.getConversationContext(sessionId);
    const history = context.map((msg) => ({
      role: msg.role as "user" | "assistant" | "system",
      content: typeof msg.content === "string" ? msg.content : "",
    }));

    const result = await mistral.chat.complete({
      model: "mistral-large-latest",
      messages: [
        { role: "system", content: buildSystemPrompt(memoryContext) },
        ...history,
        { role: "user", content: conversationText },
      ],
    });

    // choices may be missing or empty, so read the first message defensively.
    const content = result.choices?.[0]?.message?.content;
    if (content && typeof content === "string") {
      return content;
    }
    return "I'm sorry, I couldn't process that.";
  } catch (error) {
    console.error("Mistral error:", error);
    return "I'm having trouble connecting to our AI service. Please try again shortly.";
  }
}

export function buildSystemPrompt(memoryContext: string): string {
  return `You are a helpful after-hours receptionist for a business. \
Answer FAQs about business hours and services. \
Book appointments via the calendar tool. \
Escalate urgent requests to a human agent. \
Keep responses concise and friendly. \
${memoryContext ? `\nRelevant context: ${memoryContext}` : ""}`;
}

// Keyword-based intent detection; the first matching group wins.
export function parseIntent(
  response: string,
): "faq" | "appointment" | "escalation" | "unknown" {
  const lower = response.toLowerCase();
  if (lower.includes("book") || lower.includes("appointment") || lower.includes("schedule")) {
    return "appointment";
  }
  if (
    lower.includes("hours") ||
    lower.includes("open") ||
    lower.includes("service") ||
    lower.includes("faq")
  ) {
    return "faq";
  }
  if (lower.includes("emergency") || lower.includes("urgent") || lower.includes("escalate")) {
    return "escalation";
  }
  return "unknown";
}
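Because `parseIntent` returns on the first keyword group that matches, the order of the checks decides the result for mixed input: text containing both "urgent" and "book" is classified as an appointment. If escalation should take priority, one option is a table-driven variant in which the rule order is explicit. This is a sketch of an alternative, not the tutorial's implementation. `classifyIntent` and `INTENT_RULES` are hypothetical names, and whether escalation should win is an assumption about the desired behavior.

```typescript
type Intent = "faq" | "appointment" | "escalation" | "unknown";

interface IntentRule {
  intent: Intent;
  keywords: readonly string[];
}

// Rules are evaluated top to bottom, so priority is visible in one place.
// The keyword lists match those used by parseIntent.
const INTENT_RULES: readonly IntentRule[] = [
  { intent: "escalation", keywords: ["emergency", "urgent", "escalate"] },
  { intent: "appointment", keywords: ["book", "appointment", "schedule"] },
  { intent: "faq", keywords: ["hours", "open", "service", "faq"] },
];

export function classifyIntent(text: string): Intent {
  const lower = text.toLowerCase();
  for (const rule of INTENT_RULES) {
    if (rule.keywords.some((keyword) => lower.includes(keyword))) {
      return rule.intent;
    }
  }
  return "unknown";
}
```

Substring matching stays as coarse as in the original ("open" also matches "opening"), so this only changes the priority between groups, not the matching itself.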

Step 14: Orchestrate the voice pipeline

import {
  createPipeline,
  createLatencyBudget,
  initializeSessionManager,
  LatencyBudgetEnforcer,
  getDefaultConfig,
} from "@reaatech/voice-agent-core";
import type {
  AudioChunk,
  STTProvider,
  TTSProvider,
  MCPClient,
  VoiceAgentKitConfig,
} from "@reaatech/voice-agent-core";
import { generateResponse } from "./mistral-chat.js";

const voiceSessionManager = initializeSessionManager({
  defaultTTL: 3600,
  maxTurns: 20,
  maxTokens: 4000,
});

// Per-turn latency budget: an overall target and hard cap plus per-stage values.
const latencyEnforcer = new LatencyBudgetEnforcer(
  createLatencyBudget({
    target: 800,
    hardCap: 1200,
    stt: 200,
    mcp: 400,
    tts: 200,
  }),
);

// Adapter that routes the pipeline's MCP requests to Mistral.
const mcpClient: MCPClient = {
  async connect() {},
  async sendRequest(params) {
    const text = await generateResponse(params.utterance, params.sessionId, "");
    // Latency is not measured here; 0 is a placeholder.
    return { text, toolCalls: [], latencyMs: 0 };
  },
  async close() {},
};

let pipeline: ReturnType<typeof createPipeline> | null = null;

export function createVoicePipeline(
  sttProvider: STTProvider,
  ttsProvider: TTSProvider,
  mcpClientOverride?: MCPClient,
  config?: Partial<VoiceAgentKitConfig>,
) {
  const client = mcpClientOverride ?? mcpClient;
  const effectiveConfig = config
    ? { ...getDefaultConfig(), ...config }
    : getDefaultConfig();

  pipeline = createPipeline({
    sessionManager: voiceSessionManager,
    latencyEnforcer,
    sttProvider,
    ttsProvider,
    mcpClient: client,
    config: effectiveConfig,
  });

  pipeline.on("pipeline:turn:end", (event: unknown) => {
    console.log("Turn complete:", (event as Record<string, unknown>).data);
  });
  pipeline.on("pipeline:error", (event: unknown) => {
    console.error("Pipeline error:", event);
  });

  return pipeline;
}

// The helpers below do nothing until createVoicePipeline has been called.
export async function startSession(sessionId: string): Promise<void> {
  if (pipeline) {
    await pipeline.startSession({ sessionId, status: "active" });
  }
}

export async function processAudio(
  sessionId: string,
  chunk: AudioChunk,
): Promise<void> {
  if (pipeline) {
    await pipeline.processAudioChunk(sessionId, chunk);
  }
}

export async function endSession(sessionId: string): Promise<void> {
  if (pipeline) {
    await pipeline.endSession(sessionId);
  }
}

export function destroyPipeline(): void {
  pipeline?.destroy();
}
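To report a measured latency for the Mistral call and compare it with the 400 ms `mcp` budget configured above, a small timing helper along these lines could be used. This is a sketch: `timed` and `Timed` are hypothetical names, not part of `@reaatech/voice-agent-core`, and the clock is injectable only to make the helper easy to test.

```typescript
export interface Timed<T> {
  value: T;
  latencyMs: number;
  withinBudget: boolean;
}

// Run an async operation, measure its wall-clock duration, and flag whether
// it stayed within the given budget. The clock defaults to Date.now.
export async function timed<T>(
  work: () => Promise<T>,
  budgetMs: number,
  now: () => number = Date.now,
): Promise<Timed<T>> {
  const startedAt = now();
  const value = await work();
  const latencyMs = now() - startedAt;
  return { value, latencyMs, withinBudget: latencyMs <= budgetMs };
}
```

Inside `sendRequest`, the call to `generateResponse` could then be wrapped as `timed(() => generateResponse(params.utterance, params.sessionId, ""), 400)`, with the resulting `latencyMs` returned in place of a fixed value. Note that this only measures the call; it does not cancel a request that runs over budget.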

Intro

Prerequisites

Step 1: Set up the project scaffold

Step 2: Configure TypeScript, Next.js, and Vitest

Step 3: Define core domain types

Step 4: Create typed configuration from environment variables

Step 5: Define typed error classes

Step 6: Set up session continuity

Step 7: Wire up agent memory

Step 8: Connect Deepgram speech-to-text

Step 9: Connect ElevenLabs text-to-speech

Step 10: Integrate Mistral AI for conversation

Step 11: Build the Google Calendar integration

Step 12: Set up observability with Langfuse

Step 13: Create the Twilio webhook handler

Step 14: Orchestrate the voice pipeline

Step 15: Wire everything in the Fastify server entrypoint

Step 16: Create Next.js App Router webhook endpoints

Step 17: Configure environment variables

Step 18: Run the tests

Next steps