Small practices like dental clinics lose revenue because nobody answers the phone after hours. Missed appointment calls lead to empty slots and frustrated patients who expect 24/7 booking.
A complete, working implementation of this recipe is available as an artifact, downloadable as a zip or browsable file by file. It is generated by our build pipeline and tested with full coverage before publishing.
You’ll build a Twilio-powered voice agent that answers after-hours phone calls, transcribes the caller’s speech with OpenAI Whisper, classifies their intent, and books appointments through Calendly — all without human intervention. Along the way you’ll work with the REAA agent-mesh ecosystem for request orchestration, intent classification, session management, budget enforcement, and observability. By the end you’ll have a production-style Next.js application with full TypeScript coverage, a test suite, and two working API routes: one for inbound Twilio voice webhooks and one for health checks.
Prerequisites
Node.js >= 22 (the engines field requires it)
pnpm 10.x (the project uses pnpm@10.6.3 as its package manager; install it with corepack enable && corepack prepare pnpm@10.6.3 --activate)
Accounts and API keys:
A Twilio account with an active phone number (TWILIO_ACCOUNT_SID, TWILIO_AUTH_TOKEN, TWILIO_PHONE_NUMBER from console.twilio.com)
An OpenAI account with an API key (OPENAI_API_KEY)
A Calendly account with an API key (CALENDLY_API_KEY)
Step 2: Install dependencies
Run the package manager install. This pulls in Next.js, Twilio, OpenAI, all six REAA packages, and every dev dependency.
```sh
pnpm install
```
The postinstall script runs automatically (it calls sh bin/postinstall.sh) — the artifact provides a no-op hook. If you haven’t created that file yet the script fails harmlessly; you can add a bin/postinstall.sh containing only the line #!/bin/sh followed by a # noop comment, or remove the postinstall entry from package.json for now. All dependencies land in node_modules/ with exact pinned versions.
Step 3: Configure environment variables
Create .env.local at the project root. This file is ignored by git (you added it to .gitignore), so it’s safe for local secrets.
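A skeleton for .env.local, using the variable names this recipe relies on (AGENT_MESH_GATEWAY_API_KEY is prefilled with its local default; the other values are yours to fill in):

```sh
# Twilio (from console.twilio.com)
TWILIO_ACCOUNT_SID=
TWILIO_AUTH_TOKEN=
TWILIO_PHONE_NUMBER=

# OpenAI and Calendly
OPENAI_API_KEY=
CALENDLY_API_KEY=

# Agent-mesh gateway (dev-key is the local development default)
AGENT_MESH_GATEWAY_API_KEY=dev-key
```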
Fill in the empty values with your actual keys. AGENT_MESH_GATEWAY_API_KEY defaults to dev-key for local development. The voice middleware validates that TWILIO_ACCOUNT_SID, TWILIO_AUTH_TOKEN, OPENAI_API_KEY, and CALENDLY_API_KEY are set on every request — if any are missing the route returns a spoken error TwiML.
Step 4: Create the environment validator and logger
The application needs a startup guard that validates environment variables, and a logging re-export from the observability package.
Create src/lib/env.ts:
```ts
/**
 * Validates that all required environment variables are set at startup.
 */
export function validateEnv(): void {
  const required = [
    'TWILIO_ACCOUNT_SID',
    'TWILIO_AUTH_TOKEN',
    'OPENAI_API_KEY',
    'CALENDLY_API_KEY',
  ] as const;

  const missing: string[] = [];
  for (const key of required) {
    if (!process.env[key]) {
      missing.push(key);
    }
  }

  if (missing.length > 0) {
    throw new Error(
      `Missing required environment variables: ${missing.join(', ')}. ` +
        'Set them in .env or your deployment environment.',
    );
  }
}
```
Create src/lib/logger.ts:
```ts
/**
 * Re-exports logger and child logger factory from agent-mesh-observability.
 *
 * The observability package provides Winston-based structured JSON logging
 * with automatic PII redaction (emails, phone numbers, SSNs, credit cards).
 */
export { logger, createChildLogger } from '@reaatech/agent-mesh-observability';
```
The logger module re-exports from @reaatech/agent-mesh-observability, which gives you structured JSON logging with automatic PII redaction. The rest of the codebase imports from ./logger instead of reaching directly into the REAA package, keeping the dependency surface clean.
Step 5: Build the voice middleware
Every inbound Twilio webhook needs request tracing, env validation, and error recovery that returns spoken TwiML instead of crashing. The withVoiceMiddleware wrapper handles that.
Create src/lib/voice-middleware.ts:
```ts
/**
 * Voice middleware — wraps Next.js route handlers with request tracing
 * and error recovery.
 *
 * Generates a request_id via crypto.randomUUID(), creates a scoped child
 * logger, wraps the handler in try/catch, and records session lookup
 * duration on success.
 */
import {
  createChildLogger,
  recordSessionLookupDuration,
} from '@reaatech/agent-mesh-observability';
import { validateEnv } from './env';

/**
 * Wraps a route handler with voice middleware — request tracing, logging,
 * error handling, and env validation.
 */
export function withVoiceMiddleware(
  handler: (req: Request) => Promise<Response>,
): (req: Request) => Promise<Response> {
  return async (req: Request): Promise<Response> => {
    const requestId = crypto.randomUUID();
    const logger = createChildLogger({ request_id: requestId });

    try {
      validateEnv();
      const start = Date.now();
      const response = await handler(req);
      const durationMs = Date.now() - start;
      recordSessionLookupDuration(durationMs, false);
      return response;
    } catch (error) {
      const message = error instanceof Error ? error.message : String(error);
      logger.error('Voice handler error', { error: message });
      const twiml = `<?xml version="1.0" encoding="UTF-8"?>
<Response>
  <Say>Sorry, we're experiencing a technical issue. Please try again later.</Say>
</Response>`;
      return new Response(twiml, {
        status: 200,
        headers: { 'Content-Type': 'text/xml' },
      });
    }
  };
}
```
On success the middleware records the call duration as a metric via the observability package. On any thrown error it returns valid TwiML with a spoken apology — the caller hears a friendly message instead of silence.
Step 6: Create session management and intent classification
Session management wraps @reaatech/agent-mesh-session for the Twilio voice call lifecycle — creating and retrieving sessions by CallSid, appending conversation turns, and finalizing sessions.
Create src/lib/session.ts:
```ts
/**
 * Session management wrapping @reaatech/agent-mesh-session.
 *
 * Provides getOrCreateVoiceSession, appendCallTurn, and finalizeSession
 * for the Twilio voice call lifecycle.
 */
import {
  getActiveSession,
  createSession,
  appendTurn,
  closeSession,
} from '@reaatech/agent-mesh-session';
import { createChildLogger } from './logger';

const log = createChildLogger({ module: 'session' });

/** Turn entry shape for session history. */
export interface TurnEntry {
  role: string;
  content: string;
  timestamp: string;
  intent_summary?: string;
}

/** Session record shape from agent-mesh-session. */
export interface SessionRecord {
  session_id: string;
  user_id: string;
  employee_id: string;
  status: 'active' | 'completed' | 'abandoned' | 'error';
  active_agent: string;
  turn_history: TurnEntry[];
  workflow_state: Record<string, unknown>;
  created_at: string;
  updated_at: string;
  ttl: Date;
}

/**
 * Retrieves an active session for a call SID, or creates a new one.
 */
export async function getOrCreateVoiceSession(
  callSid: string,
): Promise<SessionRecord> {
  const existing = await getActiveSession(callSid);
  if (existing) {
    return existing as SessionRecord;
  }
  const session = await createSession({
    userId: callSid,
    employeeId: callSid,
    activeAgent: 'scheduler',
  });
  return session as SessionRecord;
}

/**
 * Appends a conversation turn to a session.
 */
export async function appendCallTurn(
  sessionId: string,
  role: 'user' | 'agent',
  content: string,
  intentSummary?: string,
): Promise<void> {
  await appendTurn(sessionId, {
    role,
    content,
    timestamp: new Date().toISOString(),
    intent_summary: intentSummary,
  });
}

/**
 * Finalizes a session with a terminal status.
 */
export async function finalizeSession(
  sessionId: string,
  status: 'completed' | 'abandoned' | 'error',
): Promise<void> {
  await closeSession(sessionId, status);
  log.info('Session closed', { session_id: sessionId, status });
}
```
Now create the classifier. It registers two agents (scheduler and general) with example utterances, then delegates classification to the @reaatech/agent-mesh-classifier service — which uses Gemini Flash when available or falls back to keyword matching.
Create src/lib/classifier.ts:
```ts
import { classifierService } from '@reaatech/agent-mesh-classifier';

/**
 * Classifier output shape matching the agent-mesh-classifier return type.
 */
export interface ClassifierOutput {
  agent_id: string;
  confidence: number;
  ambiguous: boolean;
  detected_language: string;
  intent_summary: string;
  entities: Record<string, unknown>;
}

const defaultRegistry = [
  {
    agent_id: 'scheduler',
    display_name: 'Appointment Scheduler',
    description: 'Books, reschedules, or cancels appointments',
    endpoint: 'http://localhost:8080/v1/request',
    type: 'mcp' as const,
    is_default: true,
    confidence_threshold: 0.6,
    clarification_required: false,
    examples: [
      'I need to book a dental cleaning',
      'Can I reschedule my appointment?',
      'I want to cancel my visit',
    ],
  },
  {
    agent_id: 'general',
    display_name: 'General Inquiry',
    description: 'Answers general questions and routes to staff',
    endpoint: 'http://localhost:8080/v1/request',
    type: 'mcp' as const,
    is_default: false,
    confidence_threshold: 0.6,
    clarification_required: false,
    examples: [
      'What are your business hours?',
      'Do you accept my insurance?',
      'I have a question about billing',
    ],
  },
];

export async function classifyIntent(
  transcript: string,
): Promise<ClassifierOutput> {
  const result = await classifierService.classify(transcript, defaultRegistry);
  return result as ClassifierOutput;
}

export function isClassifierMock(): boolean {
  return classifierService.isMock();
}
```
The registry entries include display names, descriptions, confidence thresholds, and example utterances. The classifier matches incoming speech to the most likely agent and returns a confidence score — the handoff layer uses this to decide whether to route, clarify, or fallback.
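To make the keyword fallback concrete, here is a toy sketch of the idea. It is an illustration only, not the agent-mesh-classifier package's actual algorithm: it scores each agent by word overlap with its example utterances and marks low-confidence results as ambiguous.

```typescript
// Illustrative keyword fallback. NOT the real classifier implementation;
// just the idea it degrades to when no LLM is available.
interface AgentExamples {
  agent_id: string;
  examples: string[];
}

function keywordClassify(
  transcript: string,
  registry: AgentExamples[],
  threshold = 0.6,
): { agent_id: string; confidence: number; ambiguous: boolean } {
  const words = new Set(transcript.toLowerCase().split(/\W+/).filter(Boolean));
  let best = { agent_id: registry[0]?.agent_id ?? '', confidence: 0 };

  for (const agent of registry) {
    const exampleWords = new Set(
      agent.examples.join(' ').toLowerCase().split(/\W+/).filter(Boolean),
    );
    // Fraction of the caller's words that appear in this agent's examples.
    const overlap = [...words].filter((w) => exampleWords.has(w)).length;
    const confidence = words.size > 0 ? overlap / words.size : 0;
    if (confidence > best.confidence) {
      best = { agent_id: agent.agent_id, confidence };
    }
  }

  return { ...best, ambiguous: best.confidence < threshold };
}
```

An utterance like "book a cleaning please" shares most of its words with the scheduler's examples, so it routes there with high confidence, while an off-script utterance scores below the threshold and is flagged ambiguous.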
Step 7: Build handoff routing and gateway dispatch
The handoff module converts classifier output into a structured payload that follows the @reaatech/agent-handoff protocol, then decides which agent should handle the request.
Create src/lib/handoff.ts:
```ts
/**
 * Handoff routing helpers — builds handoff payloads and routes to agents
 * based on classifier output.
 */
import type {
  HandoffPayload,
  Message,
} from '@reaatech/agent-handoff';
import type { ClassifierOutput } from './classifier';

/**
 * Custom routing decision type for this application.
 */
export interface AppRoutingDecision {
  action: 'route' | 'clarify' | 'fallback';
  targetAgent?: string;
  reason: string;
}

/**
 * Builds a HandoffPayload from classification output and session context.
 */
export function buildHandoffPayload(
  intent: ClassifierOutput,
  sessionId: string,
  transcript: string,
): HandoffPayload {
  const now = new Date();
  const message: Message = {
    id: crypto.randomUUID(),
    role: 'user',
    content: transcript,
    timestamp: now,
  };

  return {
    handoffId: crypto.randomUUID(),
    sessionId,
    conversationId: sessionId,
    sessionHistory: [message],
    compressedContext: {
      summary: intent.intent_summary,
      keyFacts: [],
      intents: [
        {
          intent: intent.intent_summary,
          confidence: intent.confidence,
          entities: Object.keys(intent.entities),
        },
      ],
      entities: Object.entries(intent.entities).map(([name, value]) => ({
        name,
        type: typeof value === 'string' ? 'string' : 'unknown',
        value: value as string,
        resolved: true,
      })),
      openItems: [],
      compressionMethod: 'identity',
      originalTokenCount: transcript.length,
      compressedTokenCount: intent.intent_summary.length,
      compressionRatio:
        transcript.length > 0
          ? intent.intent_summary.length / transcript.length
          : 1,
    },
    handoffReason: {
      type: 'specialist_required' as const,
      requiredSkills: [intent.agent_id],
      currentAgentSkills: [],
    },
    userMetadata: {
      userId: sessionId,
    },
    conversationState: {
      resolvedEntities: intent.entities as Record<string, unknown>,
      openQuestions: [],
      contextVariables: {},
    },
    createdAt: now,
  };
}

/**
 * Routes a HandoffPayload to the appropriate agent based on classification.
 */
export function routeToAgent(
  _payload: HandoffPayload,
  intent: ClassifierOutput,
): AppRoutingDecision {
  if (intent.ambiguous) {
    return {
      action: 'clarify',
      reason: 'Intent ambiguous across agents. Need clarification.',
    };
  }
  if (intent.agent_id === 'scheduler') {
    return {
      action: 'route',
      targetAgent: 'scheduler',
      reason: 'High-confidence scheduler intent',
    };
  }
  if (intent.agent_id === 'general') {
    return {
      action: 'route',
      targetAgent: 'general',
      reason: 'General inquiry',
    };
  }
  return {
    action: 'fallback',
    reason: 'No matching agent for intent',
  };
}
```
Now create the gateway dispatch module. This is the bridge between the application and the agent-mesh orchestration layer — it sends transcripts to @reaatech/agent-mesh-gateway’s handleInternalRequest and shapes the response into a structured object.
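The full gateway module ships with the artifact. The sketch below shows its rough shape; to keep it self-contained, the gateway call is injected as a parameter (the real module imports handleInternalRequest from @reaatech/agent-mesh-gateway directly), and the DispatchResult fields are assumptions for illustration.

```typescript
// Sketch of the gateway dispatch bridge. The injected `callGateway` stands in
// for handleInternalRequest so the shape can be exercised without the
// package; the result fields are illustrative.
interface DispatchResult {
  ok: boolean;
  spokenResponse: string;
}

type GatewayCall = (req: {
  input: string;
  user_id: string;
  session_id: string;
}) => Promise<{ output?: string }>;

async function dispatchToAgentMeshSketch(
  transcript: string,
  sessionId: string,
  callGateway: GatewayCall,
): Promise<DispatchResult> {
  try {
    const result = await callGateway({
      input: transcript,
      user_id: sessionId,
      session_id: sessionId,
    });
    return {
      ok: true,
      spokenResponse:
        result.output ?? "I'll have a staff member follow up with you shortly.",
    };
  } catch {
    // Graceful fallback: the caller hears a message instead of an error.
    return {
      ok: false,
      spokenResponse:
        "Sorry, I couldn't process that right now. Please try again.",
    };
  }
}
```

In the application itself, the injected function corresponds to handleInternalRequest, called with the same request shape the healthz route uses.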
Step 8: Enforce per-call budgets
Every LLM call costs money. The budget module prevents runaway spend with a BudgetController from @reaatech/agent-budget-engine, an in-memory spend store, and a static pricing provider that knows the per-call cost of whisper-1 ($0.006), gpt-4o ($0.005), and gpt-4o-mini ($0.00015).
Create src/lib/budget.ts:
```ts
/**
 * Budget management for per-call AI cost tracking.
 *
 * Provides an InMemorySpendStore compatible with the BudgetController's
 * SpendStore interface, a PricingProvider for known models, and factory
 * functions to wire up budget enforcement per phone call.
 */
import { BudgetController } from '@reaatech/agent-budget-engine';
import { SpendStore } from '@reaatech/agent-budget-spend-tracker';
import type { SpendEntry, BudgetScope } from '@reaatech/agent-budget-types';
import { createChildLogger } from './logger';

const log = createChildLogger({ module: 'budget' });

/** Per-request USD cost for known models. */
const MODEL_COSTS: Record<string, number> = {
  'whisper-1': 0.006,
  'gpt-4o': 0.005,
  'gpt-4o-mini': 0.00015,
};

/**
 * In-memory spend store compatible with BudgetController's SpendStore interface.
 * Extends the real SpendStore class for full type compatibility.
 */
export class InMemorySpendStore extends SpendStore {
  private data = new Map<string, number>();

  /**
   * Records a spend amount for a scope and returns the updated total.
   */
  // ... the listing is truncated here; the recordSpend implementation, the
  // pricing provider, and the per-call BudgetController factory ship with
  // the artifact.
}
```
Each call gets a $1.00 hard cap with a soft warning at $0.80. The controller emits threshold-breach and hard-stop events that are logged through the observability layer. The pricingProvider knows the cost of each model — if you ask for a model it doesn’t recognize, it throws immediately rather than silently overspending.
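To make the threshold arithmetic concrete, here is a self-contained sketch. It illustrates the enforcement idea only; the CallBudget class and its charge method are invented for this example and are not the BudgetController API.

```typescript
// Illustrative per-call budget tracker. The real enforcement lives in
// @reaatech/agent-budget-engine; only the arithmetic is shown here.
const MODEL_COSTS: Record<string, number> = {
  'whisper-1': 0.006,
  'gpt-4o': 0.005,
  'gpt-4o-mini': 0.00015,
};

class CallBudget {
  private spent = 0;

  constructor(
    private readonly hardCapUsd = 1.0,
    private readonly softWarnUsd = 0.8,
  ) {}

  /** Records one model call; throws on unknown models or a breached cap. */
  charge(model: string): { spent: number; warning: boolean } {
    const cost = MODEL_COSTS[model];
    if (cost === undefined) {
      // Unknown model: fail fast instead of silently overspending.
      throw new Error(`No pricing for model: ${model}`);
    }
    if (this.spent + cost > this.hardCapUsd) {
      throw new Error('Hard budget cap reached for this call');
    }
    this.spent += cost;
    return { spent: this.spent, warning: this.spent >= this.softWarnUsd };
  }
}
```

A single Whisper transcription costs $0.006, so roughly 130 transcription chunks fit under the soft warning and the hard cap stops a runaway call well before it becomes expensive.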
Step 9: Create the scheduler and general agents
The scheduler agent is the workhorse: it extracts appointment details from a transcript via OpenAI’s gpt-4o-mini with JSON structured output, checks Calendly availability, and creates a scheduled event.
Create src/agents/scheduler.ts:
```ts
/**
 * Scheduler Agent — parses appointment details from transcripts and
 * interacts with the Calendly API to find availability and book slots.
 */
import OpenAI from 'openai';
import { updateWorkflowState } from '@reaatech/agent-mesh-session';
import { createChildLogger } from '../lib/logger';

const log = createChildLogger({ module: 'scheduler-agent' });

const openai = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY,
});

/**
 * Parsed appointment details from the transcript.
 */
export interface AppointmentDetails {
  patient_name: string;
  reason: string;     // inferred from the confirmation message below
  date_time: string;  // inferred from the confirmation message below
}

// ... the listing is truncated here; the gpt-4o-mini extraction, Calendly
// availability lookup, and booking logic ship with the artifact. On success
// the agent confirms with:
//   `Great news, ${details.patient_name}! Your appointment for
//    ${details.reason} has been booked on ${details.date_time}.
//    You'll receive a confirmation shortly.`
```
The general agent handles non-appointment inquiries (billing questions, hours, insurance) by returning a polite message promising a human callback.
Create src/agents/general.ts:
```ts
/**
 * General fallback agent — returns a polite message for non-appointment
 * inquiries.
 */
export class GeneralAgent {
  /**
   * Handles a general inquiry by returning a human-transfer message.
   */
  async handleInquiry(_input: string, _sessionId: string): Promise<string> {
    return "I'll have a staff member call you back during business hours to assist with your inquiry.";
  }
}
```
Step 10: Build the media stream handler
When Twilio connects a call, it streams audio via a WebSocket. The media stream handler accumulates transcribed segments, then dispatches the completed transcript through the agent-mesh gateway.
Create src/lib/media-stream.ts:
```ts
/**
 * Twilio Media Stream handler — processes audio chunks from a Twilio
 * Media Stream WebSocket, transcribes them via OpenAI Whisper, accumulates
 * the transcript, and dispatches to the agent-mesh gateway.
 */
import OpenAI from 'openai';
import { logger } from '@reaatech/agent-mesh-observability';
import { dispatchToAgentMesh } from './gateway';

const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

/**
 * Accumulates partial transcript segments as they arrive from Whisper.
 */
export class AccumulatingTranscript {
  private segments: string[] = [];

  /**
   * Adds a transcribed segment to the accumulation buffer.
   */
  addSegment(segment: string): void {
    this.segments.push(segment);
  }

  /** Returns the segments joined into a single transcript string. */
  getFullTranscript(): string {
    return this.segments.join(' ').trim();
  }
}

// ... the listing is truncated here (the two method bodies above are
// reconstructed); transcribeAudioChunk and handleCompletedTranscript ship
// with the artifact.
```
The AccumulatingTranscript class collects partial transcription segments as they stream in from Whisper. transcribeAudioChunk takes a base64-encoded audio payload, wraps it in a File object, and sends it to the Whisper API. handleCompletedTranscript dispatches the full text through the agent-mesh gateway and returns the spoken response — or a graceful fallback message on failure.
Step 11: Create the API routes
Now you’ll create the two API routes. The main one is the Twilio voice webhook at POST /api/twilio-voice. It validates the Twilio signature, creates or retrieves a session, builds a TwiML response that connects the call to a media stream, and fires off an initial dispatch to the agent mesh.
Create src/app/api/twilio-voice/route.ts:
```ts
import twilio from 'twilio';
import { withVoiceMiddleware } from '@/lib/voice-middleware';
import {
  createChildLogger,
  recordSessionLookupDuration,
} from '@reaatech/agent-mesh-observability';
import { getOrCreateVoiceSession, appendCallTurn } from '@/lib/session';
import { dispatchToAgentMesh } from '@/lib/gateway';

const log = createChildLogger({ module: 'twilio-voice' });

function parseFormBody(text: string): Record<string, string> {
  const params: Record<string, string> = {};
  // Body reconstructed -- the original listing is truncated at this point.
  for (const [key, value] of new URLSearchParams(text)) {
    params[key] = value;
  }
  return params;
}

// ... the listing is truncated here; signature validation, session handling,
// the TwiML <Connect><Stream> response, and the POST/GET handlers ship with
// the artifact.
```
The health check endpoint probes gateway reachability and confirms the OpenAI key is set.
Create src/app/api/healthz/route.ts:
```ts
/**
 * Health check endpoint for monitoring readiness.
 * Returns 200 with service status or 503 if dependencies are unreachable.
 */
import { initOtel } from '@reaatech/agent-mesh-observability';
import { handleInternalRequest } from '@reaatech/agent-mesh-gateway';

// Initialize OpenTelemetry at module load (fails silently if no endpoint)
try {
  initOtel();
} catch {
  // No OTLP endpoint configured — observability operates without tracing
}

/**
 * GET handler — health check.
 */
export async function GET(): Promise<Response> {
  const checks: Record<string, string> = {};

  // Check gateway reachability
  try {
    await handleInternalRequest({
      input: 'health',
      user_id: 'health-check',
      session_id: crypto.randomUUID(),
    });
    checks.gateway = 'ok';
  } catch {
    checks.gateway = 'unreachable';
  }

  // Check OpenAI config
  checks.openai = process.env.OPENAI_API_KEY ? 'ok' : 'missing';

  const allOk = Object.values(checks).every((v) => v === 'ok');

  return new Response(
    JSON.stringify({
      status: allOk ? 'ok' : 'degraded',
      checks,
    }),
    {
      status: allOk ? 200 : 503,
      headers: { 'Content-Type': 'application/json' },
    },
  );
}
```
GET requests to the Twilio webhook return a plain OK. The healthz endpoint returns {"status":"ok","checks":{"gateway":"ok","openai":"ok"}} when everything is healthy, or {"status":"degraded",...} with a 503 status code if a dependency is down.
Step 12: Write the tests and run them
The test suite uses vitest with mocked externals. Create the test setup file that mocks Twilio, OpenAI, and all REAA packages so tests never make real network calls.
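The full setup file ships with the artifact. A sketch along these lines captures the approach; the mocked module shapes below are assumptions to adapt to whatever your tests assert.

```ts
// tests/setup.ts — illustrative mocks; adjust return shapes to match what
// your assertions expect.
import { vi } from 'vitest';

vi.mock('twilio', () => ({
  default: {
    validateRequest: vi.fn().mockReturnValue(true),
  },
}));

vi.mock('openai', () => ({
  default: vi.fn().mockImplementation(() => ({
    audio: {
      transcriptions: {
        create: vi.fn().mockResolvedValue({ text: 'mock transcript' }),
      },
    },
    chat: { completions: { create: vi.fn() } },
  })),
}));

vi.mock('@reaatech/agent-mesh-gateway', () => ({
  handleInternalRequest: vi.fn().mockResolvedValue({ output: 'mock response' }),
}));
```

Register the file under test.setupFiles in your vitest config so every suite gets the mocks before any module under test is imported.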
Your Twilio voice agent is ready. Point your Twilio phone number’s voice webhook URL to https://<your-domain>/api/twilio-voice (use a tunnel like ngrok for local testing) and start handling calls.
Next steps
Configure a real Twilio Media Stream WebSocket handler to replace the placeholder — the artifact’s media-stream.ts is designed to be wired into a WebSocket route that the TwiML <Connect><Stream> element connects to.
Replace the in-memory spend store with a persistent store (Postgres, Redis) so budget enforcement survives server restarts.
Deploy to Vercel: push to GitHub, import the project, set all environment variables in the Vercel dashboard, and update the Twilio webhook URL to your production domain.