OpenRouter Voice Agent for Cal.com Appointment Scheduling

Let callers book, reschedule, or cancel Cal.com appointments through a conversational voice agent that uses any LLM via OpenRouter.

Tags: openrouter, voice-agent, calcom, appointment-scheduling, twilio, deepgram, elevenlabs, nextjs, typescript

The problem

Small businesses lose appointments when customers can't book over the phone and staff can't answer every call.

Intro

This recipe builds a conversational voice agent that lets callers book, reschedule, or cancel Cal.com appointments by speaking naturally over the phone. Incoming audio from Twilio Media Streams is transcribed with Deepgram, the caller's intent is classified by an LLM and routed through @reaatech/confidence-router, appointments are managed via the Cal.com REST API, and responses are spoken back through ElevenLabs. All LLM calls go through OpenRouter, so you can switch model providers without vendor lock-in. By the end you’ll have a working Next.js 16 app with unit-tested services, mock-HTTP tests, and 90%+ code coverage.
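The flow described above can be sketched as a single turn handler. This is only an illustration under stated assumptions, not the recipe's pipeline service: the `Stages` interface, `handleTurn`, and `CONFIDENCE_FLOOR` are hypothetical names introduced here, and the real providers stream audio rather than passing whole buffers.

```typescript
// Hypothetical stage signatures (assumptions for illustration only).
// The recipe's real services stream audio and are shaped differently.
interface Stages {
  // Speech-to-text (Deepgram in the recipe).
  transcribe(audio: Uint8Array): Promise<string>;
  // Intent classification (an LLM call through OpenRouter in the recipe).
  classify(transcript: string): Promise<{ intent: string; confidence: number }>;
  // Business logic (Cal.com booking, rescheduling, cancelling in the recipe).
  act(intent: string, transcript: string): Promise<string>;
  // Text-to-speech (ElevenLabs in the recipe).
  speak(text: string): Promise<Uint8Array>;
}

// Assumed cut-off, chosen to mirror the routeThreshold of 0.8 used by the
// recipe's router configuration.
const CONFIDENCE_FLOOR = 0.8;

// One conversational turn: caller audio in, agent audio out.
async function handleTurn(stages: Stages, audio: Uint8Array): Promise<Uint8Array> {
  const transcript = await stages.transcribe(audio);
  const { intent, confidence } = await stages.classify(transcript);

  // Below the floor, ask the caller to repeat instead of touching the calendar.
  const reply =
    confidence >= CONFIDENCE_FLOOR
      ? await stages.act(intent, transcript)
      : "Sorry, could you say that again?";

  return stages.speak(reply);
}
```

Passing the stages in as an object keeps each provider replaceable by a stub, which is one way to unit-test the turn logic without network calls.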

Prerequisites

Node.js 22+ and pnpm 10 installed on your machine
A Cal.com account (cloud or self-hosted) with OAuth2 client credentials
An OpenRouter API key (free tier available at openrouter.ai/keys)
A Deepgram API key (sign up at deepgram.com)
An ElevenLabs API key and a voice ID (elevenlabs.io)
A Langfuse account (optional, for observability tracing)
Familiarity with TypeScript and Next.js App Router basics
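
Before wiring anything together, it can help to confirm that the keys listed above are actually set. The sketch below is a rough illustration, not the recipe's configuration module: `DEEPGRAM_API_KEY` and `ELEVENLABS_VOICE_ID` appear in the recipe's code, while `OPENROUTER_API_KEY`, `ELEVENLABS_API_KEY`, and the `missingEnv` helper are assumed names. The Cal.com OAuth2 credentials are left out because their variable names are not shown here.

```typescript
// DEEPGRAM_API_KEY and ELEVENLABS_VOICE_ID come from the recipe's code.
// The other two names are assumptions made for this illustration.
const REQUIRED_VARS = [
  "OPENROUTER_API_KEY",
  "DEEPGRAM_API_KEY",
  "ELEVENLABS_API_KEY",
  "ELEVENLABS_VOICE_ID",
] as const;

type Env = Record<string, string | undefined>;

// Returns the names of required variables that are unset or blank,
// in the order they are listed in REQUIRED_VARS.
function missingEnv(env: Env): string[] {
  return REQUIRED_VARS.filter((name) => !env[name]?.trim());
}
```

In a Node.js process this could be called as `missingEnv(process.env)` at startup, failing fast with the list of missing names.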

Step 1: Scaffold the project and install dependencies

Create the project directory and initialize it with a Next.js 16 App Router shell, or use the scaffold already provided in this recipe. Every dependency in package.json is pinned to an exact version so builds are reproducible.
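
To illustrate what "pinned to an exact version" means in practice, here is a small sketch that flags dependency specs that are ranges rather than exact versions. `findUnpinned` and `EXACT_VERSION` are hypothetical helpers, not part of the recipe, and the pattern only covers plain semver strings.

```typescript
// Matches an exact semver version such as "1.2.3" or "1.2.3-beta.1".
// Ranges and tags ("^1.2.3", "~1.2.3", ">=1", "1.x", "*", "latest") do not match.
const EXACT_VERSION = /^\d+\.\d+\.\d+(-[0-9A-Za-z.-]+)?(\+[0-9A-Za-z.-]+)?$/;

// Returns the names of dependencies whose version spec is not an exact version.
// `deps` has the same shape as the "dependencies" object in package.json.
function findUnpinned(deps: Record<string, string>): string[] {
  return Object.keys(deps).filter((name) => !EXACT_VERSION.test(deps[name] ?? ""));
}
```

Run against the parsed `dependencies` and `devDependencies` objects, an empty result means every entry is pinned.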

Example artifact

A complete, working implementation of this recipe, downloadable as a zip or browsable file by file. It is generated by our build pipeline and tested with 98.9% coverage before publishing.

173 kB · 155 tests · 98.9% coverage · Vitest passing

SHA-256: eb6e8b8652c0d0fcdc2ae420a5f81b659d254d261da5f3e69c1d4e36c1194ad4

// Speech service: Deepgram speech-to-text and ElevenLabs text-to-speech.
import {
  DeepgramSTTProvider,
  STTProviderInterface,
  type DeepgramConfig,
} from "@reaatech/voice-agent-stt";
import {
  ElevenLabsTTSProvider as ElevenLabsProvider,
  type ElevenLabsConfig,
  TTSProviderInterface,
} from "@reaatech/voice-agent-tts";
import { DeepgramClient } from "@deepgram/sdk";
import { ElevenLabsClient } from "@elevenlabs/elevenlabs-js";
import { getConfig } from "../lib/config.js";
import type { AudioChunk } from "@reaatech/voice-agent-core";

export type { ElevenLabsProvider };

export function createSTTProvider(): DeepgramSTTProvider {
  return new DeepgramSTTProvider();
}

// Connect to Deepgram with settings that match Twilio's 8 kHz mu-law audio.
export async function connectSTT(stt: DeepgramSTTProvider): Promise<void> {
  const config = getConfig();
  const deepgramConfig: DeepgramConfig = {
    provider: "deepgram",
    apiKey: config.DEEPGRAM_API_KEY,
    model: "nova-2",
    language: "en",
    sampleRate: 8000,
    encoding: "mulaw",
    smartFormat: true,
    interimResults: true,
    endpointing: 300,
  };
  await stt.connect(deepgramConfig);
}

export function createTTSProvider(): ElevenLabsProvider {
  return new ElevenLabsProvider();
}

// Convert an incoming chunk to 8 kHz mu-law before it is sent to STT.
export function convertAudioChunk(chunk: AudioChunk): AudioChunk {
  return STTProviderInterface.convertAudioFormat(chunk, 8000, "mulaw");
}

// Synthesize text chunk by chunk, yielding Twilio-ready audio as it arrives.
export async function* synthesizeSpeech(
  tts: ElevenLabsProvider,
  text: string
): AsyncIterable<AudioChunk> {
  const config = getConfig();
  const ttsConfig: ElevenLabsConfig = {
    provider: "elevenlabs",
    modelId: "eleven_flash_v2_5",
    voiceId: config.ELEVENLABS_VOICE_ID,
    outputFormat: "mulaw_8000",
  };

  // Split the reply into chunks of at most 200 characters.
  const sentences = TTSProviderInterface.chunkTextForStreaming(text, 200);

  for (let i = 0; i < sentences.length; i++) {
    const sentence = sentences[i];
    const stream = tts.synthesize(sentence, ttsConfig);
    for await (const chunk of stream) {
      yield TTSProviderInterface.formatAudioForTwilio(chunk);
    }
    // Insert a 500 ms pause between chunks, but not after the last one.
    if (i < sentences.length - 1) {
      const silence = TTSProviderInterface.createSilenceChunk(500);
      yield silence;
    }
  }
}

export function cancelTTS(tts: ElevenLabsProvider): void {
  tts.cancel();
}

// Close the provider only if it exposes a close() method.
export async function closeTTS(tts: ElevenLabsProvider): Promise<void> {
  if (typeof (tts as { close?: () => Promise<void> }).close === "function") {
    await (tts as { close: () => Promise<void> }).close();
  }
}

export function getAvailableSDKs(): {
  DeepgramClient: typeof DeepgramClient;
  ElevenLabsClient: typeof ElevenLabsClient;
} {
  return { DeepgramClient, ElevenLabsClient };
}
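
The speech service splits each reply into chunks of at most 200 characters and inserts 500 ms of silence between them. How `@reaatech/voice-agent-tts` implements those helpers is not shown here, so the following is only a rough sketch of the idea: `chunkText` and `mulawSilence` are hypothetical stand-ins, and the silence arithmetic assumes 8 kHz, 8-bit mu-law audio (one byte per sample, with 0xFF as the encoded zero level).

```typescript
// Hypothetical stand-in for a sentence-based text chunker.
// Groups whole sentences into chunks of at most maxLen characters.
// A single sentence longer than maxLen is kept whole in this sketch.
function chunkText(text: string, maxLen: number): string[] {
  const sentences = text.match(/[^.!?]+[.!?]*/g) ?? [];
  const chunks: string[] = [];
  let current = "";

  for (const raw of sentences) {
    const sentence = raw.trim();
    if (!sentence) continue;
    const candidate = current ? `${current} ${sentence}` : sentence;
    if (candidate.length <= maxLen || !current) {
      current = candidate; // still fits, or first sentence of a new chunk
    } else {
      chunks.push(current); // chunk is full; start a new one
      current = sentence;
    }
  }
  if (current) chunks.push(current);
  return chunks;
}

const SAMPLE_RATE_HZ = 8000; // assumed telephony sample rate
const MULAW_ZERO = 0xff; // mu-law byte for zero amplitude

// Hypothetical stand-in for a silence generator: one byte per sample,
// so 500 ms at 8 kHz is 8000 * 0.5 = 4000 bytes.
function mulawSilence(durationMs: number): Uint8Array {
  const samples = Math.round((SAMPLE_RATE_HZ * durationMs) / 1000);
  return new Uint8Array(samples).fill(MULAW_ZERO);
}
```

Chunking at sentence boundaries lets audio for the first sentence start playing while later sentences are still being synthesized, which is the usual reason for streaming TTS in pieces.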

// Intent service: LLM-based intent classification routed through the Confidence Router.
import { ConfidenceRouter, type RoutingDecision } from "@reaatech/confidence-router";
import { generateResponse, buildSystemPrompt } from "./openrouter-service.js";
import type { AppointmentDetails } from "../lib/types.js";

// Lazily created singleton so all calls share one router instance.
let _router: ConfidenceRouter | null = null;

export function getRouter(): ConfidenceRouter {
  if (!_router) {
    _router = new ConfidenceRouter({
      routeThreshold: 0.8,
      fallbackThreshold: 0.3,
      clarificationEnabled: true,
    });
  }
  return _router;
}

// Drop the cached router (useful between tests).
export function resetRouter(): void {
  _router = null;
}

// Ask the LLM for an intent label and a confidence score.
export async function classifyIntent(
  transcript: string
): Promise<{ intent: string; confidence: number }> {
  const systemPrompt = buildSystemPrompt();
  const result = await generateResponse([
    { role: "system", content: systemPrompt },
    {
      role: "user",
      content: `Classify this caller's intent: "${transcript}". Respond with JSON: {"intent": "<intent>", "confidence": <0-1>}`,
    },
  ]);

  // Parse as untyped JSON and validate each field below.
  let parsed: Record<string, unknown>;
  try {
    parsed = JSON.parse(result.text) as Record<string, unknown>;
  } catch {
    // The model did not return valid JSON.
    return { intent: "unknown", confidence: 0 };
  }

  return {
    intent: typeof parsed.intent === "string" ? parsed.intent : "unknown",
    confidence: typeof parsed.confidence === "number" ? parsed.confidence : 0,
  };
}

// Classify the transcript, then let the router decide what to do with it.
export async function classifyAndRoute(
  transcript: string,
  router?: ConfidenceRouter
): Promise<RoutingDecision> {
  const r = router ?? getRouter();
  const { intent, confidence } = await classifyIntent(transcript);
  const decision = r.decide({
    predictions: [{ label: intent, confidence }],
  });
  return decision;
}

// Ask the LLM to pull booking details out of the transcript.
export async function extractAppointmentDetails(
  transcript: string
): Promise<AppointmentDetails> {
  const systemPrompt = buildSystemPrompt();
  const result = await generateResponse([
    { role: "system", content: systemPrompt },
    {
      role: "user",
      content:
        `Extract appointment details from this request: "${transcript}". ` +
        `Respond with JSON: {"date": "...", "time": "...", "duration": 30, "description": "...", "attendeeName": "...", "attendeeEmail": "...", "attendeePhone": "..."}. Use null for missing fields.`,
    },
  ]);

  let parsed: Record<string, unknown>;
  try {
    parsed = JSON.parse(result.text) as Record<string, unknown>;
  } catch {
    return {};
  }

  // Copy only string fields; null or missing values are skipped.
  // Note: duration and description are requested in the prompt but not copied here.
  const details: AppointmentDetails = {};
  if (typeof parsed.date === "string") details.date = parsed.date;
  if (typeof parsed.time === "string") details.time = parsed.time;
  if (typeof parsed.attendeeName === "string") details.attendeeName = parsed.attendeeName;
  if (typeof parsed.attendeeEmail === "string") details.attendeeEmail = parsed.attendeeEmail;
  if (typeof parsed.attendeePhone === "string") details.attendeePhone = parsed.attendeePhone;
  return details;
}
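
The router above is configured with `routeThreshold: 0.8`, `fallbackThreshold: 0.3`, and `clarificationEnabled: true`. How `@reaatech/confidence-router` turns those values into a decision is not shown here, so the sketch below is one plausible reading and not the library's algorithm: confidence at or above the route threshold routes to the intent, confidence below the fallback threshold falls back, and anything in between asks the caller to clarify when clarification is enabled. `decideSketch`, `Decision`, and `Thresholds` are hypothetical names.

```typescript
// Hypothetical decision shape; the library's RoutingDecision may differ.
type Decision =
  | { action: "route"; intent: string }
  | { action: "clarify"; intent: string }
  | { action: "fallback" };

interface Thresholds {
  routeThreshold: number;
  fallbackThreshold: number;
  clarificationEnabled: boolean;
}

// Assumed three-band policy: route, clarify, or fall back.
function decideSketch(intent: string, confidence: number, t: Thresholds): Decision {
  if (confidence >= t.routeThreshold) {
    return { action: "route", intent }; // confident enough to act
  }
  if (confidence >= t.fallbackThreshold && t.clarificationEnabled) {
    return { action: "clarify", intent }; // plausible, but confirm first
  }
  return { action: "fallback" }; // too uncertain to act on
}
```

Under this reading, a parse failure in `classifyIntent` (which returns `unknown` with confidence 0) would land in the fallback band rather than triggering a booking.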

Full recipe outline

Intro
Prerequisites
Step 1: Scaffold the project and install dependencies
Step 2: Configure environment variables
Step 3: Validate configuration with a Zod schema
Step 4: Define shared TypeScript types
Step 5: Build the Cal.com API client
Step 6: Create the OpenRouter LLM service
Step 7: Build the speech service (Deepgram STT + ElevenLabs TTS)
Step 8: Classify caller intent with the Confidence Router
Step 9: Wire the Cal.com business logic
Step 10: Build the budget service
Step 11: Create the pipeline service
Step 12: Wire the voice route handler
Step 13: Add the health check and Cal.com webhook routes
Step 14: Run the tests
Next steps