Vertex AI Voice Agent for Cal.com Appointment Scheduling

Let customers book appointments on Cal.com over the phone with a voice agent that understands natural language, verifies availability, and confirms bookings.

vertex-ai voice-agent caldotcom appointment-scheduling twilio deepgram guardrails

The problem

Service businesses miss after-hours calls and lose revenue because clients can't schedule appointments when staff are unavailable. Existing IVR systems feel robotic and fail to handle complex scheduling requests.

Intro

In this tutorial, you’ll build a voice-powered appointment booking agent that connects a phone call to Cal.com through a series of AI services. When a caller says something like “I’d like to book an appointment for tomorrow at 2pm,” the agent transcribes their speech, classifies their intent, extracts the booking details, validates the payload, and creates the booking in Cal.com while the caller is still on the line. By the end, you’ll have an Express server that handles Twilio PSTN calls, streams audio to Deepgram for speech-to-text, passes transcripts through a PII guardrail, classifies intent with a confidence router, calls Gemini on Vertex AI for conversational reasoning, and books appointments via Cal.com’s REST API.
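The flow described above can be sketched as one function with injected stages. Everything in this block is illustrative: the `Stages` interface, `handleTurn`, and the result shapes are hypothetical names rather than the recipe's actual modules, and only the create-appointment path is handled.

```typescript
// Hypothetical stage signatures. In the recipe these roles are filled by
// Deepgram (transcribe), the PII guardrail (redactPii), the confidence
// router (classify), Gemini on Vertex AI (extract), and Cal.com (book).
type Intent =
  | "create_appointment"
  | "reschedule_appointment"
  | "cancel_appointment"
  | "unknown";

interface BookingDetails {
  startIso: string; // ISO 8601 start time
  attendeeName: string;
}

interface Stages {
  transcribe(audio: Uint8Array): Promise<string>;
  redactPii(text: string): string;
  classify(text: string): Promise<{ label: Intent; needsClarification: boolean }>;
  extract(text: string): Promise<BookingDetails | null>;
  book(details: BookingDetails): Promise<{ bookingId: string }>;
}

type TurnResult =
  | { kind: "booked"; bookingId: string }
  | { kind: "clarify"; prompt: string }
  | { kind: "unsupported"; intent: Intent };

// Runs one caller utterance through the stages in the order the intro lists.
async function handleTurn(audio: Uint8Array, s: Stages): Promise<TurnResult> {
  const transcript = s.redactPii(await s.transcribe(audio));
  const intent = await s.classify(transcript);
  if (intent.needsClarification || intent.label === "unknown") {
    return { kind: "clarify", prompt: "Sorry, could you say that again?" };
  }
  if (intent.label !== "create_appointment") {
    // Simplification: rescheduling and cancelling are left out of this sketch.
    return { kind: "unsupported", intent: intent.label };
  }
  const details = await s.extract(transcript);
  // Validate the payload before calling the booking API.
  if (!details || Number.isNaN(Date.parse(details.startIso))) {
    return { kind: "clarify", prompt: "What day and time would you like?" };
  }
  const { bookingId } = await s.book(details);
  return { kind: "booked", bookingId };
}
```

Passing the stages in as an object keeps each service replaceable by a stub, which is one way a test suite could exercise the flow without network access.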

Prerequisites

Node.js 22 or later (check with node --version)
pnpm 10.x (npm install -g pnpm@10)
A Google Cloud project with Vertex AI enabled and a service account JSON key
A Twilio account with a voice-capable phone number
A Deepgram account (STT)
A Cartesia account (TTS)
A Cal.com account with OAuth2 developer credentials (client ID, client secret, private key)
Familiarity with TypeScript, Express, and REST APIs

Step 1: Initialize the project

Start with an empty directory and scaffold the project structure. The recipe uses pnpm workspaces and ES modules.

terminal

mkdir vertex-ai-voice-agent && cd vertex-ai-voice-agent
pnpm init

Add the required fields to package.json so Node 22 and ESM are active.
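The exact fields are not listed at this point in the recipe. A common choice, assumed here, is `"type": "module"` to enable ES modules and an `engines.node` range of `">=22"`. The TypeScript sketch below writes those assumed fields down and checks a parsed package.json against them. `PackageFields`, `expectedFields`, and `checkPackageFields` are hypothetical helper names.

```typescript
// Assumed shape: only the two fields relevant to this step.
interface PackageFields {
  type?: string;
  engines?: { node?: string };
}

// What the step most likely asks for (an assumption, not the recipe's exact file).
const expectedFields: PackageFields = {
  type: "module", // makes Node treat .js files as ES modules
  engines: { node: ">=22" }, // documents the minimum Node.js version
};

// Returns human-readable problems; an empty list means the fields look right.
function checkPackageFields(pkg: PackageFields): string[] {
  const problems: string[] = [];
  if (pkg.type !== "module") {
    problems.push('"type" should be "module"');
  }
  const range = pkg.engines?.node ?? "";
  const match = /^>=\s*(\d+)/.exec(range);
  if (!match || Number(match[1]) < 22) {
    problems.push('"engines.node" should require 22 or later, e.g. ">=22"');
  }
  return problems;
}
```

Calling `checkPackageFields` on the parsed contents of package.json gives a quick sanity check before moving on.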

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

260 tests · 98.1% coverage · vitest passing

Step 8: Write the intent router

import {
  DecisionEngine,
  mergeConfig,
} from "@reaatech/confidence-router-core";
import {
  KeywordClassifier,
  ClassifierRegistry,
  type KeywordPattern,
} from "@reaatech/confidence-router-classifiers";
import type { Intent } from "../types.js";

// Labels the router is allowed to return; anything else is filtered out.
const APPOINTMENT_LABELS: Intent[] = [
  "create_appointment",
  "reschedule_appointment",
  "cancel_appointment",
  "unknown",
];

export interface IntentResult {
  label: Intent;
  confidence: number;
  needsClarification: boolean;
}

export class IntentRouter {
  private readonly registry: ClassifierRegistry;
  private readonly engine: DecisionEngine;

  constructor() {
    this.registry = new ClassifierRegistry();

    // Keyword patterns for each supported intent.
    const patterns: KeywordPattern[] = [
      {
        label: "create_appointment",
        keywords: ["book", "schedule", "appointment", "reserve", "set up"],
        weight: 1.0,
      },
      {
        label: "reschedule_appointment",
        keywords: [
          "reschedule",
          "change time",
          "change the appointment",
          "move appointment",
        ],
        weight: 1.0,
      },
      {
        label: "cancel_appointment",
        keywords: ["cancel", "delete", "remove"],
        weight: 1.0,
      },
    ];

    const keywordClassifier = new KeywordClassifier(patterns, {
      name: "appointment-keywords",
      caseSensitive: false,
    });
    this.registry.register(keywordClassifier);

    this.engine = new DecisionEngine(
      mergeConfig({ routeThreshold: 0.8, fallbackThreshold: 0.4, clarificationEnabled: true })
    );
  }

  async classifyIntent(transcription: string): Promise<IntentResult> {
    // Empty or whitespace-only input cannot be classified.
    if (!transcription || transcription.trim().length === 0) {
      return { label: "unknown", confidence: 0, needsClarification: false };
    }

    const result = await this.registry.classify(transcription, "appointment-keywords");
    if (!result) {
      return { label: "unknown", confidence: 0, needsClarification: false };
    }

    // Keep only predictions for known appointment labels.
    const predictions = result.predictions.filter((p) =>
      APPOINTMENT_LABELS.includes(p.label as Intent)
    );
    if (predictions.length === 0) {
      return { label: "unknown", confidence: 0, needsClarification: true };
    }

    const top = predictions[0]!;
    const decision = this.engine.decide({
      predictions: predictions.map((p) => ({ label: p.label, confidence: p.confidence })),
    });

    // Anything short of a confident route is sent back to the caller for clarification.
    if (decision.type === "CLARIFY" || decision.type === "FALLBACK") {
      return { label: "unknown", confidence: top.confidence, needsClarification: true };
    }

    return {
      label: top.label as Intent,
      confidence: top.confidence,
      needsClarification: false,
    };
  }
}
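The router above configures `routeThreshold: 0.8` and `fallbackThreshold: 0.4`, but the library's decision logic is not shown. One plausible reading, sketched below as an assumption, is a three-band split on the top confidence score. The `decide` function here is a hypothetical stand-in, not the library's `DecisionEngine`, and the `"ROUTE"` type name is a guess. Only `"CLARIFY"` and `"FALLBACK"` appear in the code above.

```typescript
// Illustrative stand-in for threshold-based routing; not the library's API.
type DecisionType = "ROUTE" | "CLARIFY" | "FALLBACK";

interface Thresholds {
  routeThreshold: number;
  fallbackThreshold: number;
  clarificationEnabled: boolean;
}

// Same values the IntentRouter passes to mergeConfig.
const thresholds: Thresholds = {
  routeThreshold: 0.8,
  fallbackThreshold: 0.4,
  clarificationEnabled: true,
};

// Assumed banding: high confidence routes, the middle band asks the caller
// to clarify (when enabled), and everything else falls back.
function decide(topConfidence: number, t: Thresholds = thresholds): DecisionType {
  if (topConfidence >= t.routeThreshold) return "ROUTE";
  if (topConfidence >= t.fallbackThreshold && t.clarificationEnabled) return "CLARIFY";
  return "FALLBACK";
}
```

Under this reading, `classifyIntent` returns a concrete intent only in the top band, since it maps both `"CLARIFY"` and `"FALLBACK"` to `needsClarification: true`.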

Step 13: Write the Express server

import express from "express";
import { createServer } from "http";
import { WebSocketServer, WebSocket } from "ws";
import { TwilioService } from "./telephony/twilio.js";

const PORT = Number(process.env.EXPRESS_PORT ?? 3001);
const BASE_URL = process.env.BASE_URL ?? `http://localhost:${PORT}`;

export interface ServerHandle {
  app: express.Application;
  server: ReturnType<typeof createServer>;
  wss: WebSocketServer;
  twilioService: TwilioService;
}

export function createApp(): ServerHandle {
  const app = express();

  // Twilio webhooks arrive form-encoded; JSON is accepted as well.
  app.use(express.urlencoded({ extended: true }));
  app.use(express.json());

  // Minimal request logging.
  app.use((req, _res, next) => {
    console.log(`[${new Date().toISOString()}] ${req.method} ${req.path}`);
    next();
  });

  // HTTP and WebSocket traffic share one server; media streams use /media-stream.
  const server = createServer(app);
  const wss = new WebSocketServer({ server, path: "/media-stream" });
  const twilioService = new TwilioService(BASE_URL);

  // Incoming call webhook: adapt Express req/res to the shape TwilioService expects.
  app.post("/voice/incoming", (req, res) => {
    twilioService.handleIncomingCall(
      {
        headers: req.headers as Record<string, string>,
        body: req.body as Record<string, string>,
      },
      {
        set: (k, v) => res.setHeader(k, v),
        send: (body) => res.send(body),
        status: (code) => ({ send: (body) => res.status(code).send(body) }),
      }
    );
  });

  // Call status callback.
  app.post("/voice/status", (req, res) => {
    const callSid = req.body["CallSid"];
    const callStatus = req.body["CallStatus"];
    console.log(`[Twilio] Call status: ${callSid} -> ${callStatus}`);
    res.status(200).send("OK");
  });

  app.get("/health", (_req, res) => {
    res.json({ status: "ok", timestamp: new Date().toISOString() });
  });

  wss.on("connection", async (ws: WebSocket, req) => {
    // Use the CallSid query parameter if present; otherwise fall back to a timestamp-based ID.
    const url = new URL(req.url ?? "", `http://${req.headers.host}`);
    const callSid = url.searchParams.get("CallSid") ?? `ws-${Date.now()}`;
    console.log(`[WebSocket] Connection for call: ${callSid}`);
    await twilioService.handleWebSocket(ws, callSid);
  });

  // Error handler; registered last so it catches errors from the routes above.
  app.use((err: Error, _req: express.Request, res: express.Response, _next: express.NextFunction) => {
    console.error(`[Error] ${err.message}`);
    res.status(500).json({ error: err.message });
  });

  return { app, server, wss, twilioService };
}

export function startServer(port: number = PORT): Promise<ServerHandle> {
  return new Promise((resolve) => {
    const handle = createApp();
    handle.server.listen(port, () => {
      console.log(`[Server] Express listening on port ${port}`);
      resolve(handle);
    });
  });
}

// Start the server only when this file is run directly, not when it is imported.
if (import.meta.url === `file://${process.argv[1]}`) {
  startServer().catch((err) => {
    console.error("[Server] Fatal:", err);
    process.exit(1);
  });
}
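The WebSocket handler above reads `CallSid` from the connection URL and otherwise falls back to a timestamp-based ID. Twilio's Media Streams documentation says query strings on the stream URL are not supported, and that the call SID is delivered in the first `start` message on the socket. If that applies to your setup, a small parser like the one below could recover the real SID. `parseStartMessage` and `StreamStart` are hypothetical names and are not part of the recipe's `TwilioService`.

```typescript
// Subset of the fields Twilio sends in a Media Streams "start" message.
interface StreamStart {
  callSid: string;
  streamSid: string;
}

// Returns the call and stream SIDs from a raw WebSocket text frame,
// or null if the frame is not a well-formed "start" message.
function parseStartMessage(raw: string): StreamStart | null {
  let msg: unknown;
  try {
    msg = JSON.parse(raw);
  } catch {
    return null; // not JSON
  }
  if (typeof msg !== "object" || msg === null) return null;

  const { event, start } = msg as { event?: unknown; start?: unknown };
  if (event !== "start" || typeof start !== "object" || start === null) return null;

  const { callSid, streamSid } = start as { callSid?: unknown; streamSid?: unknown };
  if (typeof callSid !== "string" || typeof streamSid !== "string") return null;

  return { callSid, streamSid };
}
```

A handler could call this on each incoming text frame until it returns a non-null value, then use `callSid` in place of the timestamp-based fallback.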
