Vertex AI Multi-Agent Handoff for ServiceTitan Dispatch Automation

An AI dispatch mesh that triages field service requests, schedules appointments, and assigns technicians in ServiceTitan via REST, reducing manual coordination.

vertex-ai multi-agent dispatch servicetitan typescript nextjs agent-handoff field-service

The problem

Home service companies on ServiceTitan rely on human dispatchers to juggle calls, schedule jobs, and update statuses; missed tasks and double bookings hurt customer satisfaction and revenue.

Built from

Intro

This tutorial walks you through building a multi-agent dispatch mesh that classifies incoming field service requests, routes them through a confidence gate, and dispatches technicians via the ServiceTitan REST API. You’ll wire up Vertex AI (Gemini) for classification and content generation, use @reaatech/agent-mesh-classifier for intent detection, @reaatech/agent-mesh-confidence for gating, and @reaatech/agent-handoff-routing for agent selection. By the end you’ll have a working Next.js App Router project with two API endpoints and a full test suite.

Prerequisites

Node.js >= 22 and pnpm 10.x installed
A Google Cloud project with the Vertex AI API enabled and a service account key
A ServiceTitan tenant with OAuth2 client credentials (client ID + secret)
A Langfuse account (free tier works) for observability tracing
Basic familiarity with TypeScript and Next.js App Router

Step 1: Scaffold the Next.js project

Create a new Next.js project with the App Router, then install the required dependencies.

terminal

npx create-next-app@latest vertex-ai-dispatch-mesh --typescript --app --src-dir
cd vertex-ai-dispatch-mesh

Now install the REAA mesh packages and supporting libraries. All versions are pinned exactly — no ^ or .

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

207 kB·79 tests·97.4% coverage·vitest passing

SHA-25612cde2533ede58bcef65582afecf8b3129f08f4103f387e0954dde7522b37a09

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js >= 22 and pnpm 10.x installed
A Google Cloud project with the Vertex AI API enabled and a service account key
A ServiceTitan tenant with OAuth2 client credentials (client ID + secret)
A Langfuse account (free tier works) for observability tracing
Basic familiarity with TypeScript and Next.js App Router

Step 1: Scaffold the Next.js project

Create a new Next.js project with the App Router, then install the required dependencies.

terminal

npx create-next-app@latest vertex-ai-dispatch-mesh --typescript --app --src-dir
cd vertex-ai-dispatch-mesh

Now install the REAA mesh packages and supporting libraries. All versions are pinned exactly — no ^ or .

import { IncomingRequestSchema, type IncomingRequest, AgentResponseSchema, type AgentResponse, ClassifierOutputSchema, type ClassifierOutput, AgentConfigSchema, type AgentConfig, ConfidenceDecisionSchema, type ConfidenceDecision, ContextPacketSchema, type ContextPacket, } from "@reaatech/agent-mesh"; import { createHandoffConfig, type HandoffPayload, type AgentCapabilities, type HandoffConfig, type HandoffResult, HandoffError, type RoutingDecision } from "@reaatech/agent-handoff"; import { z } from "zod"; // Re-export everything export { IncomingRequestSchema, type IncomingRequest, AgentResponseSchema, type AgentResponse, ClassifierOutputSchema, type ClassifierOutput, AgentConfigSchema, type AgentConfig, ConfidenceDecisionSchema, type ConfidenceDecision, ContextPacketSchema, type ContextPacket }; export { createHandoffConfig, type HandoffPayload, type AgentCapabilities, type HandoffConfig, type HandoffResult, HandoffError, type RoutingDecision }; // Local types export interface ServiceTitanConfig { tenantId: string; clientId: string; clientSecret: string; baseUrl: string; } export const ServiceRequestSchema = z.object({ id: z.string(), customerName: z.string(), issueDescription: z.string(), priority: z.enum(["low", "medium", "high", "urgent"]), source: z.enum(["web", "sms", "voice"]), timestamp: z.string(), }); export type ServiceRequest = z.infer<typeof ServiceRequestSchema>; export interface Technician { id: string; name: string; skills: string[]; certifications: string[]; currentLoad: number; maxLoad: number; availability: "available" | "busy" | "offline"; } export interface AppointmentSlot { slotId: string; technicianId: string; startTime: string; endTime: string; status: "open" | "booked"; } export interface DispatchResult { status: "scheduled" | "pending" | "failed"; jobId?: string; technicianId?: string; scheduledTime?: string; reason?: string; } export interface DispatchAction { action: "schedule" | "reschedule" | "cancel" | "status"; serviceRequestId: string; technicianId?: string; scheduledTime?: string; }

import { VertexAI, type GenerativeModel } from "@google-cloud/vertexai"; import pRetry from "p-retry"; import { isRateLimitError } from "@reaatech/agent-mesh-classifier"; import { recipeEnv } from "./env"; let vertexAI: VertexAI | null = null; let cachedModel: GenerativeModel | null = null; function getVertexAI(): VertexAI { if (!vertexAI) { vertexAI = new VertexAI({ project: recipeEnv.GOOGLE_CLOUD_PROJECT, location: recipeEnv.GOOGLE_CLOUD_LOCATION, }); } return vertexAI; } export function getGenerativeModel(modelName?: string): GenerativeModel { const model = modelName ?? "gemini-2.5-flash"; if (model !== "gemini-2.5-flash" || !cachedModel) { const ai = getVertexAI(); const genModel = ai.getGenerativeModel({ model }); if (model === "gemini-2.5-flash") { cachedModel = genModel; } return genModel; } return cachedModel; } function extractContentText(result: { response: { candidates: Array<{ content?: { parts?: Array<{ text?: string }> } }> } }): string { const candidates = result.response.candidates; if (candidates.length === 0) return ""; const first = candidates[0]; return first.content?.parts?.[0]?.text ?? ""; } export async function generateContent(prompt: string, model?: string): Promise<string> { return pRetry(async () => { const genModel = getGenerativeModel(model); const result = await genModel.generateContent({ contents: [{ role: "user", parts: [{ text: prompt }] }], }); return extractContentText(result as Parameters<typeof extractContentText>[0]); }, { retries: 3, shouldRetry: (err) => isRateLimitError(err) }); } export async function generateContentStream(prompt: string, model?: string): Promise<AsyncIterable<string>> { return pRetry(async () => { const genModel = getGenerativeModel(model); const streamingResult = await genModel.generateContentStream({ contents: [{ role: "user", parts: [{ text: prompt }] }], }); return { [Symbol.asyncIterator]: async function* () { for await (const chunk of streamingResult.stream) { const text = chunk.candidates?.[0]?.content?.parts?.[0]?.text; if (text) yield text; } }, }; }, { retries: 3, shouldRetry: (err) => isRateLimitError(err) }); } export async function countTokens(text: string): Promise<{ totalTokens: number }> { return pRetry(async () => { const genModel = getGenerativeModel(); const response = await genModel.countTokens({ contents: [{ role: "user", parts: [{ text }] }], }); return { totalTokens: response.totalTokens }; }, { retries: 3, shouldRetry: (err) => isRateLimitError(err) }); }

import { classifierService, detectLanguage, isRateLimitError } from "@reaatech/agent-mesh-classifier"; import { AgentConfigSchema, type AgentConfig, type ClassifierOutput } from "@reaatech/agent-mesh"; export function buildAgentRegistry(): AgentConfig[] { const intake: AgentConfig = AgentConfigSchema.parse({ agent_id: "intake", display_name: "Intake Agent", description: "Classifies incoming service requests and triages dispatch needs", endpoint: "/api/webhook", confidence_threshold: 0.6, examples: ["I need a plumber", "My AC is broken"], clarification_required: true, }); const dispatch: AgentConfig = AgentConfigSchema.parse({ agent_id: "dispatch", display_name: "Dispatch Agent", description: "Schedules appointments and assigns technicians in ServiceTitan", endpoint: "/api/dispatch", confidence_threshold: 0.7, examples: ["Schedule a technician for Monday", "When can you come?"], clarification_required: false, }); return [intake, dispatch]; } export async function processIncomingRequest(rawInput: string): Promise<ClassifierOutput> { const language = detectLanguage(rawInput); const registry = buildAgentRegistry(); try { const result = await classifierService.classify(rawInput, registry, language); return result; } catch (error) { if (isRateLimitError(error)) { console.warn("Classifier rate-limited, retrying once"); try { const retryResult = await classifierService.classify(rawInput, registry, language); return retryResult; } catch (retryError) { if (isRateLimitError(retryError)) { console.warn("Classifier rate-limited on retry, returning fallback classification"); const fallback: ClassifierOutput = { agent_id: "intake", confidence: 0.3, ambiguous: true, detected_language: language, intent_summary: "Fallback due to rate-limit exhaustion", entities: {}, }; return fallback; } throw retryError; } } throw error; } }

import { ServiceTitanClient } from "../integration/servicetitan"; import type { Technician, AppointmentSlot, DispatchAction, DispatchResult } from "../types"; export async function findAvailableTechnicians(date: string, client: ServiceTitanClient): Promise<Technician[]> { const all = await client.getTechnicians(); return all.filter((t) => t.availability !== "offline" && t.currentLoad < t.maxLoad); } export async function findBestSlot(technicians: Technician[], client: ServiceTitanClient, date: string): Promise<AppointmentSlot | null> { const slots: AppointmentSlot[] = []; for (const tech of technicians) { const techSlots = await client.getAppointmentSlots(tech.id, date); slots.push(...techSlots); } slots.sort((a, b) => new Date(a.startTime).getTime() - new Date(b.startTime).getTime()); return slots.find((s) => s.status === "open") ?? null; } export async function validateSchedulingConstraints(technicianId: string, slotTime: string, client: ServiceTitanClient): Promise<boolean> { try { const existing = await client.getAppointmentSlots(technicianId, slotTime.split("T")[0]); const conflict = existing.some( (s) => s.status === "booked" && new Date(s.startTime).getTime() <= new Date(slotTime).getTime() && new Date(s.endTime).getTime() > new Date(slotTime).getTime() ); return !conflict; } catch { return false; } } export async function executeDispatchAction(action: DispatchAction, client: ServiceTitanClient): Promise<DispatchResult> { if (action.action === "status") { const job = await client.getJobStatus(action.serviceRequestId); return { status: "scheduled", jobId: job.jobId, technicianId: job.technicianId, scheduledTime: undefined }; } try { const available = await findAvailableTechnicians(action.scheduledTime ?? new Date().toISOString(), client); if (available.length === 0) return { status: "failed", reason: "No available technicians" }; const slot = await findBestSlot(available, client, action.scheduledTime ?? new Date().toISOString()); if (!slot) return { status: "failed", reason: "No open appointment slots" }; const valid = await validateSchedulingConstraints(slot.technicianId, slot.startTime, client); if (!valid) return { status: "failed", reason: "Scheduling constraint conflict" }; const result = await client.scheduleJob(action.serviceRequestId, slot.technicianId, slot.startTime); return { status: "scheduled", jobId: result.jobId, technicianId: slot.technicianId, scheduledTime: slot.startTime }; } catch (error) { return { status: "failed", reason: error instanceof Error ? error.message : "Unknown error" }; } }

Vertex AI Multi-Agent Handoff for ServiceTitan Dispatch Automation

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the Next.js project

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the Next.js project

Step 2: Configure environment variables

Step 3: Define shared types and schemas

Step 4: Create the Vertex AI LLM adapter

Step 5: Build the ServiceTitan REST client

Step 6: Create the intake agent

Step 7: Create the dispatch agent

Step 8: Add Langfuse observability

Step 9: Wire the mesh orchestrator

Step 10: Create API route handlers and the home page

Step 11: Wire the module entry point

Step 12: Run the test suite

Step 13: Run type check and lint

Next steps