Vertex AI Multi-Agent Handoff for SMB Field Service Dispatch

Route incoming field service requests to specialist AI agents for scheduling, inventory, and billing, with confidence-based human fallback and spend-aware model selection on Vertex AI.

vertex-ai multi-agent handoff field-service dispatch typescript nextjs langchain gemini confidence-router

The problem

SMB field service dispatchers juggle multiple systems (booking, parts lookup, invoicing) and often lose context when transferring a customer. A single generic chatbot cannot handle domain-specific logic, leading to misrouted requests and missed up-sells.

Built from

Intro

This tutorial walks you through building a multi-agent handoff system for SMB field service dispatch on Vertex AI. You’ll create a Next.js application that routes incoming customer requests to specialist AI agents for scheduling, inventory, and billing, with confidence-based human fallback and spend-aware model selection. The dispatch orchestrator uses the REAA handoff protocol to transfer conversations between agents, preserves session history across handoffs via session continuity, and caps per-agent spend with budget-aware model downgrades. By the end, you’ll have a working dispatch API backed by Gemini models on Vertex AI.

Prerequisites

Node.js 22+ and pnpm 10 installed.
A Google Cloud project with the Vertex AI API enabled. You need your project ID and a default location (usually us-central1).
A Langfuse account (optional) for LLM observability tracing.
Basic familiarity with TypeScript, Next.js App Router, and REST APIs.

Step 1: Scaffold the project and install dependencies

Create a new Next.js project with the App Router and install all required dependencies. The scaffold provides the correct tsconfig.json, next.config.ts, vitest.config.ts, and linting configs — you won’t touch root configs.

terminal

npx create-next-app@latest vertex-ai-multi-agent-handoff --typescript --app --use-pnpm
cd vertex-ai-multi-agent-handoff

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

206 kB·173 tests·98.5% coverage·vitest passing

SHA-256b33b6700b288224ff7d19ac0cf183188d1193856d4743947f8654bab7d3074ee

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js 22+ and pnpm 10 installed.
A Google Cloud project with the Vertex AI API enabled. You need your project ID and a default location (usually us-central1).
A Langfuse account (optional) for LLM observability tracing.
Basic familiarity with TypeScript, Next.js App Router, and REST APIs.

Step 1: Scaffold the project and install dependencies

terminal

npx create-next-app@latest vertex-ai-multi-agent-handoff --typescript --app --use-pnpm
cd vertex-ai-multi-agent-handoff

import { BudgetController } from "@reaatech/agent-budget-engine"; import { createSpendStore } from "../lib/spend-store.js"; import type { AgentType } from "../lib/types.js"; import type { BudgetScope } from "@reaatech/agent-budget-types"; const AGENT_SCOPE = "agent" as const; export class BudgetExceededError extends Error { constructor(message: string) { super(message); this.name = "BudgetExceededError"; } } export class BudgetService { private controller: BudgetController; private constructor(controller: BudgetController) { this.controller = controller; } static async create(): Promise<BudgetService> { const store = createSpendStore(); const controller = Reflect.construct(BudgetController, [{ spendTracker: store }]) as BudgetController; const budgets: Array<{ agent: AgentType; limit: number }> = [ { agent: "scheduling", limit: 5.0 }, { agent: "inventory", limit: 3.0 }, { agent: "billing", limit: 7.0 }, ]; for (const { agent, limit } of budgets) { controller.defineBudget({ scopeType: AGENT_SCOPE as BudgetScope, scopeKey: agent, limit, policy: { softCap: 0.8, hardCap: 1.0, autoDowngrade: [{ from: ["gemini-2.5-pro"], to: "gemini-2.5-flash" }], }, }); await Promise.resolve(); } controller.on("threshold-breach", (event: unknown) => console.warn("BudgetService: threshold-breach", event)); controller.on("hard-stop", (event: unknown) => console.error("BudgetService: hard-stop", event)); return new BudgetService(controller); } check(agentType: AgentType, estimatedCost: number, modelId: string, tools?: string[]): Promise<{ allowed: boolean; suggestedModel?: string }> { const result = this.controller.check({ scopeType: AGENT_SCOPE as BudgetScope, scopeKey: agentType, estimatedCost, modelId, tools: tools ?? [], }); if (!result.allowed) { return Promise.reject(new BudgetExceededError(`Budget exceeded for ${agentType}: estimated cost ${String(estimatedCost)}`)); } return Promise.resolve({ allowed: result.allowed, suggestedModel: result.suggestedModel }); } record(agentType: AgentType, requestId: string, cost: number, inputTokens: number, outputTokens: number, modelId: string): Promise<void> { this.controller.record({ requestId, scopeType: AGENT_SCOPE as BudgetScope, scopeKey: agentType, cost, inputTokens, outputTokens, modelId, provider: "google-vertex", timestamp: new Date() }); return Promise.resolve(); } getState(agentType: AgentType): Promise<{ spent: number; remaining: number; state: string }> { const state = this.controller.getState(AGENT_SCOPE as BudgetScope, agentType); return Promise.resolve({ spent: state?.spent ?? 0, remaining: state?.remaining ?? 0, state: state?.state ?? "Active" }); } }

import { TypedEventEmitter, withRetry, pickDefined, HandoffError } from "@reaatech/agent-handoff"; import type { AgentType, HandoffEvent, AgentCapability } from "../lib/types.js"; export class HandoffService { private emitter: TypedEventEmitter<{ handoff_started: HandoffEvent; handoff_completed: HandoffEvent; handoff_failed: HandoffEvent; }>; private capabilities: AgentCapability[]; constructor() { this.emitter = new TypedEventEmitter<{ handoff_started: HandoffEvent; handoff_completed: HandoffEvent; handoff_failed: HandoffEvent; }>(); this.capabilities = [ { agentType: "scheduling", label: "Scheduling Agent", skills: ["booking", "calendar"], domains: ["field-service", "appointment"] }, { agentType: "inventory", label: "Inventory Agent", skills: ["parts", "stock"], domains: ["field-service", "warehouse"] }, { agentType: "billing", label: "Billing Agent", skills: ["invoice", "payment"], domains: ["field-service", "finance"] }, ]; } async handoffToAgent( sessionId: string, fromAgent: AgentType | null, toAgent: AgentType, context?: Record<string, unknown>, ): Promise<{ success: boolean; sessionId: string; toAgent: AgentType }> { const targetCapability = this.capabilities.find((c) => c.agentType === toAgent); if (!targetCapability) throw new HandoffError(`Agent not registered: ${toAgent}`, "routing_error"); const now = new Date().toISOString(); this.emitter.emit("handoff_started", { type: "handoff_started", sessionId, fromAgent, toAgent, timestamp: now }); try { await withRetry( () => Promise.resolve(pickDefined({ sessionId, fromAgent, toAgent, ...(context ?? {}) })), { maxRetries: 3, backoff: "exponential", baseDelayMs: 100, maxDelayMs: 5000, shouldRetry: () => true }, ); this.emitter.emit("handoff_completed", { type: "handoff_completed", sessionId, fromAgent, toAgent, timestamp: new Date().toISOString() }); return { success: true, sessionId, toAgent }; } catch (error) { this.emitter.emit("handoff_failed", { type: "handoff_failed", sessionId, fromAgent, toAgent, timestamp: new Date().toISOString(), metadata: { error: error instanceof Error ? error.message : String(error) }, }); if (error instanceof HandoffError) throw error; throw new HandoffError(`Handoff failed after retries: ${error instanceof Error ? error.message : String(error)}`, "routing_error"); } } getAvailableAgents(): AgentCapability[] { return this.capabilities; } on(event: "handoff_started" | "handoff_completed" | "handoff_failed", handler: (event: HandoffEvent) => void): void { (this.emitter as { on: (e: string, h: (event: HandoffEvent) => void) => void }).on(event, handler); } }

import { CallbackHandler } from "langfuse-langchain"; import type { HandoffEvent, DispatchRequest, AgentType } from "../lib/types.js"; export class WebhookLoggerService { private webhookUrl: string | undefined; private langfusePublicKey: string | undefined; private langfuseSecretKey: string | undefined; private langfuseHandler: CallbackHandler | undefined; constructor() { this.webhookUrl = process.env.DISPATCH_WEBHOOK_URL; this.langfusePublicKey = process.env.LANGFUSE_PUBLIC_KEY; this.langfuseSecretKey = process.env.LANGFUSE_SECRET_KEY; if (this.langfusePublicKey && this.langfuseSecretKey) { try { this.langfuseHandler = new CallbackHandler({ publicKey: this.langfusePublicKey, secretKey: this.langfuseSecretKey, }); } catch { console.warn("WebhookLogger: Failed to initialize Langfuse CallbackHandler"); } } } async logHandoffEvent(event: HandoffEvent): Promise<void> { if (this.langfuseHandler) { this.langfuseHandler.langfuse.trace({ name: "handoff", input: event }); } if (!this.webhookUrl) return; try { const response = await fetch(this.webhookUrl, { method: "POST", headers: { "Content-Type": "application/json" }, body: JSON.stringify(event), signal: AbortSignal.timeout(5000), }); if (!response.ok) console.warn(`WebhookLogger: handoff event POST returned ${String(response.status)}`); } catch (error) { console.warn("WebhookLogger: failed to log handoff event", error instanceof Error ? error.message : String(error)); } } async logDispatchRequest(request: DispatchRequest, decision: { type: string; target?: AgentType }): Promise<void> { if (this.langfuseHandler) { this.langfuseHandler.langfuse.trace({ name: "dispatch_request", input: { request, decision } }); } if (!this.webhookUrl) return; try { const payload = { event: "dispatch_request", sessionId: request.sessionId, data: { request, decision }, timestamp: new Date().toISOString(), }; const response = await fetch(this.webhookUrl, { method: "POST", headers: { "Content-Type": "application/json" }, body: JSON.stringify(payload), signal: AbortSignal.timeout(5000), }); if (!response.ok) console.warn(`WebhookLogger: dispatch request POST returned ${String(response.status)}`); } catch (error) { console.warn("WebhookLogger: failed to log dispatch request", error instanceof Error ? error.message : String(error)); } } }

import { type NextRequest, NextResponse } from "next/server"; import { DispatchRequestSchema } from "../../../src/lib/types.js"; import { ConfidenceClassifier } from "../../../src/services/confidence-classifier.js"; import { ModelRouterService } from "../../../src/services/model-router-service.js"; import { BudgetService } from "../../../src/services/budget-service.js"; import { SessionService } from "../../../src/services/session-service.js"; import { HandoffService } from "../../../src/services/handoff-service.js"; import { WebhookLoggerService } from "../../../src/services/webhook-logger.js"; import { DispatchService } from "../../../src/services/dispatch-service.js"; import { createSessionStorage } from "../../../src/lib/session-storage.js"; import { createSimpleTokenizer } from "../../../src/lib/token-counter.js"; import { createVertexClient } from "../../../src/lib/vertex-client.js"; const sessionStorage = createSessionStorage(); const tokenizer = createSimpleTokenizer(); const vertexClient = createVertexClient(); const confidenceClassifier = new ConfidenceClassifier(); const modelRouterService = new ModelRouterService(); const budgetService = await BudgetService.create(); const sessionService = new SessionService(sessionStorage, tokenizer); const handoffService = new HandoffService(); const webhookLoggerService = new WebhookLoggerService(); const dispatchService = new DispatchService( confidenceClassifier, modelRouterService, budgetService, sessionService, handoffService, webhookLoggerService, vertexClient, ); export async function POST(req: NextRequest) { try { const body = await req.json() as Record<string, unknown>; const parsed = DispatchRequestSchema.safeParse(body); if (!parsed.success) { return NextResponse.json({ error: "Invalid request", details: parsed.error.issues }, { status: 400 }); } const result = await dispatchService.processDispatch(parsed.data); return NextResponse.json(result); } catch (err) { return NextResponse.json({ error: err instanceof Error ? err.message : "Unknown error" }, { status: 500 }); } } export function GET() { return NextResponse.json({ status: "dispatch API ready" }); }

Vertex AI Multi-Agent Handoff for SMB Field Service Dispatch

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Step 2: Configure environment variables

Step 3: Define the core types

Step 4: Create the in-memory adapters for spend tracking and session storage

Spend store (`src/lib/spend-store.ts`)

Session storage (`src/lib/session-storage.ts`)

Token counter (`src/lib/token-counter.ts`)

Step 5: Build the Vertex AI client

Model executor for LLM Router

Step 6: Configure the handoff protocol

Step 7: Build the service layer

Confidence Classifier (`src/services/confidence-classifier.ts`)

Model Router Service (`src/services/model-router-service.ts`)

Budget Service (`src/services/budget-service.ts`)

Session Service (`src/services/session-service.ts`)

Handoff Service (`src/services/handoff-service.ts`)

Webhook Logger (`src/services/webhook-logger.ts`)

Step 8: Build the dispatch orchestrator

Step 9: Create the Next.js API routes

Dispatch route (`app/api/dispatch/route.ts`)

Health route (`app/api/health/route.ts`)

Step 10: Create the LangGraph state machine

Step 11: Create the Express webhook server and instrumentation

Express server (`src/server.ts`)

Instrumentation (`src/instrumentation.ts`)

Init module (`src/init.ts`)

Enable instrumentation in `next.config.ts`

Step 12: Run the tests

Next steps

Vertex AI Multi-Agent Handoff for SMB Field Service Dispatch

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Example artifact

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Step 2: Configure environment variables

Step 3: Define the core types

Step 4: Create the in-memory adapters for spend tracking and session storage

Spend store (src/lib/spend-store.ts)

Session storage (src/lib/session-storage.ts)

Token counter (src/lib/token-counter.ts)

Step 5: Build the Vertex AI client

Model executor for LLM Router

Step 6: Configure the handoff protocol

Step 7: Build the service layer

Confidence Classifier (src/services/confidence-classifier.ts)

Model Router Service (src/services/model-router-service.ts)

Budget Service (src/services/budget-service.ts)

Session Service (src/services/session-service.ts)

Handoff Service (src/services/handoff-service.ts)

Webhook Logger (src/services/webhook-logger.ts)

Step 8: Build the dispatch orchestrator

Step 9: Create the Next.js API routes

Dispatch route (app/api/dispatch/route.ts)

Health route (app/api/health/route.ts)

Step 10: Create the LangGraph state machine

Step 11: Create the Express webhook server and instrumentation

Express server (src/server.ts)

Instrumentation (src/instrumentation.ts)

Init module (src/init.ts)

Enable instrumentation in next.config.ts

Step 12: Run the tests

Next steps

Spend store (`src/lib/spend-store.ts`)

Session storage (`src/lib/session-storage.ts`)

Token counter (`src/lib/token-counter.ts`)

Confidence Classifier (`src/services/confidence-classifier.ts`)

Model Router Service (`src/services/model-router-service.ts`)

Budget Service (`src/services/budget-service.ts`)

Session Service (`src/services/session-service.ts`)

Handoff Service (`src/services/handoff-service.ts`)

Webhook Logger (`src/services/webhook-logger.ts`)

Dispatch route (`app/api/dispatch/route.ts`)

Health route (`app/api/health/route.ts`)

Express server (`src/server.ts`)

Instrumentation (`src/instrumentation.ts`)

Init module (`src/init.ts`)

Enable instrumentation in `next.config.ts`