Vertex AI Voice Agent for Buildium Maintenance Requests

A 24/7 voice receptionist that authenticates tenants, creates work orders in Buildium, and provides real‑time status updates — so property managers never miss a maintenance call.

vertex-ai voice-agent buildium property-management twilio nextjs hono typescript

The problem

Property management companies using Buildium lose after‑hours calls from tenants reporting leaks or HVAC failures. Manual entry into Buildium is slow, and missed requests lead to unhappy tenants and damage escalation.

Intro

In this tutorial you’ll build a 24/7 voice receptionist that answers Twilio calls, authenticates tenants against Buildium’s REST API, and uses Vertex AI Gemini to create or update maintenance work orders. When a tenant calls after hours to report a leak, the voice agent handles the entire interaction — transcribing speech with Deepgram Nova-2, reasoning with Gemini 2.5 Flash, and responding with Cartesia Sonic TTS.

You’ll use Next.js 16 (App Router), TypeScript, and the @reaatech/voice-agent-core pipeline to wire up eight cloud services.

Prerequisites

Node.js 22+ and pnpm 10
A Twilio account with a phone number that has voice capabilities
A Google Cloud project with the Vertex AI API enabled and a service account key
A Deepgram API key (Nova-2 model)
A Cartesia API key (Sonic TTS model)
A Buildium account with API credentials (client ID and secret)
An Upstash Redis database (URL and token)
A Langfuse account (optional, for observability)
Basic familiarity with TypeScript, Next.js App Router, and WebSocket concepts

Step 1: Scaffold the project and configure environment variables

This step sets up the Next.js project and its dependencies. The scaffold agent has already created the project shell for you, so start by examining package.json and .env.example to see what is wired up.

The .env.example file lists every environment variable the application reads:
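As a minimal sketch of how those variables might be checked at startup: only the three TWILIO_* names below appear in this recipe's code, so treat the list as a starting point. The helpers findMissingEnv and assertEnv are hypothetical names, not part of the scaffold.

```typescript
// Names taken from the Twilio client code in this recipe; extend the list
// with whatever else your .env.example declares.
const REQUIRED_ENV = [
  "TWILIO_ACCOUNT_SID",
  "TWILIO_AUTH_TOKEN",
  "TWILIO_PHONE_NUMBER",
] as const;

type RequiredEnvName = (typeof REQUIRED_ENV)[number];
type Env = Record<string, string | undefined>;

// Hypothetical helper: returns the required names that are unset or blank.
function findMissingEnv(env: Env): RequiredEnvName[] {
  return REQUIRED_ENV.filter((name) => (env[name] ?? "").trim() === "");
}

// Hypothetical helper: throws a single error naming every missing variable.
function assertEnv(env: Env): void {
  const missing = findMissingEnv(env);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
}
```

Calling assertEnv(process.env) once at startup would surface a missing credential immediately, rather than as an empty-string fallback inside a client constructor.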

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

209 kB · 162 tests · 96.8% coverage · vitest passing

SHA-256: a04fd3e5ed0ca02df1930c96f566aeb704f84f5d23d98f5968d9f3c101525620

Step 3: Build the Twilio client

import twilio from "twilio";
import { createTwilioHandler } from "@reaatech/voice-agent-telephony";

const { RestException } = twilio;
export { RestException };

let twilioHandlerInstance: ReturnType<typeof createTwilioHandler> | null = null;

// Lazily create a single shared Media Streams handler with barge-in enabled.
export function getTwilioStreamHandler() {
  if (!twilioHandlerInstance) {
    twilioHandlerInstance = createTwilioHandler({
      bargeInEnabled: true,
      minSpeechDuration: 300,
      confidenceThreshold: 0.7,
      silenceThreshold: 0.3,
    });
  }
  return twilioHandlerInstance;
}

// Application-level error that wraps Twilio's RestException.
export class TwilioAppError extends Error {
  constructor(
    public readonly code: number,
    message: string,
    public readonly moreInfo?: string,
  ) {
    super(message);
    this.name = "TwilioAppError";
  }
}

export interface TwilioAppClient {
  lookupPhoneNumber(phone: string): Promise<{ countryCode: string; nationalFormat: string }>;
  sendSms(to: string, body: string): Promise<string>;
}

class TwilioAppClientImpl implements TwilioAppClient {
  private client: ReturnType<typeof twilio>;

  constructor() {
    this.client = twilio(
      process.env.TWILIO_ACCOUNT_SID ?? "",
      process.env.TWILIO_AUTH_TOKEN ?? "",
    );
  }

  async lookupPhoneNumber(phone: string): Promise<{ countryCode: string; nationalFormat: string }> {
    try {
      const result = await this.client.lookups.v1.phoneNumbers(phone).fetch();
      return {
        countryCode: result.countryCode,
        nationalFormat: result.nationalFormat,
      };
    } catch (err: unknown) {
      if (err instanceof RestException) {
        throw new TwilioAppError(err.status, err.message, err.moreInfo);
      }
      throw err;
    }
  }

  async sendSms(to: string, body: string): Promise<string> {
    try {
      const message = await this.client.messages.create({
        to,
        from: process.env.TWILIO_PHONE_NUMBER ?? "",
        body,
      });
      return message.sid;
    } catch (err: unknown) {
      if (err instanceof RestException) {
        throw new TwilioAppError(err.status, err.message, err.moreInfo);
      }
      throw err;
    }
  }
}

let instance: TwilioAppClient | null = null;

// Return the process-wide Twilio client, creating it on first use.
export function getTwilioClient(): TwilioAppClient {
  if (!instance) {
    instance = new TwilioAppClientImpl();
  }
  return instance;
}
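To see how calling code might rely on TwilioAppClient and TwilioAppError without reaching Twilio, here is a minimal sketch. FakeTwilioClient and sendConfirmation are hypothetical names that are not part of the recipe, and the interface and error class are copied locally so the block runs on its own.

```typescript
// Local copies of the shapes from the Twilio client, so the sketch is self-contained.
class TwilioAppError extends Error {
  constructor(
    public readonly code: number,
    message: string,
    public readonly moreInfo?: string,
  ) {
    super(message);
    this.name = "TwilioAppError";
  }
}

interface TwilioAppClient {
  lookupPhoneNumber(phone: string): Promise<{ countryCode: string; nationalFormat: string }>;
  sendSms(to: string, body: string): Promise<string>;
}

// Hypothetical in-memory stand-in; it never contacts Twilio.
class FakeTwilioClient implements TwilioAppClient {
  readonly sent: Array<{ to: string; body: string }> = [];

  async lookupPhoneNumber(phone: string) {
    if (!phone.startsWith("+")) {
      throw new TwilioAppError(404, `Phone number not found: ${phone}`);
    }
    return { countryCode: "US", nationalFormat: phone };
  }

  async sendSms(to: string, body: string) {
    this.sent.push({ to, body });
    return `SM-fake-${this.sent.length}`;
  }
}

// Hypothetical caller: confirm a work order by SMS and turn Twilio failures
// into a result value instead of an unhandled exception.
async function sendConfirmation(
  client: TwilioAppClient,
  phone: string,
  workOrderId: string,
): Promise<{ ok: true; sid: string } | { ok: false; reason: string }> {
  try {
    await client.lookupPhoneNumber(phone);
    const sid = await client.sendSms(phone, `Work order ${workOrderId} received.`);
    return { ok: true, sid };
  } catch (err: unknown) {
    if (err instanceof TwilioAppError) {
      return { ok: false, reason: `Twilio error ${err.code}: ${err.message}` };
    }
    throw err; // Anything else is unexpected and should propagate.
  }
}
```

Because callers depend on the TwilioAppClient interface rather than the concrete class, a test can pass in a stand-in like this one and leave getTwilioClient() untouched.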

Step 6: Build the cost telemetry service

import { generateId, calculateCostFromTokens, CostSpanSchema, loadConfig } from "@reaatech/llm-cost-telemetry";
import type { CostSpan } from "@reaatech/llm-cost-telemetry";

export interface CostTelemetryService {
  recordLlmCost(params: {
    sessionId: string;
    model: string;
    inputTokens: number;
    outputTokens: number;
    pricePerMillionInput: number;
    pricePerMillionOutput: number;
  }): CostSpan;
  recordSttCost(sessionId: string, durationMs: number): CostSpan;
  recordTtsCost(sessionId: string, charCount: number): CostSpan;
  getSessionTotal(costSpans: CostSpan[]): number;
  recordTurnCost(sessionId: string, metrics?: Record<string, unknown>): void;
}

export function createCostTelemetry(): CostTelemetryService {
  // LLM cost: input and output tokens are priced separately.
  function recordLlmCost(params: {
    sessionId: string;
    model: string;
    inputTokens: number;
    outputTokens: number;
    pricePerMillionInput: number;
    pricePerMillionOutput: number;
  }): CostSpan {
    const inputCost = calculateCostFromTokens(params.inputTokens, params.pricePerMillionInput);
    const outputCost = calculateCostFromTokens(params.outputTokens, params.pricePerMillionOutput);
    const span = {
      id: generateId(),
      provider: "google-vertex-ai" as const,
      model: params.model,
      inputTokens: params.inputTokens,
      outputTokens: params.outputTokens,
      costUsd: inputCost + outputCost,
      tenant: params.sessionId,
      feature: "voice-agent",
      timestamp: new Date(),
    };
    return CostSpanSchema.parse(span);
  }

  // STT cost: billed per minute of audio.
  function recordSttCost(sessionId: string, durationMs: number): CostSpan {
    const minutes = durationMs / 60000;
    const costUsd = minutes * 0.0059;
    const span = {
      id: generateId(),
      provider: "deepgram" as const,
      model: "nova-2",
      inputTokens: Math.round(durationMs / 100),
      outputTokens: 0,
      costUsd,
      tenant: sessionId,
      feature: "voice-agent",
      timestamp: new Date(),
    };
    return CostSpanSchema.parse(span);
  }

  // TTS cost: billed per synthesized character.
  function recordTtsCost(sessionId: string, charCount: number): CostSpan {
    const costUsd = charCount * 0.000015;
    const span = {
      id: generateId(),
      provider: "cartesia" as const,
      model: "sonic",
      inputTokens: 0,
      outputTokens: charCount,
      costUsd,
      tenant: sessionId,
      feature: "voice-agent",
      timestamp: new Date(),
    };
    return CostSpanSchema.parse(span);
  }

  function getSessionTotal(costSpans: CostSpan[]): number {
    return costSpans.reduce((sum, s) => sum + s.costUsd, 0);
  }

  function recordTurnCost(_sessionId: string, _metrics?: Record<string, unknown>): void {
    // Record keeping for turn-level cost
  }

  return { recordLlmCost, recordSttCost, recordTtsCost, getSessionTotal, recordTurnCost };
}

export function readBudgetConfig() {
  return loadConfig();
}
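The per-call arithmetic behind those spans can be sketched without the telemetry package. The STT and TTS rates are the constants from the service code. The tokenCost formula is an assumption about what calculateCostFromTokens does (tokens divided by one million, times the price), and the LLM prices in the usage note are placeholders rather than published rates.

```typescript
// Rates copied from the cost telemetry service in this recipe.
const STT_USD_PER_MINUTE = 0.0059;
const TTS_USD_PER_CHAR = 0.000015;

// Assumption: per-million-token pricing, i.e. tokens / 1e6 * price.
function tokenCost(tokens: number, pricePerMillion: number): number {
  return (tokens / 1e6) * pricePerMillion;
}

function sttCost(durationMs: number): number {
  return (durationMs / 60000) * STT_USD_PER_MINUTE;
}

function ttsCost(charCount: number): number {
  return charCount * TTS_USD_PER_CHAR;
}

// Hypothetical shape describing one call's usage.
interface CallUsage {
  audioMs: number;
  ttsChars: number;
  inputTokens: number;
  outputTokens: number;
  pricePerMillionInput: number;
  pricePerMillionOutput: number;
}

// Sum the three cost components for a single call.
function estimateCallCost(u: CallUsage): number {
  return (
    sttCost(u.audioMs) +
    ttsCost(u.ttsChars) +
    tokenCost(u.inputTokens, u.pricePerMillionInput) +
    tokenCost(u.outputTokens, u.pricePerMillionOutput)
  );
}
```

For a two-minute call with 600 synthesized characters, 4,000 input tokens and 500 output tokens, at placeholder prices of $0.30 and $2.50 per million tokens, the components are $0.0118 (STT), $0.009 (TTS) and $0.00245 (LLM), for a total of about $0.0233.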

Step 8: Build the voice agent service

  // Excerpt: these methods sit inside the VoiceAgentServiceImpl class.
  // The class header, fields, and constructor are not shown here.

  // Dispatch a Gemini function call to the matching Buildium operation.
  private async executeToolCall(fc: { name: string; args: Record<string, unknown> }): Promise<Record<string, unknown>> {
    switch (fc.name) {
      case "lookupTenant": {
        const tenant = await this.buildium.getTenantByPhone(fc.args.phone as string);
        return tenant
          ? { found: true, name: tenant.name, unit: tenant.unit, propertyName: tenant.propertyName }
          : { found: false };
      }
      case "createWorkOrder": {
        const params = fc.args as Record<string, string>;
        const order = await this.buildium.createWorkOrder({
          tenantId: params.tenantId,
          subject: params.subject,
          description: params.description,
          priority: params.priority as "low" | "medium" | "high" | "emergency" | undefined,
        });
        return { workOrderId: order.id, status: order.status };
      }
      case "getWorkOrderStatus": {
        const order = await this.buildium.getWorkOrder(fc.args.workOrderId as string);
        return { workOrderId: order.id, status: order.status, subject: order.subject, updatedAt: order.updatedAt };
      }
      case "updateWorkOrder": {
        const params = fc.args as Record<string, string>;
        const order = await this.buildium.updateWorkOrder(params.workOrderId, {
          status: params.status as "open" | "in_progress" | "completed" | "cancelled" | undefined,
          description: params.description,
        });
        return { workOrderId: order.id, status: order.status, updatedAt: order.updatedAt };
      }
      default:
        return { error: `Unknown tool: ${fc.name}` };
    }
  }

  // Convert incoming mu-law telephony audio to 16 kHz linear16 before it reaches the pipeline.
  async processAudio(sessionId: string, chunk: Buffer, sampleRate: number): Promise<void> {
    const audioChunk = {
      buffer: chunk,
      sampleRate,
      encoding: "mulaw" as const,
      channels: 1,
      timestamp: Date.now(),
    };
    const converted = STTProviderInterface.convertAudioFormat(audioChunk, 16000, "linear16");
    await this.pipeline.processAudioChunk(sessionId, converted);
  }

  async endCallSession(sessionId: string): Promise<void> {
    await this.pipeline.endSession(sessionId);
    await this.stt.close();
    this.sessions.delete(sessionId);
  }

  getCostTelemetry(): CostTelemetryService {
    return this.costTelemetry;
  }

  // Caller interrupted the agent: stop the pipeline turn and cancel pending speech.
  handleBargeIn(sessionId: string): void {
    this.pipeline.bargeIn(sessionId);
    this.tts.cancel();
  }
}

let instance: VoiceAgentService | null = null;

// Return the process-wide voice agent, creating it on first use.
export function createVoiceAgent(): VoiceAgentService {
  if (!instance) {
    instance = new VoiceAgentServiceImpl();
  }
  return instance;
}
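The processAudio method hands mu-law audio to a library helper and asks for 16 kHz linear16 back. To illustrate what a conversion of that kind involves, here is a minimal sketch using standard G.711 mu-law decoding and a simple linear-interpolation upsampler. It assumes 8 kHz input and is not the library's implementation, which may resample differently; muLawToLinear and muLaw8kToLinear16k are hypothetical names.

```typescript
// Decode one G.711 mu-law byte into a signed 16-bit PCM sample.
function muLawToLinear(byte: number): number {
  const u = ~byte & 0xff; // mu-law bytes are stored bit-inverted
  const sign = u & 0x80;
  const exponent = (u >> 4) & 0x07;
  const mantissa = u & 0x0f;
  // 0x84 is the mu-law bias (132), added before the shift and removed after it.
  const magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84;
  return sign !== 0 ? -magnitude : magnitude;
}

// Decode a mu-law frame and double its sample rate (8 kHz to 16 kHz) by
// inserting the midpoint between each pair of neighbouring samples.
function muLaw8kToLinear16k(frame: Uint8Array): Int16Array {
  const out = new Int16Array(frame.length * 2);
  for (let i = 0; i < frame.length; i++) {
    const current = muLawToLinear(frame[i]);
    // The final sample has no neighbour, so it is simply repeated.
    const next = i + 1 < frame.length ? muLawToLinear(frame[i + 1]) : current;
    out[2 * i] = current;
    out[2 * i + 1] = Math.round((current + next) / 2);
  }
  return out;
}
```

Linear interpolation is the simplest upsampler that avoids plain sample duplication. A production resampler would normally apply a low-pass filter as well, which is one reason to leave this step to the pipeline's helper.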

Intro

Prerequisites

Step 1: Scaffold the project and configure environment variables

Step 2: Create the Buildium API client with OAuth2

Step 3: Build the Twilio client

Step 4: Implement the Vertex AI LLM service with Gemini function calling

Step 5: Create the session service with Upstash Redis

Step 6: Build the cost telemetry service

Step 7: Wire the observability and instrumentation

Step 8: Build the voice agent service

Step 9: Create the Twilio call route handler

Step 10: Create the Twilio Media Streams WebSocket handler

Step 11: Create the health check and work orders API routes

Step 12: Create the public entry point

Step 13: Run the tests

Next steps