OpenAI Cost Control for ServiceTitan Small Business Agent Spend

Cap and monitor OpenAI tokens per ServiceTitan tenant so field-service AI automation never blows the monthly budget.

openai cost-control servicetitan hono nextjs circuit-breaker llm-router helicone field-service agent-budget

The problem

SMBs deploying AI-driven scheduling and dispatch agents on ServiceTitan risk runaway costs if one tenant floods the system with requests; without per-tenant budgets, a single bad day can exceed the entire month's budget.

Built from

Intro

Small businesses using ServiceTitan with AI-driven scheduling and dispatch agents risk runaway OpenAI costs if one tenant floods the system with requests. Without per-tenant budgets, a single bad day can exceed the entire month’s spend. This recipe builds a cost-control layer that enforces per-tenant token budgets using @reaatech/agent-budget-engine, auto-downgrades to cheaper models when caps approach via @reaatech/llm-router-engine, trips circuit breakers after repeated failures via @reaatech/circuit-breaker-core, and streams spend metrics to Helicone for real-time dashboards. You’ll wire everything into a Next.js 16+ App Router project with a Hono API layer.

Prerequisites

Node.js 22+ and pnpm 10+ installed
An OpenAI API key with billing enabled
A ServiceTitan developer account with client ID and secret (or you can skip the ServiceTitan integration and test with static tenant IDs)
Familiarity with TypeScript, basic Next.js App Router concepts, and terminal usage

Step 1: Scaffold the project and install dependencies

Create a Next.js 16+ project with the App Router and install all dependencies, pinning every version exactly.

terminal

npx create-next-app@latest openai-cost-control --typescript --app --src-dir --no-tailwind --eslint --import-alias

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

184 kB·149 tests·94.8% coverage·vitest passing

SHA-25699b29d56672464664d5b7feb5f0beb51799c74bb1faa8b0ded435b62e11f3e71

Book a conversation All solutions

Comments

Loading comments…

import { z } from "zod"; // ---- Data types ---- export interface ServiceTitanConfig { clientId: string; clientSecret: string; tenantId: string; baseUrl: string; } export interface ServiceTitanJob { id: string; tenantId: string; customerName: string; jobType: string; status: string; } export interface TenantBudget { tenantId: string; dailyBudget: number; monthlyBudget: number; spentToday: number; spentThisMonth: number; state: string; } export interface CostSpanRecord { id: string; tenantId: string; provider: string; model: string; inputTokens: number; outputTokens: number; costUsd: number; feature?: string; route?: string; timestamp: Date; } export interface BudgetCheckResult { allowed: boolean; action: string; suggestedModel?: string; disabledTools?: string[]; remaining: number; } export interface RecordEntry { requestId: string; scopeType: "user" | "tenant"; scopeKey: string; cost: number; inputTokens: number; outputTokens: number; modelId: string; provider: string; timestamp: Date; } // ---- Zod schemas for runtime validation ---- export const ServiceTitanJobSchema = z.object({ id: z.string().min(1), tenantId: z.string().min(1), customerName: z.string(), jobType: z.string(), status: z.string(), }); export const CostSpanRecordSchema = z.object({ id: z.string().min(1), tenantId: z.string().min(1), provider: z.string(), model: z.string(), inputTokens: z.number().int().min(0), outputTokens: z.number().int().min(0), costUsd: z.number().min(0), feature: z.string().optional(), route: z.string().optional(), timestamp: z.date(), }); export const TenantBudgetSchema = z.object({ tenantId: z.string().min(1), dailyBudget: z.number().positive(), monthlyBudget: z.number().positive(), spentToday: z.number().min(0).default(0), spentThisMonth: z.number().min(0).default(0), state: z.string().default("Active"), }); export const BudgetCheckResultSchema = z.object({ allowed: z.boolean(), action: z.string(), suggestedModel: z.string().optional(), disabledTools: z.array(z.string()).optional(), remaining: z.number(), }); export const RecordEntrySchema = z.object({ requestId: z.string().min(1), scopeType: z.enum(["user", "tenant"]), scopeKey: z.string().min(1), cost: z.number().min(0), inputTokens: z.number().int().min(0), outputTokens: z.number().int().min(0), modelId: z.string(), provider: z.string(), timestamp: z.date(), });

import { ServiceTitanJobSchema, type ServiceTitanConfig, type ServiceTitanJob } from "../lib/types.js"; import { loadAppConfig } from "../lib/config.js"; export class ServiceTitanError extends Error { constructor( public status: number, message: string, ) { super(message); this.name = "ServiceTitanError"; } } interface TokenCache { accessToken: string; expiresAt: number; } export class ServiceTitanClient { private tokenCache: TokenCache | null = null; constructor(private config: ServiceTitanConfig) {} async getAccessToken(): Promise<string> { if (this.tokenCache && Date.now() < this.tokenCache.expiresAt) { return this.tokenCache.accessToken; } const resp = await fetch(`${this.config.baseUrl}/connect/token`, { method: "POST", headers: { "Content-Type": "application/x-www-form-urlencoded" }, body: new URLSearchParams({ grant_type: "client_credentials", client_id: this.config.clientId, client_secret: this.config.clientSecret, }), }); if (!resp.ok) { throw new ServiceTitanError(resp.status, `OAuth2 token request failed: ${resp.statusText}`); } const data = (await resp.json()) as { access_token: string; expires_in: number }; this.tokenCache = { accessToken: data.access_token, expiresAt: Date.now() + (data.expires_in - 60) * 1000, }; return data.access_token; } async getTenantFromJob(jobId: string): Promise<string> { if (!jobId) throw new Error("jobId is required"); const token = await this.getAccessToken(); const resp = await fetch(`${this.config.baseUrl}/v1/jobs/${jobId}`, { headers: { Authorization: `Bearer ${token}` }, }); if (resp.status === 404) throw new ServiceTitanError(404, `Job ${jobId} not found`); if (!resp.ok) throw new ServiceTitanError(resp.status, `Job fetch failed: ${resp.statusText}`); const data = (await resp.json()) as { tenantId: string }; return data.tenantId; } async getJobDetails(jobId: string): Promise<ServiceTitanJob> { if (!jobId) throw new Error("jobId is required"); const token = await this.getAccessToken(); const resp = await fetch(`${this.config.baseUrl}/v1/jobs/${jobId}`, { headers: { Authorization: `Bearer ${token}` }, }); if (resp.status === 404) throw new ServiceTitanError(404, `Job ${jobId} not found`); if (!resp.ok) throw new ServiceTitanError(resp.status, `Job fetch failed: ${resp.statusText}`); return ServiceTitanJobSchema.parse(await resp.json()); } } export function createServiceTitanClient(config?: Partial<ServiceTitanConfig>): ServiceTitanClient { const appConfig = loadAppConfig(); return new ServiceTitanClient({ clientId: config?.clientId ?? appConfig.SERVICETITAN_CLIENT_ID, clientSecret: config?.clientSecret ?? appConfig.SERVICETITAN_CLIENT_SECRET, tenantId: config?.tenantId ?? appConfig.SERVICETITAN_TENANT_ID, baseUrl: config?.baseUrl ?? appConfig.SERVICETITAN_BASE_URL, }); }

Endpoint	Method	Purpose
`/api/health`	GET	Liveness check
`/api/cost/budget/:tenantId`	GET	Current budget state
`/api/cost/check`	POST	Pre-flight budget check
`/api/cost/record`	POST	Record a spend after an LLM call
`/api/cost/summary/:tenantId`	GET	Aggregated cost by period
`/api/chat`	POST	End-to-end chat with budget enforcement

OpenAI Cost Control for ServiceTitan Small Business Agent Spend

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Step 2: Configure environment variables

Step 3: Create shared types and config

Step 4: Set up the database layer

Step 5: Implement the ServiceTitan client

Step 6: Build the budget enforcement service

Step 7: Build the cost telemetry service

Step 8: Build the circuit breaker service

Step 9: Build the OpenAI service with Helicone proxying

Step 10: Build the LLM router with auto-downgrade

Step 11: Wire the Hono API app

Step 12: Create the Next.js catch-all route handler

Step 13: Write the orchestrator entry point

Step 14: Write tests and run the suite

Next steps