Small businesses using multiple AI agents on Vertex AI often exceed their monthly budget due to unpredictable model calls and no per-agent cost controls.
You’ll build an Express server that acts as a budget guard for any AI agent calling Vertex AI. Every LLM request passes through a middleware that checks per-scope spending limits, attaches budget headers to the response, and blocks requests when the limit is hit. By the end you’ll have a working server you can start with pnpm dev, a /api/llm endpoint that enforces budget rules, a /metrics endpoint for real-time cost dashboards, and a full test suite with 90%+ coverage.
Prerequisites
Node.js >= 22 — the project uses ES2022 and "type": "module"
pnpm 10.x — the lockfile was generated with pnpm 10.9.0
A Google Cloud project with the Vertex AI API enabled — you’ll need your project ID and a service account with Vertex AI permissions
Application Default Credentials — run gcloud auth application-default login so the Vertex AI SDK can authenticate
Familiarity with TypeScript, Express, and environment variables
Step 1: Scaffold the project
Create a new directory and a package.json that declares the project as an ES module with the exact scripts and metadata the recipe needs.
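A minimal package.json along these lines would work. Treat it as a sketch: the script commands (tsx, vitest flags) and the "latest" version placeholders are assumptions; pin versions to match the published REAA packages and the recipe's lockfile.

```json
{
  "name": "vertex-budget-guardrails",
  "version": "1.0.0",
  "type": "module",
  "scripts": {
    "dev": "tsx watch src/index.ts",
    "build": "tsc",
    "start": "node dist/index.js",
    "test": "vitest run --coverage"
  },
  "dependencies": {
    "@reaatech/agent-budget-engine": "latest",
    "@reaatech/agent-budget-pricing": "latest",
    "@reaatech/agent-budget-spend-tracker": "latest",
    "@reaatech/agent-budget-llm-router-plugin": "latest",
    "@reaatech/agent-budget-middleware": "latest",
    "@reaatech/agent-eval-harness-cost": "latest",
    "@google-cloud/vertexai": "latest",
    "express": "^5.0.0"
  },
  "devDependencies": {
    "typescript": "latest",
    "vitest": "latest",
    "@vitest/coverage-v8": "latest",
    "msw": "latest",
    "supertest": "latest"
  }
}
```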
Step 2: Install dependencies
Run pnpm install from the project root. This pulls in everything: the REAA budget packages (@reaatech/agent-budget-engine, agent-budget-pricing, agent-budget-spend-tracker, agent-budget-llm-router-plugin, agent-budget-middleware, agent-eval-harness-cost), the Vertex AI SDK, Express 5, and all the dev tooling (TypeScript, Vitest, ESLint, Prettier, MSW, supertest).
terminal
pnpm install
Expected output: pnpm resolves and installs all dependencies, then prints a summary like Done in Xs. You’ll see node_modules/ appear, along with a pnpm-lock.yaml lockfile at the project root.
Step 3: Configure TypeScript
The project targets ES2022 with NodeNext module resolution and strict mode enabled. Create tsconfig.json at the project root:
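A tsconfig.json consistent with those settings looks roughly like this; the outDir, rootDir, and include globs are assumptions, so adjust them to your layout:

```json
{
  "compilerOptions": {
    "target": "ES2022",
    "module": "NodeNext",
    "moduleResolution": "NodeNext",
    "strict": true,
    "esModuleInterop": true,
    "skipLibCheck": true,
    "outDir": "dist",
    "rootDir": "src"
  },
  "include": ["src/**/*.ts"]
}
```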
Step 4: Configure environment variables
The server reads its configuration from the environment. Create .env at the project root with every variable the app expects, and fill in your actual GCP project ID.
env
# Server
PORT=3000

# Vertex AI (required — get from GCP console)
PROJECT_ID=your-gcp-project-id
LOCATION=us-central1
DEFAULT_MODEL_ID=gemini-2.0-flash

# Budget
DEFAULT_BUDGET_LIMIT=100.00
BUDGET_SCOPE_TYPE_HEADER=x-budget-scope-type
BUDGET_SCOPE_KEY_HEADER=x-budget-scope-key

# Admin
ADMIN_TOKEN=change-me-to-a-secure-random-string
LOG_LEVEL=info
Don’t skip PROJECT_ID — the server throws a descriptive error on startup if it’s missing or empty. ADMIN_TOKEN protects the /metrics endpoint; replace it with a secure random string before deploying.
Step 5: Create the configuration loader
The config module reads every environment variable, applies defaults, validates required fields, and returns a frozen typed object. Create src/config.ts:
parseIntSafe and parseFloatSafe guard against garbage values — a non-numeric PORT silently falls back to 3000, and a non-numeric DEFAULT_BUDGET_LIMIT falls back to 100.0.
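Since the recipe does not reproduce the full file, here is a minimal sketch of what src/config.ts could look like. The field names (port, projectId, defaultModelId, defaultBudgetLimit, adminToken) are inferred from how the rest of the recipe uses the config object; treat this as a starting point, not the canonical implementation.

```typescript
// Sketch of src/config.ts: read env vars, apply defaults, validate required
// fields, and return a frozen typed object. Field names are assumptions
// based on usage elsewhere in the recipe.
export interface AppConfig {
  port: number;
  projectId: string;
  location: string;
  defaultModelId: string;
  defaultBudgetLimit: number;
  adminToken: string;
}

export function parseIntSafe(raw: string | undefined, fallback: number): number {
  const n = Number.parseInt(raw ?? '', 10);
  return Number.isNaN(n) ? fallback : n;
}

export function parseFloatSafe(raw: string | undefined, fallback: number): number {
  const n = Number.parseFloat(raw ?? '');
  return Number.isNaN(n) ? fallback : n;
}

export function loadConfig(
  env: Record<string, string | undefined> = process.env,
): Readonly<AppConfig> {
  const projectId = env.PROJECT_ID ?? '';
  if (projectId.trim() === '') {
    // Fail fast with a descriptive error, as described in Step 4
    throw new Error('PROJECT_ID is required: set it in .env to your GCP project ID');
  }
  return Object.freeze({
    port: parseIntSafe(env.PORT, 3000),
    projectId,
    location: env.LOCATION ?? 'us-central1',
    defaultModelId: env.DEFAULT_MODEL_ID ?? 'gemini-2.0-flash',
    defaultBudgetLimit: parseFloatSafe(env.DEFAULT_BUDGET_LIMIT, 100.0),
    adminToken: env.ADMIN_TOKEN ?? '',
  });
}
```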
Step 6: Create shared types
This module re-exports error types from @reaatech/agent-budget-types, declares the BudgetContext that rides on every Express request, defines the MetricsResponse shape, and creates a custom VertexApiError. Create src/types.ts:
ts
import type { BudgetScope } from '@reaatech/agent-budget-types';
export { BudgetScope } from '@reaatech/agent-budget-types';
export {
  BudgetExceededError,
  BudgetValidationError,
  BudgetError,
  BudgetErrorCode,
  type ScopeIdentifier,
  type BudgetState,
  type BudgetCheckResult,
} from '@reaatech/agent-budget-types';

export interface BudgetContext {
  scopeType: BudgetScope;
  scopeKey: string;
  remainingBudget: number;
  allowed: boolean;
  suggestedModel?: string;
  modelId?: string;
  disabledTools?: string[];
  warning?: string;
}

declare global {
  namespace Express {
    interface Request {
      budgetContext?: BudgetContext;
    }
  }
}

export type BudgetRequest = import('express').Request;

export interface MetricsResponse {
  totalSpent: number;
  perScopeBreakdown: Record<string, { spent: number; limit: number; state: string }>;
  efficiencyScores: Record<string, number>;
  trajectoryCount: number;
}

export class VertexApiError extends Error {
  public readonly statusCode: number;
  public readonly vertexCode: string | undefined;

  public constructor(message: string, statusCode: number, vertexCode?: string) {
    super(message);
    this.name = 'VertexApiError';
    this.statusCode = statusCode;
    this.vertexCode = vertexCode;
  }
}
The declare global block augments Express’s Request type so every route handler can access req.budgetContext without casting.
Step 7: Create the Vertex AI client wrapper
The VertexClient class wraps the @google-cloud/vertexai SDK. It initializes with your project and location, then exposes a generateContent method that handles response parsing and error normalization. Create src/lib/vertex-client.ts:
Any Vertex API error is caught and re-thrown as a VertexApiError with a known HTTP status code, which the Express error-handling chain can surface cleanly.
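The normalization step can be sketched as below. The VertexApiError class mirrors the one from src/types.ts; the message-to-status mapping is an illustrative assumption, not the SDK's actual error taxonomy.

```typescript
// Sketch of the error-normalization logic described above. The mapping from
// error messages to HTTP status codes is assumed for illustration.
class VertexApiError extends Error {
  constructor(
    message: string,
    public readonly statusCode: number,
    public readonly vertexCode?: string,
  ) {
    super(message);
    this.name = 'VertexApiError';
  }
}

function toVertexApiError(err: unknown): VertexApiError {
  if (err instanceof VertexApiError) return err;
  const message = err instanceof Error ? err.message : String(err);
  if (/permission|credential|unauthenticated/i.test(message)) {
    return new VertexApiError(message, 403, 'PERMISSION_DENIED');
  }
  if (/quota|rate limit/i.test(message)) {
    return new VertexApiError(message, 429, 'RESOURCE_EXHAUSTED');
  }
  // Anything else surfaces as a generic upstream failure
  return new VertexApiError(message, 502);
}
```

With this shape, the Express error handler can branch on err instanceof VertexApiError and use err.statusCode directly.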
Step 8: Create the budget middleware
This is the core of the recipe. The middleware factory wires together all the REAA packages — SpendStore for tracking, PricingEngine for cost calculation, BudgetController for policy enforcement, and BudgetAwareStrategy for model routing — into a pair of Express middleware functions. Create src/middleware/budget.ts:
ts
import { BudgetController } from '@reaatech/agent-budget-engine';
import { SpendStore } from '@reaatech/agent-budget-spend-tracker';
import { PricingEngine } from '@reaatech/agent-budget-pricing';
import { BudgetAwareStrategy } from '@reaatech/agent-budget-llm-router-plugin';
import { BudgetExceededError, BudgetScope, BudgetValidationError } from '@reaatech/agent-budget-types';
import type { BudgetCheckResult } from '@reaatech/agent-budget-types';
import type { Request, Response, NextFunction } from 'express';
import { VertexClient } from '../lib/vertex-client.js';
import type { AppConfig } from '../config.js';
import type { BudgetContext } from '../types.js';
Here’s what happens on each request:
beforeStep reads x-budget-scope-type and x-budget-scope-key from the request headers, maps them to a BudgetScope, runs a pre-flight cost check through the BudgetController, and sets response headers (X-Budget-Remaining, X-Budget-Limit, X-Budget-Status, and optionally X-Budget-Suggested-Model). If the budget is exceeded it returns HTTP 402 immediately — your route handler never runs.
BudgetAwareStrategy consults the pricing registry and suggests a cheaper model when the budget is constrained.
afterStep runs after your handler completes and records the spend (input tokens, output tokens, computed cost) into the SpendStore for future checks.
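The spend that afterStep records boils down to token counts multiplied by per-token prices. A sketch of that arithmetic follows; the per-million-token prices used in the usage note are placeholder numbers, not real Vertex AI rates.

```typescript
// Illustrative cost math behind afterStep: input/output tokens times
// per-million-token prices. Prices here are placeholders, not real rates.
interface ModelPricing {
  inputPerMillionTokens: number;  // USD per 1M input tokens
  outputPerMillionTokens: number; // USD per 1M output tokens
}

function computeCost(pricing: ModelPricing, inputTokens: number, outputTokens: number): number {
  return (
    (inputTokens / 1_000_000) * pricing.inputPerMillionTokens +
    (outputTokens / 1_000_000) * pricing.outputPerMillionTokens
  );
}
```

A pre-flight check in beforeStep can apply the same formula to an estimated token count and compare the result against the remaining budget before the request ever reaches Vertex.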
Step 9: Create the metrics endpoint
The /metrics route reads from the BudgetController and CostTracker to produce a real-time JSON dashboard. It’s protected by an admin token. Create src/lib/metrics.ts:
ts
import { Router, type Request, type Response } from 'express';
import { BudgetController } from '@reaatech/agent-budget-engine';
import { SpendStore } from '@reaatech/agent-budget-spend-tracker';
import { CostTracker } from '@reaatech/agent-eval-harness-cost';
import type { MetricsResponse } from '../types.js';
import type { AppConfig } from '../config.js';

export function createMetricsRouter(
  _store: SpendStore,
  controller: BudgetController,
  appConfig: Readonly<AppConfig>,
): Router {
  const router = Router();
  const costTracker = new CostTracker(appConfig.defaultBudgetLimit);

  router.get('/metrics', (req: Request, res: Response): void => {
    // Check admin authorization
    const authHeader = req.headers.authorization;
    if (!authHeader) {
      res.status(401).json({ error: 'Missing authorization header' });
      return;
    }
    const token = authHeader.startsWith('Bearer ') ? authHeader.slice(7) : authHeader;
    if (token !== appConfig.adminToken) {
      res.status(401).json({ error: 'Invalid admin token' });
      return;
    }

    try {
      // Query all budgets from controller
      const allBudgets = controller.listAll();
      let totalSpent = 0;
      const perScopeBreakdown: Record<string, { spent: number; limit: number; state: string }> = {};
      for (const entry of allBudgets) {
        const def = entry.definition;
        const state = entry.state;
        const spent = state?.spent ?? 0;
        const limitVal = def.limit;
        const stateStr = state?.state ?? 'active';
        totalSpent += spent;
        const key = `${def.scopeType}:${def.scopeKey}`;
        perScopeBreakdown[key] = { spent, limit: limitVal, state: stateStr };
      }

      // Get cost tracker data
      const trackedTotal = costTracker.getTotalCost();
      const trajectoryCount = costTracker.getTrajectoryCount();

      // Efficiency scores - simple heuristic based on spend vs limit
      const efficiencyScores: Record<string, number> = {};
      for (const entry of allBudgets) {
        const def = entry.definition;
        const state = entry.state;
        if (state !== undefined) {
          const efficiency =
            def.limit > 0 ? Math.min(100, Math.round((1 - state.spent / def.limit) * 100)) : 100;
          efficiencyScores[`${def.scopeType}:${def.scopeKey}`] = efficiency;
        }
      }

      const metrics: MetricsResponse = {
        totalSpent: trackedTotal > 0 ? trackedTotal : totalSpent,
        perScopeBreakdown,
        efficiencyScores,
        trajectoryCount,
      };
      res.json(metrics);
    } catch (error: unknown) {
      const message = error instanceof Error ? error.message : 'Internal server error';
      res.status(500).json({ error: message });
    }
  });

  return router;
}
The response is a MetricsResponse object with totalSpent, perScopeBreakdown (keyed by "scopeType:scopeKey"), efficiencyScores (0–100, computed as (1 - spent/limit) * 100), and trajectoryCount.
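Pulled out on its own, the efficiency heuristic is just the formula from the route, clamped to the 0–100 range:

```typescript
// Efficiency heuristic from the metrics route: 100 means nothing spent,
// 0 means the budget is fully consumed; a non-positive limit scores 100.
function efficiencyScore(spent: number, limit: number): number {
  if (limit <= 0) return 100;
  return Math.min(100, Math.round((1 - spent / limit) * 100));
}
```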
Step 10: Assemble the Express app and entry point
The app factory creates an Express application, mounts the budget middleware on POST /api/llm, mounts the metrics router, and adds a /health endpoint. Create src/app.ts:
ts
import express, { type Application, type Request, type Response, type NextFunction } from 'express';
import { createBudgetGuardMiddleware } from './middleware/budget.js';
import { createMetricsRouter } from './lib/metrics.js';
import { loadConfig } from './config.js';

export function createApp(): Application {
  const app = express();

  // JSON body parser
  app.use(express.json());

  // Request logging middleware
  app.use((req: Request, _res: Response, next: NextFunction) => {
    const now = new Date().toISOString();
    console.log(`[${now}] ${req.method} ${req.url}`);
    next();
  });

  const config = loadConfig();

  // Create budget middleware
  const { beforeStep, afterStep, controller, store } = createBudgetGuardMiddleware(config);

  // Mount budget middleware on /api/llm with proper chaining
  app.post(
    '/api/llm',
    beforeStep,
    (req: Request, res: Response, next: NextFunction) => {
      // The Vertex AI call would go here in production
      res.json({
        status: 'ok',
        model: req.budgetContext?.modelId ?? config.defaultModelId,
      });
      // Proceed to afterStep for spend recording
      next();
    },
    afterStep,
  );

  // Mount metrics router
  const metricsRouter = createMetricsRouter(store, controller, config);
  app.use(metricsRouter);

  // Health check
  app.get('/health', (_req: Request, res: Response) => {
    res.json({ status: 'ok', uptime: process.uptime() });
  });

  return app;
}
The entry point loads the config, creates the app, starts listening, and handles graceful shutdown on SIGTERM/SIGINT. Create src/index.ts:
ts
import { createApp } from './app.js';
import { loadConfig } from './config.js';

const config = loadConfig();
const app = createApp();

const server = app.listen(config.port, () => {
  console.log(
    `Vertex Budget Guardrails listening on port ${config.port} (model: ${config.defaultModelId})`,
  );
});

// Graceful shutdown
function shutdown(signal: string): void {
  console.log(`\n${signal} received — shutting down...`);
  server.close(() => {
    console.log('Server closed');
    process.exit(0);
  });
  // Force shutdown after 5 seconds
  setTimeout(() => {
    console.error('Forced shutdown after timeout');
    process.exit(1);
  }, 5_000).unref();
}

process.on('SIGTERM', () => shutdown('SIGTERM'));
process.on('SIGINT', () => shutdown('SIGINT'));
Step 11: Run the test suite
The project ships with 40 tests across 14 suites covering the config loader, budget middleware (every scope type, budget-exceeded paths, edge cases with missing/invalid headers), the metrics endpoint (valid tokens, missing tokens, wrong tokens, empty store), and full integration flows. Tests use Vitest with V8 coverage and MSW to mock Vertex AI calls.
terminal
pnpm test
Expected output: all 40 tests pass and the terminal prints a V8 coverage summary showing 90%+ line coverage.
To see the budget guard block a request, repeat the /api/llm call many times (or set DEFAULT_BUDGET_LIMIT=0.01 in .env and restart). When the limit is reached, the server returns HTTP 402 with {"error": "Budget exceeded"}.
Next steps
Wire up the real Vertex AI call — replace the stub in src/app.ts with vertexClient.generateContent(prompt, req.budgetContext?.modelId) to connect live LLM calls through the budget guard.
Add per-agent scoped budgets — call controller.defineBudget() for each agent ID (e.g. BudgetScope.User with the agent’s key) so different agents have independent spending caps.
Schedule spend reporting — wrap the metrics endpoint in a cron job that posts daily spend summaries to Slack or email, so you catch runaway costs before they hit the hard cap.