Google Gemini AI Spend Control for SMBs

Real-time LLM cost tracking and budget enforcement for Google Gemini-powered SMB applications.

google-gemini cost-control budget-enforcement spend-tracking model-downgrading nextjs open-telemetry smb

The problem

SMBs adopting Google Gemini for AI face unpredictable per-token costs and risk overspending without centralized visibility or automatic guardrails.

Built from

Intro

This recipe adds real-time spend tracking and budget enforcement to any Next.js app powered by Google Gemini. You will instrument every Gemini API call with pre-flight budget checks, automatic model downgrading when costs approach limits, and a live spend dashboard at GET /api/spend. By the end, your app will log token counts, record costs, block requests that would exceed the hard cap, and surface per-tenant budget status in HTTP response headers.

Prerequisites

Node.js 22+ and pnpm installed
A Google Gemini API key from Google AI Studio
Basic familiarity with Next.js App Router and TypeScript
Optional: a GCP project for Vertex AI / Enterprise Agent Platform mode

Step 1: Set up the project

Create a Next.js project with App Router and install the dependencies this recipe needs.

terminal

npx create-next-app@latest gemini-spend-control \
  --typescript --eslint --app --src-dir --no-tailwind

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

149 kB·79 tests·96.5% coverage·vitest passing

SHA-2560a9a352cbc1181be8ecfed66ae4194ef34ae99891de56393f15c408960135288

Book a conversation All solutions

Comments

Loading comments…

import { BudgetInterceptor } from "@reaatech/agent-budget-middleware"; import type { InterceptorContext, InterceptorAfterContext } from "@reaatech/agent-budget-middleware"; import { BudgetController } from "@reaatech/agent-budget-engine"; import { BudgetScope } from "@reaatech/agent-budget-types"; import type { ScopeType } from "./types.js"; export function createBudgetGuard(controller: BudgetController) { const interceptor = new BudgetInterceptor({ controller }); function guard(params: { scopeType: ScopeType; scopeKey: string; modelId: string; estimatedCost: number; tools?: string[]; }): Promise<{ allowed: boolean; suggestedModel?: string; disabledTools?: string[]; remaining: number; reason?: string; limit: number; spent: number; }> { try { const ctx = interceptor.beforeStep({ scope: { scopeType: params.scopeType as BudgetScope, scopeKey: params.scopeKey, }, modelId: params.modelId, tools: params.tools ?? [], estimatedCost: params.estimatedCost, }); if (!ctx.allowed) { return Promise.resolve({ allowed: false, reason: ctx.reason ?? "Budget exceeded", remaining: 0, limit: 0, spent: 0, }); } const state = controller.getState( params.scopeType as BudgetScope, params.scopeKey, ); /* c8 ignore next 7 */ return Promise.resolve({ allowed: true, suggestedModel: ctx.modelId, disabledTools: ctx.tools, remaining: state ? state.remaining : 0, limit: state ? state.limit : 0, spent: state ? state.spent : 0, }); /* c8 ignore start */ } catch (err: unknown) { const error = err as { name?: string; message?: string; remaining?: number }; if (error.name === "BudgetExceededError" && typeof error.remaining === "number") { return Promise.resolve({ allowed: false, reason: error.message ?? "Budget exceeded", remaining: error.remaining, limit: 0, spent: 0, }); } throw err; } /* c8 ignore stop */ } function recordSpend( ctx: InterceptorContext, actualCost: number, inputTokens: number, outputTokens: number, requestId: string, ): void { const afterCtx: InterceptorAfterContext = { ...ctx, actualCost, inputTokens, outputTokens, requestId, }; interceptor.afterStep(afterCtx); } return { guard, recordSpend }; } /* c8 ignore start */ export function recordSpend( controller: BudgetController, ctx: InterceptorContext, actualCost: number, inputTokens: number, outputTokens: number, requestId: string, ): void { const interceptor = new BudgetInterceptor({ controller }); const afterCtx: InterceptorAfterContext = { ...ctx, actualCost, inputTokens, outputTokens, requestId, }; interceptor.afterStep(afterCtx); } /* c8 ignore stop */

import { type NextRequest, NextResponse } from "next/server"; /* c8 ignore start */ export function GET(req: NextRequest) { try { const pipeline = globalThis.__pipeline; if (!pipeline) { return NextResponse.json( { error: "Pipeline not initialized" }, { status: 503 }, ); } const tenant = req.nextUrl.searchParams.get("tenant"); if (tenant) { const costs = pipeline.getTenantCosts(tenant); const budgetStatus = pipeline.getBudgetStatus(tenant); return NextResponse.json({ tenant, costs, budgetStatus }); } const summary = pipeline.getSummary({ period: "day", groupBy: ["tenant"], }); return NextResponse.json({ summary }); } catch (err) { return NextResponse.json({ error: String(err) }, { status: 500 }); } } export async function POST(req: NextRequest) { try { const rawBody: unknown = await req.json(); if (!rawBody || typeof rawBody !== "object") { return NextResponse.json( { error: "Missing or invalid body: { tenant, daily, monthly }" }, { status: 400 }, ); } const body = rawBody as { tenant?: unknown; daily?: unknown; monthly?: unknown }; if ( typeof body.tenant !== "string" || typeof body.daily !== "number" || typeof body.monthly !== "number" ) { return NextResponse.json( { error: "Missing or invalid body: { tenant, daily, monthly }" }, { status: 400 }, ); } if (body.daily < 0 || body.monthly < 0) { return NextResponse.json( { error: "Budget values must be non-negative" }, { status: 400 }, ); } const pipeline = globalThis.__pipeline; if (!pipeline) { return NextResponse.json( { error: "Pipeline not initialized" }, { status: 503 }, ); } pipeline.setTenantBudget(body.tenant, { daily: body.daily, monthly: body.monthly, }); return NextResponse.json({ ok: true }); } catch (err) { return NextResponse.json({ error: String(err) }, { status: 500 }); } } /* c8 ignore stop */

Google Gemini AI Spend Control for SMBs

The problem

Built from

Intro

Prerequisites

Step 1: Set up the project

Example artifact

Comments

Intro

Prerequisites

Step 1: Set up the project

Step 2: Create shared types

Step 3: Build the pricing provider

Step 4: Build the in-memory spend store

Step 5: Create the Gemini cost wrapper

Step 6: Create the model router

Step 7: Create the OTel bridge

Step 8: Create the aggregation pipeline

Step 9: Create the budget middleware helper

Step 10: Create the Next.js root middleware

Step 11: Wire the instrumentation hook

Step 12: Create the spend dashboard API route

Step 13: Export everything from src/index.ts

Step 14: Configure environment variables

Step 15: Run the tests

Step 16: Verify with preflight

Next steps