OpenRouter Cost Control for SMB API Spend Management

Enforce daily budgets, track per‑model spend, and automatically downgrade to cheaper fallback models when your SMB API budget is at risk.

openrouter cost-control express budget-enforcement llm-proxy spend-management fallback-routing

The problem

Small businesses using OpenRouter often see unpredictable LLM bills because one expensive model call can blow their monthly budget. Without granular cost tracking and automatic throttling, spend control is reactive at best.

Built from

Intro

Small businesses using OpenRouter often see unpredictable API bills — one expensive model call can blow the monthly budget. Without automatic throttling and per-tenant cost tracking, spend control is reactive at best.

This tutorial builds a cost-aware proxy that sits between your application and OpenRouter. Every chat completion passes through a budget check — if a tenant’s daily cap is at risk, the proxy downgrades to a cheaper fallback model. Each call is recorded as a cost span, aggregated across tenants and models, and pushed to observability backends.

By the end you’ll have a working Next.js App Router project that enforces per-tenant daily/monthly budgets, routes through a fallback chain with circuit breakers, and exports telemetry. The @reaatech/* package family does the heavy lifting — you wire the pieces together.

Prerequisites

Node.js >= 22 — the project uses modern JavaScript features
pnpm 10 — the package manager is pinned in package.json
An OpenRouter API key — get one at https://openrouter.ai/keys (you’ll set it as OPENROUTER_API_KEY)
Familiarity with TypeScript and Next.js App Router — you’ll write route handlers in app/api/ using NextRequest and NextResponse

Step 1: Scaffold the project and install dependencies

Start from an empty directory. Initialize a Next.js 16 project with TypeScript and the App Router:

terminal

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

161 kB·58 tests·98.6% coverage·vitest passing

SHA-25621fb1b7de84cc2760df121990b8469c488e6d05af2c841cd817e65644bf97b4e

Book a conversation All solutions

Comments

Loading comments…

import { BudgetController } from "@reaatech/agent-budget-engine"; import { SpendStore } from "@reaatech/agent-budget-spend-tracker"; import { BudgetScope, type BudgetCheckResult } from "@reaatech/agent-budget-types"; export function createBudgetController(): BudgetController { return new BudgetController({ spendTracker: new SpendStore() }); } export function defineTenantBudget( controller: BudgetController, tenantId: string, dailyLimit: number, monthlyLimit: number, autoDowngrade?: Array<{ from: string[]; to: string }>, ): void { controller.defineBudget({ scopeType: BudgetScope.User, scopeKey: tenantId, limit: dailyLimit, policy: { softCap: 0.8, hardCap: 1.0, autoDowngrade: autoDowngrade ?? [], }, }); } export function checkBudget( controller: BudgetController, tenantId: string, estimatedCost: number, modelId: string, tools?: string[], ): BudgetCheckResult { return controller.check({ scopeType: BudgetScope.User, scopeKey: tenantId, estimatedCost, modelId, tools: tools ?? [], }); } export function recordSpend( controller: BudgetController, tenantId: string, requestId: string, cost: number, inputTokens: number, outputTokens: number, modelId: string, provider: string, ): void { controller.record({ requestId, scopeType: BudgetScope.User, scopeKey: tenantId, cost, inputTokens, outputTokens, modelId, provider, timestamp: new Date(), }); } export function onThresholdBreach(event: { threshold: number; scopeType: string; scopeKey: string }): void { console.warn(`Budget threshold breached at ${String(event.threshold * 100)}% for ${event.scopeType}:${event.scopeKey}`); } export function onHardStop(event: { spent: number; limit: number; scopeType: string; scopeKey: string }): void { console.error(`Hard stop triggered for ${event.scopeType}:${event.scopeKey} — spent ${String(event.spent)} / limit ${String(event.limit)}`); } export function onStateChange(event: { from: string; to: string; scopeType: string; scopeKey: string }): void { console.log(`Budget state change for ${event.scopeType}:${event.scopeKey}: ${event.from} -> ${event.to}`); } export function attachBudgetEvents(controller: BudgetController): void { controller.on("threshold-breach", onThresholdBreach); controller.on("hard-stop", onHardStop); controller.on("state-change", onStateChange); }

import OpenAI from "openai"; import { loadAppConfig } from "./lib/config.js"; import { createBudgetController, defineTenantBudget, attachBudgetEvents } from "./lib/budget-check.js"; import { buildFallbackChain, registerFallbackModels } from "./lib/fallback.js"; import { createCostPipeline, createPhoenixExporter } from "./lib/cost-sink.js"; import { ProxyService } from "./services/proxy-service.js"; import type { ModelDefinition } from "@reaatech/llm-router-core"; export function createProxyService(): ProxyService { const config = loadAppConfig(); const budgetCtrl = createBudgetController(); const tenantDefaults: Record<string, { daily: number; monthly: number }> = {}; const primaryTenant = "default"; tenantDefaults[primaryTenant] = { daily: config.proxy.defaultDailyBudget, monthly: config.proxy.defaultMonthlyBudget, }; const budgetsJson = process.env["TENANT_BUDGETS"]; if (budgetsJson) { try { const parsed = JSON.parse(budgetsJson) as Record<string, { daily: number; monthly: number }>; for (const [tenantId, limits] of Object.entries(parsed)) { tenantDefaults[tenantId] = limits; } } catch { // ignore invalid JSON } } for (const [tenantId, limits] of Object.entries(tenantDefaults)) { defineTenantBudget(budgetCtrl, tenantId, limits.daily, limits.monthly); } attachBudgetEvents(budgetCtrl); const allModelIds = [ config.proxy.primaryModel, ...config.proxy.fallbackModels, ].filter(Boolean); const fallbackChain = buildFallbackChain("openrouter-chain", allModelIds); const models: ModelDefinition[] = allModelIds.map((id) => ({ id, provider: "openrouter", costPerMillionInput: 0, costPerMillionOutput: 0, maxTokens: 128000, capabilities: ["general" as const], })); registerFallbackModels(fallbackChain, models); const { collector, aggregator, budgetManager } = createCostPipeline(tenantDefaults); const exporters: import("@reaatech/llm-cost-telemetry-exporters").BaseExporter[] = []; const lokiHost = process.env["LOKI_HOST"]; if (lokiHost) { const phoenixExporter = createPhoenixExporter(lokiHost); exporters.push(phoenixExporter); } const openRouterApiKey = config.proxy.openRouterApiKey; const openRouter = new OpenAI({ baseURL: "https://openrouter.ai/api/v1", apiKey: openRouterApiKey, }); return new ProxyService( openRouter, budgetCtrl, fallbackChain, collector, aggregator, budgetManager, exporters, config.proxy, ); }

import { describe, it, expect, afterEach } from "vitest"; import { PACKAGE_NAME } from "../../src/types.js"; import { getProxyConfig, parseFallbackChain, getTenantFromRequest, loadAppConfig, } from "../../src/lib/config.js"; const OLD_ENV = { ...process.env }; afterEach(() => { process.env = { ...OLD_ENV }; }); describe("getProxyConfig", () => { it("returns primaryModel from env when PRIMARY_MODEL is set", () => { process.env["PRIMARY_MODEL"] = "openai/gpt-5.2"; process.env["FALLBACK_MODEL_CHAIN"] = "openai/gpt-4,anthropic/claude-3"; process.env["DEFAULT_DAILY_BUDGET"] = "50"; process.env["DEFAULT_MONTHLY_BUDGET"] = "1000"; const config = getProxyConfig(); expect(config.primaryModel).toBe("openai/gpt-5.2"); expect(config.fallbackModels).toEqual(["openai/gpt-4", "anthropic/claude-3"]); expect(config.defaultDailyBudget).toBe(50); expect(config.defaultMonthlyBudget).toBe(1000); }); it("returns defaults when env vars are missing", () => { delete process.env["PRIMARY_MODEL"]; delete process.env["FALLBACK_MODEL_CHAIN"]; const config = getProxyConfig(); expect(config.primaryModel).toBe("openai/gpt-5.2"); expect(config.fallbackModels).toEqual([]); expect(config.defaultDailyBudget).toBe(100); expect(config.defaultMonthlyBudget).toBe(2000); }); }); describe("types", () => { it("PACKAGE_NAME is defined", () => { expect(PACKAGE_NAME).toBe("openrouter-cost-control"); }); }); describe("parseFallbackChain", () => { it('returns ["a","b","c"] for "a,b,c"', () => { expect(parseFallbackChain("a,b,c")).toEqual(["a", "b", "c"]); }); it('returns [] for ""', () => { expect(parseFallbackChain("")).toEqual([]); }); it("trims whitespace around model names", () => { expect(parseFallbackChain(" a , b , c ")).toEqual(["a", "b", "c"]); }); }); describe("getTenantFromRequest", () => { it("extracts X-Tenant-Id header", () => { const headers = new Headers({ "X-Tenant-Id": "acme" }); expect(getTenantFromRequest(headers)).toBe("acme"); }); it('returns "default" with missing header', () => { const headers = new Headers(); expect(getTenantFromRequest(headers)).toBe("default"); }); }); describe("loadAppConfig", () => { it("returns proxy and budget sections", () => { const appConfig = loadAppConfig(); expect(appConfig.proxy).toBeDefined(); expect(appConfig.budget).toBeDefined(); expect(appConfig.proxy.primaryModel).toBeDefined(); }); });

OpenRouter Cost Control for SMB API Spend Management

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Step 2: Set up environment variables

Step 3: Define the shared types

Step 4: Build the configuration loader

Step 5: Create telemetry helpers

Step 6: Set up budget enforcement

Step 7: Build the fallback chain

Step 8: Create the cost aggregation pipeline

Step 9: Wire up the proxy service

Step 10: Bootstrap the proxy factory

Step 11: Wire up Next.js route handlers

Step 12: Write and run the tests

Next steps