OpenRouter Budget Guardrails for BigCommerce SMB Customer Support

A cost-control layer that enforces budget limits, routes to cheaper models when needed, and tracks spending for BigCommerce support chatbots.

openrouter cost-control bigcommerce customer-support express budget-guardrails llm-caching llm-routing

The problem

BigCommerce merchants running AI-powered customer support chatbots face unpredictable LLM costs, risking budget overruns. Without real-time spend governance, a traffic spike can lead to surprise bills.

Built from

Intro

BigCommerce merchants running AI-powered customer support chatbots face unpredictable LLM costs. Without real-time spend governance, a traffic spike from a flash sale or holiday rush can produce a surprise bill that wipes out margins. This tutorial walks you through building a self-policing cost-control layer that enforces per-tenant budget limits, dynamically routes to cheaper models when budgets tighten, tracks every penny spent, and caches frequently asked questions to avoid redundant API calls. You’ll wire up six @reaatech/* packages inside a Next.js 16 App Router project, orchestrated by a single API route that handles the full budget-check-route-record-cache lifecycle.
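The budget-check, route, record, cache lifecycle named above can be sketched as one function. This is a minimal, dependency-free illustration: `LifecycleDeps`, `LifecycleResult`, and `handleChat` are hypothetical names introduced here, not part of the @reaatech/* packages, and the order of the cache lookup relative to the budget check is an assumption.

```typescript
// Hypothetical dependency surface; the real route handler wires in the @reaatech/* packages.
interface LifecycleDeps {
  cacheGet(prompt: string): Promise<string | undefined>;
  cacheSet(prompt: string, reply: string): Promise<void>;
  // Returns the model to use, or null when the tenant's budget is exhausted.
  checkBudget(tenantId: string, preferredModel: string): string | null;
  callModel(model: string, prompt: string): Promise<{ reply: string; costUsd: number }>;
  recordSpend(tenantId: string, costUsd: number): void;
}

interface LifecycleResult {
  reply: string;
  model: string;
  costUsd: number;
  cached: boolean;
}

async function handleChat(
  deps: LifecycleDeps,
  tenantId: string,
  prompt: string,
  preferredModel: string,
): Promise<LifecycleResult> {
  // 1. Cache: a hit costs nothing, so it is served before any budget check (assumed order).
  const hit = await deps.cacheGet(prompt);
  if (hit !== undefined) {
    return { reply: hit, model: "cache", costUsd: 0, cached: true };
  }

  // 2. Budget check: may keep the preferred model, pick a cheaper one, or refuse.
  const model = deps.checkBudget(tenantId, preferredModel);
  if (model === null) {
    throw new Error(`Budget exhausted for tenant ${tenantId}`);
  }

  // 3. Route: call whichever model the budget check selected.
  const { reply, costUsd } = await deps.callModel(model, prompt);

  // 4. Record the spend, then 5. cache the reply for the next identical question.
  deps.recordSpend(tenantId, costUsd);
  await deps.cacheSet(prompt, reply);
  return { reply, model, costUsd, cached: false };
}
```

Passing the dependencies in as an object keeps the sketch testable with stubs; a second call with the same prompt should come back with `cached: true` and no additional spend recorded.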

Prerequisites

Node.js 22+ and pnpm 10 installed on your machine
An OpenRouter API key — sign up at openrouter.ai/keys and add a credit balance
A Helicone API key — sign up at helicone.ai for cost telemetry (free tier works)
Basic familiarity with TypeScript and Next.js App Router route handlers
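
Both API keys need to reach the application at runtime, so it helps to fail fast when one is missing. The helper below is a hedged sketch: `readRequiredKeys` is a hypothetical function, and while `OPENROUTER_API_KEY` appears in this recipe's code, `HELICONE_API_KEY` is an assumed variable name.

```typescript
// Hypothetical helper; takes the environment as a parameter so it is easy to test.
type EnvLike = Record<string, string | undefined>;

function readRequiredKeys(env: EnvLike, names: string[]): Record<string, string> {
  // Collect every missing or blank variable so the error reports them all at once.
  const missing = names.filter((key) => {
    const value = env[key];
    return value === undefined || value.trim() === "";
  });
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
  const resolved: Record<string, string> = {};
  for (const key of names) {
    resolved[key] = env[key] as string;
  }
  return resolved;
}
```

In a Node.js process this would typically be called once at startup as `readRequiredKeys(process.env, ["OPENROUTER_API_KEY", "HELICONE_API_KEY"])`.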

Step 1: Create the project and install dependencies

Start by scaffolding a Next.js project and installing the dependencies this recipe needs.

terminal

npx create-next-app@latest openrouter-budget-guardrails --typescript --app

Example artifact

A complete, working implementation of this recipe, downloadable as a zip or browsable file by file. Generated by our build pipeline and tested before publishing (94.3% coverage).


199 kB · 63 tests · 94.3% coverage · vitest passing

SHA-256: d6346c20e3b85d7710f875458e34a9e43d7a8da32bb5202328941323c30a5b58


// Shared types; the other modules import this file as "./types.js".
import { z } from "zod";

// Incoming chat request. useCase falls back to "bigcommerce-support" when omitted.
export const ChatRequestSchema = z.object({
  prompt: z.string().min(1),
  useCase: z.string().default("bigcommerce-support"),
  customerId: z.string().min(1),
  tenantId: z.string().min(1),
});

export type ChatRequest = z.infer<typeof ChatRequestSchema>;

export interface ChatResponse {
  reply: string;
  model: string;
  costUsd: number;
  cached: boolean;
  inputTokens: number;
  outputTokens: number;
  latencyMs: number;
}

export interface BudgetState {
  spent: number;
  remaining: number;
  state: "Active" | "Warned" | "Degraded" | "Stopped";
}

// Maps the controller's lowercase state strings to the capitalized states above.
export const BUDGET_STATE_MAP: Record<string, BudgetState["state"]> = {
  active: "Active",
  warned: "Warned",
  degraded: "Degraded",
  stopped: "Stopped",
};

// Structural interface for the budget controller, so tests can pass in a stub.
export interface BudgetControllerLike {
  defineBudget(definition: unknown): void | Promise<void>;
  check(request: unknown): BudgetCheckResultLike;
  record(entry: unknown): void | Promise<void>;
  getState(
    scopeType: unknown,
    scopeKey: string,
  ): { spent: number; remaining: number; state: string } | undefined;
  suggestDowngrade(currentModel: string, scopeType: unknown, scopeKey: string): string | undefined;
  on(event: string, callback: (...args: unknown[]) => void): void;
}

export interface BudgetCheckResultLike {
  allowed: boolean;
  action: string;
  suggestedModel?: string | null;
  disabledTools?: string[];
}

// Structural interface for the cache engine, so tests can pass in a stub.
export interface CacheEngineLike {
  get(
    prompt: string,
    options?: unknown,
  ): Promise<{ hit: boolean; type?: string; reason?: string; entry?: unknown; confidence?: number }>;
  set(prompt: string, response: unknown, options?: unknown, metadata?: unknown): Promise<unknown>;
  invalidate(criteria: unknown): Promise<unknown>;
}

// Shape of the router's YAML configuration.
export interface RouterConfigYaml {
  models: {
    workhorses: Array<{
      id: string;
      provider: string;
      cost_per_million_input: number;
      cost_per_million_output: number;
      max_tokens: number;
      capabilities: string[];
    }>;
    judges: Array<{
      id: string;
      provider: string;
      cost_per_million_input: number;
      cost_per_million_output: number;
      max_tokens: number;
      capabilities: string[];
    }>;
  };
  strategies: Record<string, { type: string; workhorse_pool: string[]; judge_pool?: string[] }>;
  budgets: Record<string, { daily_limit: number; alert_thresholds: number[]; hard_limit: boolean }>;
}
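
To make the rules encoded in `ChatRequestSchema` concrete, the sketch below spells them out by hand without Zod. `ChatRequestShape` and `parseChatRequest` are hypothetical names used only for illustration; in the recipe itself the Zod schema does this work.

```typescript
// Hand-written mirror of the schema's rules, for illustration only.
interface ChatRequestShape {
  prompt: string;
  useCase: string;
  customerId: string;
  tenantId: string;
}

function parseChatRequest(body: unknown): ChatRequestShape {
  if (typeof body !== "object" || body === null) {
    throw new Error("body must be an object");
  }
  const raw = body as Record<string, unknown>;

  // Mirrors z.string().min(1): must be a string with at least one character.
  const requireNonEmpty = (key: string): string => {
    const value = raw[key];
    if (typeof value !== "string" || value.length < 1) {
      throw new Error(`${key} must be a non-empty string`);
    }
    return value;
  };

  // Mirrors z.string().default(...): the default applies only when the field is undefined.
  const useCase = raw["useCase"] === undefined ? "bigcommerce-support" : raw["useCase"];
  if (typeof useCase !== "string") {
    throw new Error("useCase must be a string");
  }

  return {
    prompt: requireNonEmpty("prompt"),
    useCase,
    customerId: requireNonEmpty("customerId"),
    tenantId: requireNonEmpty("tenantId"),
  };
}
```

A request without `useCase` is accepted and tagged with the default, while an empty `prompt` is rejected before any model call is made.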

// Budget middleware: thin wrappers around the budget controller, scoped per tenant.
import { BudgetController } from "@reaatech/agent-budget-engine";
import { SpendStore } from "@reaatech/agent-budget-spend-tracker";
import { BudgetScope, BudgetExceededError } from "@reaatech/agent-budget-types";
import type { BudgetState, BudgetControllerLike, BudgetCheckResultLike } from "./types.js";
import { BUDGET_STATE_MAP } from "./types.js";

// Creates a controller and logs threshold and hard-stop events.
export function createBudgetController(): BudgetController {
  const ctrl = new BudgetController({ spendTracker: new SpendStore() });
  ctrl.on("threshold-breach", (evt) => {
    console.warn("threshold-breach", evt);
  });
  ctrl.on("hard-stop", (evt) => {
    console.error("hard-stop", evt);
  });
  return ctrl;
}

// Registers a daily budget for one tenant, with a downgrade rule for every model.
export async function defineTenantBudget(
  ctrl: BudgetControllerLike,
  tenantId: string,
  dailyLimit: number,
): Promise<void> {
  await ctrl.defineBudget({
    scopeType: BudgetScope.User,
    scopeKey: tenantId,
    limit: dailyLimit,
    policy: {
      softCap: 0.8,
      hardCap: 1.0,
      autoDowngrade: [{ from: ["*"], to: "mistralai/mistral-small-latest" }],
    },
  });
}

// Asks the controller whether a request with the estimated cost may proceed.
export function withBudgetCheck(
  ctrl: BudgetControllerLike,
  tenantId: string,
  estimatedCostUsd: number,
  modelId: string,
): Promise<BudgetCheckResultLike> {
  const result = ctrl.check({
    scopeType: BudgetScope.User,
    scopeKey: tenantId,
    estimatedCost: estimatedCostUsd,
    modelId,
    tools: [],
  });
  return Promise.resolve({
    allowed: result.allowed,
    action: result.action,
    suggestedModel: result.suggestedModel ?? null,
    disabledTools: result.disabledTools ?? [],
  });
}

// Records the actual cost of a completed request against the tenant's budget.
export async function recordSpend(
  ctrl: BudgetControllerLike,
  tenantId: string,
  requestId: string,
  cost: number,
  inputTokens: number,
  outputTokens: number,
  modelId: string,
  provider: string,
): Promise<void> {
  await ctrl.record({
    requestId,
    scopeType: BudgetScope.User,
    scopeKey: tenantId,
    cost,
    inputTokens,
    outputTokens,
    modelId,
    provider,
    timestamp: new Date(),
  });
}

// Returns the tenant's budget state, defaulting to an "Active" state with zero totals.
export function getBudgetState(
  ctrl: BudgetControllerLike,
  tenantId: string,
): BudgetState {
  const budgetState = ctrl.getState(BudgetScope.User, tenantId);
  const rawState = budgetState?.state ?? "active";
  return {
    spent: budgetState?.spent ?? 0,
    remaining: budgetState?.remaining ?? 0,
    state: BUDGET_STATE_MAP[rawState] ?? "Active",
  };
}

// Returns a cheaper model suggestion for the tenant, or null when there is none.
export function getSuggestedDowngrade(
  ctrl: BudgetControllerLike,
  modelId: string,
  tenantId: string,
): string | null {
  return ctrl.suggestDowngrade(modelId, BudgetScope.User, tenantId) ?? null;
}

export { BudgetExceededError };
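
The policy passed to `defineBudget` uses `softCap: 0.8` and `hardCap: 1.0`. How the budget engine interprets these values is not shown here, so the following is only a sketch under the assumption that they are fractions of the limit: projected spend at or above the soft cap triggers a downgrade, and projected spend above the hard cap blocks the request. `CapPolicy`, `CapDecision`, and `decide` are hypothetical names.

```typescript
// Assumed semantics: caps are fractions of the budget limit. Not the engine's confirmed behavior.
interface CapPolicy {
  softCap: number;
  hardCap: number;
  downgradeTo: string;
}

interface CapDecision {
  allowed: boolean;
  action: "allow" | "downgrade" | "block";
  model: string | null;
}

function decide(
  spent: number,
  estimatedCost: number,
  limit: number,
  requestedModel: string,
  policy: CapPolicy,
): CapDecision {
  // Fraction of the limit that would be used if this request went through.
  const projected = (spent + estimatedCost) / limit;
  if (projected > policy.hardCap) {
    return { allowed: false, action: "block", model: null };
  }
  if (projected >= policy.softCap) {
    return { allowed: true, action: "downgrade", model: policy.downgradeTo };
  }
  return { allowed: true, action: "allow", model: requestedModel };
}
```

With a limit of 10 USD, a tenant at 5 USD keeps the requested model, a tenant at 8 USD is moved to the downgrade model, and a tenant at 9.90 USD is blocked when the next request is estimated at 0.50 USD.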

// Cost tracker: builds a cost span per request and forwards it to Helicone.
import { generateId, now, calculateCostFromTokens, type CostSpan, CostSpanSchema } from "@reaatech/llm-cost-telemetry";
import { HeliconeAsyncLogger } from "@helicone/async";

export class CostTracker {
  private heliconeLogger: HeliconeAsyncLogger | null = null;

  constructor(private config: { heliconeApiKey: string }) {}

  // Creates the Helicone logger on first use.
  private getLogger(): HeliconeAsyncLogger {
    if (!this.heliconeLogger) {
      this.heliconeLogger = new HeliconeAsyncLogger({
        apiKey: this.config.heliconeApiKey,
        providers: {},
      });
    }
    return this.heliconeLogger;
  }

  // Opens a span with zeroed token counts and cost; completeSpan fills them in.
  startSpan(
    provider: string,
    model: string,
    tenantId: string,
    feature: string,
  ): CostSpan {
    return {
      id: generateId(),
      provider,
      model,
      tenant: tenantId,
      feature,
      timestamp: now(),
      inputTokens: 0,
      outputTokens: 0,
      costUsd: 0,
    } as CostSpan;
  }

  // Fills in token counts and the total cost, then validates the span.
  completeSpan(
    span: CostSpan,
    inputTokens: number,
    outputTokens: number,
    pricePerMillionInput: number,
    pricePerMillionOutput: number,
  ): CostSpan {
    const inputCost = calculateCostFromTokens(inputTokens, pricePerMillionInput);
    const outputCost = calculateCostFromTokens(outputTokens, pricePerMillionOutput);
    const completed = {
      ...span,
      inputTokens,
      outputTokens,
      costUsd: inputCost + outputCost,
    };
    return CostSpanSchema.parse(completed);
  }

  // Logs the span to Helicone; failures are logged and swallowed so telemetry never breaks a request.
  logToHelicone(span: CostSpan): Promise<void> {
    return Promise.resolve().then(() => {
      try {
        const validated = CostSpanSchema.parse(span);
        const logger = this.getLogger();
        logger.init();
        logger.withProperties(
          {
            provider: validated.provider,
            model: validated.model,
            tenant: validated.tenant ?? "",
            feature: validated.feature ?? "",
            costUsd: String(validated.costUsd),
            inputTokens: String(validated.inputTokens),
            outputTokens: String(validated.outputTokens),
          },
          () => {
            console.debug("Helicone telemetry recorded:", validated.id);
          },
        );
      } catch (error) {
        console.error("Failed to log to Helicone:", error);
      }
    });
  }
}
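
`completeSpan` adds an input cost and an output cost, each derived from a token count and a per-million-token price. The arithmetic can be sketched as follows; this assumes `calculateCostFromTokens` computes tokens divided by one million, times the price, which is inferred from its arguments rather than confirmed. `costFromTokens` and `spanCost` are hypothetical names, and the prices in the usage note are made up for the example.

```typescript
// Assumed formula: cost = (tokens / 1,000,000) * price per million tokens.
function costFromTokens(tokens: number, pricePerMillionUsd: number): number {
  return (tokens / 1e6) * pricePerMillionUsd;
}

// Total cost of one request: input and output tokens are priced separately.
function spanCost(
  inputTokens: number,
  outputTokens: number,
  pricePerMillionInput: number,
  pricePerMillionOutput: number,
): number {
  return (
    costFromTokens(inputTokens, pricePerMillionInput) +
    costFromTokens(outputTokens, pricePerMillionOutput)
  );
}
```

For example, 1,200 input tokens at a hypothetical 0.20 USD per million plus 300 output tokens at 0.60 USD per million comes to 0.00024 + 0.00018 = 0.00042 USD, up to floating-point rounding.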

// Semantic cache: in-memory storage with an embedding-based similarity lookup.
import OpenAI from "openai";
import { CacheEngine, InMemoryAdapter, OpenAIEmbedder, type CacheResult } from "@reaatech/llm-cache";
import type { CacheEngineLike } from "./types.js";

// The client parameter is currently unused; the embedder reads its key from the environment.
export function createCacheEngine(_openaiClient: OpenAI): CacheEngine {
  void _openaiClient;
  const embedder = new OpenAIEmbedder({
    provider: "openai",
    model: "text-embedding-3-small",
    dimensions: 1536,
    apiKey: process.env.OPENROUTER_API_KEY ?? "",
  });
  return new CacheEngine({
    storage: new InMemoryAdapter(),
    vectorStorage: new InMemoryAdapter(),
    embedder,
    config: {
      storage: { adapter: "memory" },
      vectorStorage: { adapter: "memory" },
      embedding: {
        provider: "openai",
        model: "text-embedding-3-small",
        dimensions: 1536,
        batchSize: 100,
        maxRetries: 3,
      },
      similarity: { threshold: 0.85, metric: "cosine", maxResults: 3 },
      ttl: {
        default: 3600,
        factual: 1800,
        creative: 7200,
        analytical: 3600,
        sensitive: 600,
        byUseCase: {},
      },
      segmentation: { enabled: true, defaultUseCase: "bigcommerce-support" },
      cost: { enabled: true, currency: "USD" },
      observability: { metrics: true, tracing: false, logging: "info" },
    },
  });
}

// Looks up a prompt in the cache for the given use case and model.
export async function getCached(
  engine: CacheEngine | CacheEngineLike,
  prompt: string,
  useCase: string,
  model: string,
): Promise<CacheResult> {
  const result = await engine.get(prompt, { useCase, model, modelVersion: "openrouter" });
  return result as CacheResult;
}

// Stores a response; entries are tagged "factual", so the factual TTL applies.
export async function setCache(
  engine: CacheEngine | CacheEngineLike,
  prompt: string,
  response: object,
  useCase: string,
  model: string,
  tokens: { prompt: number; completion: number },
): Promise<void> {
  await engine.set(
    prompt,
    response,
    { model, modelVersion: "openrouter", useCase },
    { queryType: "factual", tokens },
  );
}

// Drops cache entries older than the given date.
export async function invalidateOlderThan(
  engine: CacheEngine | CacheEngineLike,
  olderThan: Date,
): Promise<void> {
  await engine.invalidate({ olderThan });
}
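
The cache configuration uses a cosine metric with a similarity threshold of 0.85. The sketch below shows what such a lookup amounts to on plain number arrays: compute the cosine similarity between the query embedding and each stored embedding, and return the best match only if it reaches the threshold. `cosineSimilarity`, `StoredAnswer`, and `lookupSimilar` are hypothetical names; the library's internal lookup may differ.

```typescript
// Cosine similarity of two equal-length vectors; returns 0 if either has zero magnitude.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length || a.length === 0) {
    throw new Error("vectors must be non-empty and of equal length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  a.forEach((x, i) => {
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  });
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

interface StoredAnswer {
  embedding: number[];
  reply: string;
}

// Returns the most similar stored answer at or above the threshold, or undefined on a miss.
function lookupSimilar(
  query: number[],
  entries: StoredAnswer[],
  threshold = 0.85,
): StoredAnswer | undefined {
  let best: StoredAnswer | undefined;
  let bestScore = threshold;
  for (const entry of entries) {
    const score = cosineSimilarity(query, entry.embedding);
    if (score >= bestScore) {
      best = entry;
      bestScore = score;
    }
  }
  return best;
}
```

A higher threshold reduces the chance of serving an answer to a question that only looks similar, at the price of fewer cache hits and more paid model calls.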

Intro

Prerequisites

Step 1: Create the project and install dependencies

Step 2: Configure environment variables

Step 3: Create the configuration loader

Step 4: Define shared types with Zod

Step 5: Implement the budget middleware

Step 6: Build the cost tracker

Step 7: Create the model router

Step 8: Implement the semantic cache

Step 9: Create the API route handler

Step 10: Set up instrumentation and Next.js config

Step 11: Create the programmatic entry point

Step 12: Set up the vitest configuration

Step 13: Run the tests

Next steps