SMBs adopting AWS Bedrock for AI features often see unpredictable costs because there are no native spend controls. A single mis-sized prompt or a traffic spike can burn through a month's budget in hours, with no granular insight into which model or user caused it.
In this tutorial, you’ll build a budget-aware proxy around AWS Bedrock LLM calls using Next.js and REAA’s budget engineering packages. Every Bedrock invocation passes through a BudgetInterceptor that estimates cost, checks remaining budget, optionally downgrades the model, records the spend, and pushes metrics to Helicone. You’ll also get a dashboard page that shows spend across all budget scopes, plus API routes to query state and invoke models with automatic enforcement.
Prerequisites
Node.js >= 22
pnpm 10.x (install with npm install -g pnpm)
AWS account with Bedrock access (access key ID, secret access key, and region)
Helicone account for AI observability (API key — free tier works)
Basic familiarity with TypeScript and Next.js App Router
Step 1: Initialize the project
Start by creating a fresh Next.js project with TypeScript. The App Router is required for this recipe’s route handler and instrumentation hook setup.
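One way to scaffold it (the project name comes from this recipe; the flags are standard create-next-app options, so adjust to taste):

```shell
pnpm create next-app@latest bedrock-cost-control --typescript --app --eslint
cd bedrock-cost-control
```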
Expected output: A scaffolded Next.js app at ./bedrock-cost-control/ with package.json, tsconfig.json, and the app/ directory.
Step 2: Install dependencies
The recipe uses six REAA budget packages alongside the AWS Bedrock runtime client, Helicone helpers, p-retry for retry logic, and Zod for schema validation.
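A single install command covering the packages this recipe references (the Helicone helper package name is an assumption; check the Helicone docs for the current package):

```shell
pnpm add @reaatech/agent-budget-types @reaatech/agent-budget-pricing \
  @reaatech/agent-budget-spend-tracker @reaatech/agent-budget-middleware \
  @reaatech/agent-budget-llm-router-plugin @reaatech/agent-budget-otel-bridge \
  @aws-sdk/client-bedrock-runtime @helicone/helpers p-retry zod
```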
Expected output: All packages added to node_modules and package.json.
Step 3: Set up environment variables
Create a .env file in the project root. The app reads AWS credentials, Helicone config, and budget defaults from environment variables. The .env.example file declares every variable with a comment describing its purpose, and the Zod schema in src/lib/env.ts enforces at startup that AWS_REGION, AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and HELICONE_API_KEY are non-empty strings.
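A minimal .env to start from (values are placeholders; the DEFAULT_BUDGET_LIMIT name comes from the budget defaults used later, and its unit of US dollars is an assumption):

```
AWS_REGION=us-east-1
AWS_ACCESS_KEY_ID=your-access-key-id
AWS_SECRET_ACCESS_KEY=your-secret-access-key
HELICONE_API_KEY=your-helicone-api-key
DEFAULT_BUDGET_LIMIT=10
```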
Step 4: Create the environment schema
Create src/lib/env.ts. This file validates environment variables at import time using Zod, exporting a typed env object. If any required variable is missing, the app throws during startup rather than failing silently at runtime.
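The contract this file enforces can be sketched in plain TypeScript (the real file uses Zod; the DEFAULT_BUDGET_LIMIT fallback of 10 is an assumption):

```typescript
// Stand-in for src/lib/env.ts: validate required vars once, at import time.
const REQUIRED = [
  "AWS_REGION",
  "AWS_ACCESS_KEY_ID",
  "AWS_SECRET_ACCESS_KEY",
  "HELICONE_API_KEY",
] as const;

function loadEnv(source: Record<string, string | undefined>) {
  const missing = REQUIRED.filter((k) => !source[k]);
  if (missing.length > 0) {
    // Fail fast at startup instead of failing silently at runtime.
    throw new Error(`Missing required env vars: ${missing.join(", ")}`);
  }
  return {
    AWS_REGION: source.AWS_REGION!,
    AWS_ACCESS_KEY_ID: source.AWS_ACCESS_KEY_ID!,
    AWS_SECRET_ACCESS_KEY: source.AWS_SECRET_ACCESS_KEY!,
    HELICONE_API_KEY: source.HELICONE_API_KEY!,
    DEFAULT_BUDGET_LIMIT: Number(source.DEFAULT_BUDGET_LIMIT ?? "10"),
  };
}
```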
Step 5: Create the shared types
Create src/lib/types.ts. This file holds Zod schemas for request validation and TypeScript interfaces used across the services. The BedrockInvokeBodySchema is the contract for the /api/invoke route.
```ts
import { z } from "zod";

// Request body for the /api/invoke endpoint
export const BedrockInvokeBodySchema = z.object({
  modelId: z.string().min(1),
  prompt: z.string().min(1),
  maxTokens: z.number().int().positive().max(65536),
  tools: z.array(z.record(z.string(), z.unknown())).optional(),
  scopeType: z
    .enum(["task", "user", "session", "org"])
    .optional()
    .default("user"),
  scopeKey: z.string().optional().default("*"),
});

export type BedrockInvokeBody = z.infer<typeof BedrockInvokeBodySchema>;

// Response shape for budget state queries
export interface BudgetStateResponse {
  scopeKey: string;
  scopeType: string;
  limit: number;
  spent: number;
  remaining: number;
  state: string;
  suggestedModel?: string;
}

export type {
  BudgetScope,
  BudgetExceededError,
  BudgetValidationError,
} from "@reaatech/agent-budget-types";

// Helicone log payload interface
export interface HeliconeLogPayload {
  sessionId: string;
  userId?: string;
  modelId: string;
  inputTokens: number;
  outputTokens: number;
  cost: number;
  provider: string;
  timestamp: Date;
}

// LLM call result from Bedrock
export interface LLMCallResult {
  body: string;
  inputTokens: number;
  outputTokens: number;
  modelId: string;
  provider: string;
}
```
Step 6: Create the pricing service
Create src/services/pricing.ts. The PricingEngine from @reaatech/agent-budget-pricing normalizes model names and looks up token pricing with LRU caching. Both estimateCost (pre-flight) and computeCost (post-flight) delegate to it.
```ts
import { PricingEngine } from "@reaatech/agent-budget-pricing";

// Instantiate with caching
const pricingEngine = new PricingEngine({
  cacheTtlMs: 3_600_000, // 1 hour
});

/**
 * Estimate cost for a model and input token count.
 */
export function estimateCost(
  modelId: string,
  estimatedInputTokens: number
): number {
  return pricingEngine.estimateCost(modelId, estimatedInputTokens, "bedrock");
}

/**
 * Compute exact cost from token counts and model.
 */
export function computeCost(
  inputTokens: number,
  outputTokens: number,
  modelId: string
): number {
  return pricingEngine.computeCost(inputTokens, outputTokens, modelId, "bedrock");
}

export { pricingEngine };
```
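To make the underlying math concrete: LLM token pricing is typically quoted per million tokens, so a call's cost is input tokens and output tokens each scaled by their rate. A self-contained sketch with illustrative prices (real prices come from the PricingEngine, not this table):

```typescript
// Hypothetical per-million-token prices; the PricingEngine is the source of truth.
const PRICING: Record<string, { inputPerM: number; outputPerM: number }> = {
  "anthropic.claude-3-haiku-20240307-v1:0": { inputPerM: 0.25, outputPerM: 1.25 },
};

function computeCostLocal(
  inputTokens: number,
  outputTokens: number,
  modelId: string
): number {
  const p = PRICING[modelId];
  if (!p) throw new Error(`Unknown model: ${modelId}`);
  // cost = input/1M * inputPrice + output/1M * outputPrice
  return (
    (inputTokens / 1_000_000) * p.inputPerM +
    (outputTokens / 1_000_000) * p.outputPerM
  );
}

// 1,000 input and 500 output tokens on the illustrative Haiku rate is
// roughly $0.000875, which is why per-call costs only hurt in aggregate.
```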
Step 7: Create the spend-tracker service
Create src/services/spend-tracker.ts. This is a thin wrapper around SpendStore from @reaatech/agent-budget-spend-tracker, providing a lazy singleton that the budget controller references.
```ts
import { SpendStore } from "@reaatech/agent-budget-spend-tracker";

// Lazy singleton pattern
let _store: SpendStore | undefined;

export function getSpendStore(): SpendStore {
  if (!_store) {
    _store = new SpendStore();
  }
  return _store;
}

// Allow re-initialization for testing
export function resetSpendStore(): void {
  _store = undefined;
}
```
Step 8: Create the budget service
Create src/services/budget.ts. This service owns the BudgetController singleton and exposes defineBudget, getBudgetState, and listAllBudgets. The controller combines the spend store and pricing engine to enforce soft/hard caps and auto-downgrade policies.
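The soft/hard cap idea can be pictured with a self-contained sketch (the threshold ratio and state names here are illustrative, not the BudgetController's actual API):

```typescript
// Illustrative cap logic: below the soft threshold all is well, between soft
// and hard the controller may downgrade, at the hard cap it blocks the call.
type BudgetHealth = "healthy" | "warning" | "exceeded";

function classify(spent: number, limit: number, softRatio = 0.8): BudgetHealth {
  if (spent >= limit) return "exceeded"; // hard cap: reject the invocation
  if (spent >= limit * softRatio) return "warning"; // soft cap: consider downgrade
  return "healthy";
}
```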
Step 9: Create the budget interceptor
Create src/services/interceptor.ts. The BudgetInterceptor from @reaatech/agent-budget-middleware wraps the budget controller with a beforeStep/afterStep interface. beforeStep runs a pre-flight check and returns whether the request is allowed, possibly with a downgraded model or stripped tools. afterStep records the actual spend after the LLM call completes.
Step 10: Create the budget-aware router
Create src/services/router.ts. The BudgetAwareStrategy from @reaatech/agent-budget-llm-router-plugin filters the list of candidate models by remaining budget. Use this when you want to pick the best available model given a budget constraint rather than enforcing a hard cap on a single model.
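The filtering idea reduces to something like the sketch below (a stand-in for illustration, not the BudgetAwareStrategy's real interface):

```typescript
// Candidates ordered most-capable first; pick the first one whose estimated
// cost still fits in the remaining budget.
interface Candidate {
  modelId: string;
  estimatedCost: number;
}

function selectAffordable(
  candidates: Candidate[],
  remaining: number
): string | undefined {
  return candidates.find((c) => c.estimatedCost <= remaining)?.modelId;
}
```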
Step 11: Create the Bedrock service
Create src/services/bedrock.ts. This service wraps the AWS SDK's ConverseCommand with a typed invokeModel function that returns the response body and token counts. It throws a custom BedrockServiceError on AWS failures.
Step 12: Create the Helicone service
Create src/services/helicone.ts. This service posts spend entries to Helicone's session log endpoint with retry logic via p-retry. The logUsage call is non-blocking: failures are caught and logged as warnings without failing the request.
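The retry behavior relied on here amounts to exponential backoff; a minimal stand-in (this is not p-retry's actual API, just the shape of what it does for the logger):

```typescript
// Retry an async operation with exponential backoff: 100ms, 200ms, 400ms, ...
async function withRetry<T>(
  fn: () => Promise<T>,
  attempts = 3,
  baseDelayMs = 100
): Promise<T> {
  let lastErr: unknown;
  for (let i = 0; i < attempts; i++) {
    try {
      return await fn();
    } catch (err) {
      lastErr = err;
      // Only sleep if another attempt remains.
      if (i < attempts - 1) {
        await new Promise((r) => setTimeout(r, baseDelayMs * 2 ** i));
      }
    }
  }
  throw lastErr;
}
```

Wrapping logUsage in this and catching the final error is what keeps observability failures from failing the user's request.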
Step 13: Create the OTel bridge
Create src/services/otel.ts. The SpanListener from @reaatech/agent-budget-otel-bridge consumes OpenTelemetry span attributes and records spend entries against the budget controller. The createSpanProcessor function wraps the listener in a drop-in span processor for the OTel SDK.
```ts
import { SpanListener } from "@reaatech/agent-budget-otel-bridge";
import { getBudgetController } from "./budget.js";

let _listener: SpanListener | undefined;

export function getSpanListener(): SpanListener {
  if (!_listener) {
    _listener = new SpanListener({
      controller: getBudgetController(),
    });
  }
  return _listener;
}

export function processSpan(attributes: Record<string, unknown>): boolean {
  const listener = getSpanListener();
  const result = listener.onSpanEnd(attributes);
  return result ?? false;
}

export function createSpanProcessor(): {
  onEnd: (span: { attributes: Record<string, unknown> }) => void;
  forceFlush: () => Promise<void>;
  shutdown: () => Promise<void>;
} {
  const listener = getSpanListener();
  return {
    onEnd(span: { attributes: Record<string, unknown> }) {
      listener.onSpanEnd(span.attributes);
    },
    forceFlush() {
      return Promise.resolve();
    },
    shutdown() {
      return Promise.resolve();
    },
  };
}

export async function initializeTracer(): Promise<void> {
  try {
    const { NodeTracerProvider } = await import(
      "@opentelemetry/sdk-trace-node"
    );
    const provider = new NodeTracerProvider();
    // Wire up our custom span processor
    const processor = createSpanProcessor();
    // Use setSpanProcessor if available, otherwise direct assignment
    if ("setSpanProcessor" in provider) {
      (provider as { setSpanProcessor: (p: unknown) => void }).setSpanProcessor(processor);
    }
    provider.register();
  } catch {
    // OTel may not be available in all environments
  }
}
```
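What the listener does with a span can be sketched standalone. The attribute names below follow the OpenTelemetry GenAI semantic-convention draft and are an assumption; the real SpanListener may key off different attributes:

```typescript
// Extract token usage from span attributes so it can be recorded as spend.
interface SpanSpend {
  modelId: string;
  inputTokens: number;
  outputTokens: number;
}

function spanToSpend(attrs: Record<string, unknown>): SpanSpend | undefined {
  const modelId = attrs["gen_ai.request.model"];
  const input = attrs["gen_ai.usage.input_tokens"];
  const output = attrs["gen_ai.usage.output_tokens"];
  if (
    typeof modelId !== "string" ||
    typeof input !== "number" ||
    typeof output !== "number"
  ) {
    return undefined; // not an LLM span; nothing to record
  }
  return { modelId, inputTokens: input, outputTokens: output };
}
```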
Step 14: Create the index barrel
Create src/index.ts. This re-exports the key public APIs so consumers can import from a single entry point.
```ts
// Re-export key services for external consumption
export {
  getBudgetController,
  defineBudget,
  getBudgetState,
  listAllBudgets,
} from "./services/budget.js";
export { estimateCost, computeCost } from "./services/pricing.js";
export { getSpendStore } from "./services/spend-tracker.js";
export { getInterceptor, beforeStep, afterStep } from "./services/interceptor.js";
export { getBudgetAwareStrategy, selectModel } from "./services/router.js";
export { getSpanListener, processSpan } from "./services/otel.js";
export { logUsage } from "./services/helicone.js";
export { invokeModel, BedrockServiceError } from "./services/bedrock.js";

// Scaffold version constant
export const SCAFFOLD_VERSION = "0.1.0";
```
Step 15: Configure the instrumentation hook
Update next.config.ts to enable the Next.js instrumentation hook. This flag is required for src/instrumentation.ts to run during server startup.
```ts
import type { NextConfig } from "next";

const nextConfig: NextConfig = {
  experimental: {
    // @ts-expect-error instrumentationHook - Next 16 removed this from type but it's still functional
    instrumentationHook: true,
  },
} satisfies NextConfig;

export default nextConfig;
```
Then create src/instrumentation.ts. This file runs once per worker at startup. It defines the default budget (using DEFAULT_BUDGET_LIMIT from env), configures auto-downgrade rules for Claude models, and initializes the OpenTelemetry tracer.
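The auto-downgrade rules amount to walking a chain of progressively cheaper models until the estimate fits. The sketch below mirrors the Opus to Sonnet to Haiku chain mentioned in the next-steps section; the model IDs and the 5x cost ratio are illustrative assumptions, not the middleware's actual configuration:

```typescript
// Each model maps to its cheaper fallback; absent entries have no fallback.
const DOWNGRADE_CHAIN: Record<string, string> = {
  "anthropic.claude-3-opus-20240229-v1:0": "anthropic.claude-3-sonnet-20240229-v1:0",
  "anthropic.claude-3-sonnet-20240229-v1:0": "anthropic.claude-3-haiku-20240307-v1:0",
};

function pickModel(
  requested: string,
  remainingBudget: number,
  estimatedCost: number
): string {
  let model = requested;
  // Walk down the chain while the estimate exceeds what's left.
  while (estimatedCost > remainingBudget && DOWNGRADE_CHAIN[model]) {
    model = DOWNGRADE_CHAIN[model];
    estimatedCost /= 5; // assume each step is roughly 5x cheaper (illustrative)
  }
  return model;
}
```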
Step 16: Create the health route
Create app/api/health/route.ts. This is a minimal health check endpoint.
```ts
import { NextResponse } from "next/server";

export async function GET(_req: Request): Promise<Response> {
  return NextResponse.json({
    status: "ok",
    timestamp: new Date().toISOString(),
  });
}
```
Step 17: Create the invoke route
Create app/api/invoke/route.ts. This is the main endpoint that wires everything together: it validates the request body, estimates cost, runs the budget pre-flight check via beforeStep, invokes the model (possibly with a downgraded model ID), computes the actual cost, records spend via afterStep, and pushes the entry to Helicone.
```ts
import { type NextRequest, NextResponse } from "next/server";
import { BedrockInvokeBodySchema } from "@/lib/types.js";
import { estimateCost, computeCost } from "@/services/pricing.js";
import { invokeModel, BedrockServiceError } from "@/services/bedrock.js";
import { beforeStep, afterStep } from "@/services/interceptor.js";
import { logUsage, type SpendEntry } from "@/services/helicone.js";
import { BudgetExceededError, BudgetScope } from "@reaatech/agent-budget-types";
import { getBudgetState } from "@/services/budget.js";

function generateRequestId(): string {
  return `req_${Date.now()}_${Math.random().toString(36).slice(2, 9)}`;
}

export async function POST(req: NextRequest): Promise<Response> {
  // … (handler body: validate, estimate, pre-flight check, invoke, record, log) …
}
```
Step 18: Create the budget routes
Create app/api/budget/route.ts. This returns all defined budget scopes with their current spent/remaining totals.
Create app/api/budget/[scopeType]/[scopeKey]/route.ts. This returns the budget state for a specific scope type and key pair, validating the scope type against the BudgetScope enum.
```ts
import { type NextRequest, NextResponse } from "next/server";
import { getBudgetState } from "@/services/budget.js";
import { BudgetScope } from "@reaatech/agent-budget-types";

export async function GET(
  _req: NextRequest,
  { params }: { params: Promise<{ scopeType: string; scopeKey: string }> }
): Promise<Response> {
  const { scopeType, scopeKey } = await params;
  if (!Object.values(BudgetScope).includes(scopeType as BudgetScope)) {
    return NextResponse.json({ error: "Invalid scope type" }, { status: 400 });
  }
  const state = getBudgetState(scopeType, scopeKey);
  if (!state) {
    return NextResponse.json({ error: "Budget not found" }, { status: 404 });
  }
  return NextResponse.json(state, { status: 200 });
}
```
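The scope-type guard boils down to a membership check against the four scope values declared in the types file; a self-contained stand-in (the real route uses the BudgetScope enum from the types package):

```typescript
// Mirror of the route's validation; values match the scopeType enum in types.ts.
const BUDGET_SCOPES = ["task", "user", "session", "org"] as const;
type ScopeType = (typeof BUDGET_SCOPES)[number];

function isValidScopeType(s: string): s is ScopeType {
  return (BUDGET_SCOPES as readonly string[]).includes(s);
}
```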
Step 19: Create the dashboard page
Create app/page.tsx. This server component renders a simple budget dashboard by calling listAllBudgets() at render time. Each budget card shows a progress bar, spent/limit amounts, and a state badge.
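The progress bars reduce each budget to a clamped percentage; a small helper of the kind the cards might use (hypothetical, not part of the scaffold's exports):

```typescript
// Percent of budget spent, rounded and clamped to 0-100 for the bar width.
function percentSpent(spent: number, limit: number): number {
  if (limit <= 0) return 0;
  return Math.min(100, Math.round((spent / limit) * 100));
}
```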
Step 20: Run the tests
Run the Vitest suite with pnpm test. Expected passing tests include:
pricing service > happy path: computeCost returns correct value for known model
budget > happy path: listAllBudgets returns defined budgets
The JSON report is written to vitest-report.json.
Step 21: Start the dev server
With environment variables set and tests passing, start the development server:
```terminal
pnpm dev
```
Expected output: The terminal shows Next.js starting on http://localhost:3001. You can visit http://localhost:3001/api/health to verify the server is running — it returns {"status":"ok","timestamp":"..."}. The dashboard at http://localhost:3001/ displays the budget cards, and you can POST to /api/invoke with a JSON body like:
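A representative body matching BedrockInvokeBodySchema (the model ID, prompt, and scope values are illustrative):

```json
{
  "modelId": "anthropic.claude-3-haiku-20240307-v1:0",
  "prompt": "Summarize our Q3 revenue drivers in two sentences.",
  "maxTokens": 512,
  "scopeType": "user",
  "scopeKey": "user-123"
}
```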
A successful invocation returns the model response plus X-Budget-Remaining and X-Budget-Spent headers showing the cost. If the budget for that scope is exhausted, the endpoint returns HTTP 402.
Next steps
Add per-user budgets: Call defineBudget("user", "user-id", 5.0) when a new user signs up, rather than relying solely on the wildcard * scope.
Wire up auto-downgrade: The instrumentation hook already configures downgrade rules from Claude Opus to Sonnet to Haiku. Add your own rules in src/instrumentation.ts for other model families.
Connect a real dashboard: Replace the server component in app/page.tsx with a React dashboard that polls /api/budget every few seconds to show live spend as users make LLM calls.