Vercel AI Gateway Observability for SMB AI Agent Operations

Unified OpenTelemetry tracing, cost tracking, and performance alerts for every LLM call routed through Vercel AI Gateway.

vercel-ai-gateway observability opentelemetry llm-cost-tracking nextjs hono langfuse supabase smb-ai-agents

The problem

Small businesses deploying AI agents on multiple models through Vercel AI Gateway lack visibility into token consumption, latency, and failure rates across providers. Without centralized monitoring, they cannot pinpoint cost spikes, detect degradation, or enforce budgets, leading to runaway bills and unreliable customer experiences.

Built from

Intro

Small businesses running AI agents on multiple models through Vercel AI Gateway quickly lose visibility into token consumption, latency, and failure rates across providers. Without centralized monitoring, cost spikes go unnoticed, performance degrades silently, and budgets get blown. This tutorial builds an observability layer that auto-instruments every LLM call with OpenTelemetry GenAI semantics, tracks per-agent spend, enforces budgets, and sends Slack alerts — without writing per-provider instrumentation code.

You’ll use the REAA telemetry ecosystem (@reaatech/otel-genai-semconv-core, @reaatech/llm-cost-telemetry, and friends) to instrument your gateway, aggregate costs in Supabase, export traces to Langfuse, and surface everything through a Next.js admin dashboard with a Hono API backend.

Prerequisites

Node.js 22+ — the project uses "node": ">=22" and ESM ("type": "module")
pnpm 10+ — the package manager is pinned in package.json as "packageManager": "pnpm@10.0.0"
Vercel AI Gateway account and API key — the dashboard fetches real-time metrics from the gateway
Langfuse account — for trace export (get your public and secret keys from the project settings)
Supabase project — for persisting cost spans, budget configs, and alerts
Slack webhook URL — optional, for budget alert notifications
TypeScript and basic Next.js App Router familiarity

Step 1: Scaffold the Next.js project and install dependencies

Create a new Next.js 16 project with the App Router and TypeScript:

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

182 kB·93 tests·97.3% coverage·vitest passing

SHA-25676462fb763ce060b61bf0d127386aedbc8da609ed300ae9fb6db67826c3260ce

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js 22+ — the project uses "node": ">=22" and ESM ("type": "module")
pnpm 10+ — the package manager is pinned in package.json as "packageManager": "pnpm@10.0.0"
Vercel AI Gateway account and API key — the dashboard fetches real-time metrics from the gateway
Langfuse account — for trace export (get your public and secret keys from the project settings)
Supabase project — for persisting cost spans, budget configs, and alerts
Slack webhook URL — optional, for budget alert notifications
TypeScript and basic Next.js App Router familiarity

Step 1: Scaffold the Next.js project and install dependencies

Create a new Next.js 16 project with the App Router and TypeScript:

import { generateId, now, calculateCostFromTokens, CostSpanSchema, type CostSpan, type Provider } from "@reaatech/llm-cost-telemetry"; import { calculateCost, estimateCost } from "@reaatech/llm-cost-telemetry-calculator"; import type { CostCollector } from "@reaatech/llm-cost-telemetry-aggregation"; import { insertCostSpan } from "./supabase.js"; export class CostTracker { private collector: CostCollector; private supabaseClient: { insertCostSpan: typeof insertCostSpan }; constructor(collector: CostCollector, supabaseClient: { insertCostSpan: typeof insertCostSpan }) { this.collector = collector; this.supabaseClient = supabaseClient; } async trackCall(params: { provider: string; model: string; inputTokens: number; outputTokens: number; tenant: string; feature?: string; cacheReadTokens?: number; cacheCreationTokens?: number; }) { const result = calculateCost({ provider: params.provider as Provider, model: params.model, inputTokens: params.inputTokens, outputTokens: params.outputTokens, cacheReadTokens: params.cacheReadTokens, cacheCreationTokens: params.cacheCreationTokens, }); calculateCostFromTokens(params.inputTokens, 0); const span: CostSpan = { id: generateId(), provider: params.provider as Provider, model: params.model, inputTokens: params.inputTokens, outputTokens: params.outputTokens, costUsd: result.costUsd, tenant: params.tenant, feature: params.feature ?? "default", timestamp: now(), }; CostSpanSchema.parse(span); this.collector.add(span); await this.supabaseClient.insertCostSpan(span); return span; } async estimateCallCost(params: { provider: string; model: string; estimatedInputTokens: number; estimatedOutputTokens: number; }) { const result = await estimateCost({ provider: params.provider as Provider, model: params.model, inputTokens: params.estimatedInputTokens, outputTokens: params.estimatedOutputTokens, }); return { costUsd: result.usd, confidence: result.confidence }; } } export function createCostTracker( collector: CostCollector, supabaseClient: { insertCostSpan: typeof insertCostSpan }, ) { return new CostTracker(collector, supabaseClient); }

import { CostCollector, CostAggregator, BudgetManager } from "@reaatech/llm-cost-telemetry-aggregation"; import type { AppConfig } from "../config.js"; import { getBudgetConfigs } from "../lib/supabase.js"; export class CostAggregationService { collector: CostCollector; aggregator: CostAggregator; budget: BudgetManager; constructor(config: AppConfig) { this.aggregator = new CostAggregator({ dimensions: ["tenant", "feature", "provider", "model"], timeWindows: ["hour", "day", "month"], }); this.budget = new BudgetManager({ global: { daily: config.defaultDailyBudget, monthly: config.defaultMonthlyBudget }, tenants: {}, alerts: [ { threshold: 0.5, action: "log" }, { threshold: 0.75, action: "notify" }, { threshold: 0.9, action: "block" }, ], }); this.collector = new CostCollector({ maxBufferSize: 1000, flushIntervalMs: 60000, onFlush: (spans) => { for (const s of spans) { this.aggregator.add(s); void this.budget.record({ tenant: s.tenant ?? "default", cost: s.costUsd }); } }, }); } async init() { try { const configs = await getBudgetConfigs(); for (const c of configs) { const row = c as { tenant: string; daily?: number; monthly?: number }; this.budget.setLimits(row.tenant, { daily: row.daily, monthly: row.monthly }); } } catch { // budget configs not available — use defaults } } async checkBudget(tenant: string, estimatedCost: number) { return await this.budget.check({ tenant, estimatedCost }); } getTenantCosts(tenant: string, period?: string) { const window = period as "hour" | "day" | "month" | undefined; return this.aggregator.getByTenant(tenant, window); } getSummary(options?: { period?: string; groupBy?: string[] }) { return this.aggregator.getSummary({ period: (options?.period ?? "day") as "hour" | "day" | "month", groupBy: options?.groupBy as ("tenant" | "feature" | "provider" | "model")[] | undefined, }); } async flush() { await this.collector.flush(); } close() { void this.collector.close(); } } export async function createCostAggregationService(config: AppConfig) { const service = new CostAggregationService(config); await service.init(); return service; }

import { TelemetryContextSchema, type TelemetryContext, type Provider } from "@reaatech/llm-cost-telemetry"; import { calculateCost } from "@reaatech/llm-cost-telemetry-calculator"; import type { LLMRequest, LLMResponse, CostData } from "@reaatech/otel-genai-semconv-core"; import type { SpanEnricher } from "../lib/span-enricher.js"; import type { CostTracker } from "../lib/cost-tracker.js"; import type { CostAggregationService } from "./cost-aggregation-service.js"; import type { CostLogger, MetricsManager } from "@reaatech/llm-cost-telemetry-observability"; export class TelemetryService { enricher: SpanEnricher; costTracker: CostTracker; aggregation: CostAggregationService; logger: CostLogger; metrics: MetricsManager; constructor(deps: { enricher: SpanEnricher; costTracker: CostTracker; aggregation: CostAggregationService; logger: CostLogger; metrics: MetricsManager; }) { this.enricher = deps.enricher; this.costTracker = deps.costTracker; this.aggregation = deps.aggregation; this.logger = deps.logger; this.metrics = deps.metrics; } async recordLLMCall(request: LLMRequest, response: LLMResponse, context: TelemetryContext) { TelemetryContextSchema.parse(context); const span = this.enricher.buildRequestSpan(request); this.enricher.enrichWithResponse(span, response); const costResult = calculateCost({ provider: request.model.split("-")[0] as Provider, model: request.model, inputTokens: response.usage.inputTokens, outputTokens: response.usage.outputTokens, }); const costData: CostData = { total: costResult.costUsd, input: costResult.breakdown.inputCostUsd, output: costResult.breakdown.outputCostUsd, currency: "USD", }; this.enricher.addCostData(span, costData); const costSpan = await this.costTracker.trackCall({ provider: request.model.split("-")[0], model: request.model, inputTokens: response.usage.inputTokens, outputTokens: response.usage.outputTokens, tenant: context.tenant ?? "default", feature: context.feature, }); this.metrics.recordCostSpan(costSpan); this.enricher.finalizeOk(span); this.logger.logCostSpan(costSpan); return costSpan; } recordError(span: unknown, error: Error, context: TelemetryContext) { this.enricher.recordError(span, error); this.metrics.recordError("unknown", "unknown", error.message, context.tenant); this.logger.logError(error, { tenant: context.tenant }); this.enricher.finalizeOk(span); } } export function createTelemetryService(deps: { enricher: SpanEnricher; costTracker: CostTracker; aggregation: CostAggregationService; logger: CostLogger; metrics: MetricsManager; }) { return new TelemetryService(deps); }

Vercel AI Gateway Observability for SMB AI Agent Operations

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Step 2: Configure environment variables

Step 3: Create typed configuration with Zod

Step 4: Create the Supabase client wrapper

Step 5: Build the span enricher

Step 6: Create the cost tracker

Step 7: Wire up the cost aggregation service

Step 8: Set up OpenTelemetry instrumentation

Step 9: Build the telemetry service — the orchestration hub

Step 10: Create the Hono API

Step 11: Wire up the Next.js route handler and instrumentation hook

Step 12: Create the budget alert scheduler

Step 13: Create the dashboard page

Step 14: Run the tests

Next steps