SMBs adopting LangChain for multi‑step LLM workflows have no built‑in way to see where latency piles up, which chain step costs the most, or why a particular prompt is bleeding tokens. They either fly blind or pay for a separate SaaS with complex setup.
A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.
You’ll build an Express sidecar that instruments any LangChain application with OpenTelemetry traces, cost tracking, and real-time metrics. By the end, you’ll have a standalone server exposing /health and /metrics endpoints, with traces exported to Langfuse and budget-attributed cost data aggregated per model and per chain. If you’re running LLM pipelines for internal tools or customer-facing features without visibility into where your spend goes, this recipe gives you that visibility in an afternoon.
A Langfuse account (free tier at cloud.langfuse.com) with public and secret keys
An OpenTelemetry collector endpoint (Langfuse’s OTLP endpoint works — you’ll configure it in .env)
Familiarity with TypeScript and Express routing
Step 1: Scaffold the project and install dependencies
Create a fresh directory, initialize the project, and set it to ESM ("type": "module" in package.json). Declare all the dependencies in package.json up front, then install everything in one shot.
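For example (the directory name is illustrative, and the full dependency list lives in the artifact's package.json):

terminal
mkdir langchain-observability-sidecar && cd langchain-observability-sidecar
pnpm init
# set "type": "module" in package.json and add the dependencies from the downloadable artifact
pnpm install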
Expected output: pnpm downloads all packages and generates pnpm-lock.yaml. The node_modules directory appears with all runtime and dev dependencies.
Step 2: Configure TypeScript and linting
This project uses strict TypeScript with NodeNext module resolution (required because the project is ESM). Create the TypeScript config and the ESLint config.
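The exact configs are in the downloadable artifact; a minimal tsconfig.json in that spirit looks something like this:

json
{
  "compilerOptions": {
    "target": "ES2022",
    "module": "NodeNext",
    "moduleResolution": "NodeNext",
    "strict": true,
    "noUncheckedIndexedAccess": true,
    "noEmit": true,
    "skipLibCheck": true
  },
  "include": ["src", "tests"]
}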
Expected output: No errors (there are no source files yet, so tsc exits silently).
Step 3: Set environment variables
Create a directory src/ and an .env.example file. The recipe reads these variables to configure Langfuse credentials, the OTLP collector endpoint, and the server port.
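A sketch covering every variable the later steps read (placeholder values shown):

env
LANGFUSE_PUBLIC_KEY=pk-lf-replace-me
LANGFUSE_SECRET_KEY=sk-lf-replace-me
LANGFUSE_HOST=https://cloud.langfuse.com
OTEL_EXPORTER_OTLP_ENDPOINT=https://cloud.langfuse.com/api/public/otel
PORT=3000
# Optional: only needed if you also use LangSmith
LANGCHAIN_API_KEY=
LANGCHAIN_PROJECT=

Copy it to .env with cp .env.example .env.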
Now open .env and replace the placeholders with real values. For Langfuse Cloud, the OTLP endpoint is https://cloud.langfuse.com/api/public/otel. Your Langfuse public and secret keys come from your project settings. LANGCHAIN_API_KEY and LANGCHAIN_PROJECT are for LangSmith — they are optional if you only use Langfuse.
Step 4: Create the utility helpers
Create src/utils.ts. This module provides environment validation, PII redaction, safe JSON parsing, string truncation, and duration formatting — small helpers that the rest of the codebase leans on.
ts
import { logger } from "@reaatech/agent-mesh-observability";
import { redactSensitiveData } from "@reaatech/agent-runbook-observability";

export function validateEnv(vars: string[]): void {
  for (const v of vars) {
    const value = process.env[v];
    if (value === undefined || value === "") {
      throw new Error(`Missing required env: ${v}`);
    }
  }
}

export function redactPayload(data: Record<string, unknown>): Record<string, unknown> {
  return redactSensitiveData(data);
}

export function safeJsonParse(text: string): unknown {
  try {
    return JSON.parse(text);
  } catch {
    logger.warn("safeJsonParse failed", { text });
    return undefined;
  }
}

export function truncateString(s: string, maxLen: number): string {
  if (s.length <= maxLen) {
    return s;
  }
  return s.slice(0, maxLen) + "…";
}

export function formatDuration(ms: number): string {
  if (ms < 1000) {
    return `${String(ms)}ms`;
  }
  const seconds = ms / 1000;
  if (seconds < 60) {
    return `${seconds.toFixed(1)}s`;
  }
  const minutes = Math.floor(seconds / 60);
  const remaining = seconds % 60;
  return `${String(minutes)}m ${remaining.toFixed(0)}s`;
}
Expected output: Run pnpm typecheck to confirm the helpers compile. You may still see errors about missing type declarations for Express, which you'll fix in the next step.
Step 5: Add type declarations
Express 5 and Supertest don’t ship their own TypeScript types in a way that works with strict ESM and noUncheckedIndexedAccess. Create src/ambient.d.ts to declare the shapes your app needs.
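The exact declarations ship with the downloadable artifact. A minimal sketch that satisfies the type imports used in later steps (ExpressApp, ExpressRequest, ExpressResponse, ExpressNext) could look like this:

ts
// src/ambient.d.ts: a sketch only; the artifact also declares the Supertest shapes
declare module "express" {
  import type { Server } from "http";

  export interface ExpressRequest {
    headers: Record<string, string | string[] | undefined>;
    body: unknown;
  }
  export interface ExpressResponse {
    status(code: number): ExpressResponse;
    json(body: unknown): void;
    setHeader(name: string, value: string): void;
  }
  export type ExpressNext = () => void;
  export interface ExpressApp {
    use(handler: unknown): void;
    get(path: string, handler: (req: ExpressRequest, res: ExpressResponse) => void): void;
    listen(port: number, callback?: () => void): Server;
  }

  interface ExpressFactory {
    (): ExpressApp;
    json(): unknown;
  }
  const express: ExpressFactory;
  export default express;
}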
Run pnpm typecheck again — the Express-related errors from step 4 should now be resolved.
Step 6: Create the instrumentation layer
This is where the observability pipeline comes together. src/instrumentation.ts initialises OpenTelemetry, wires up the REAA budget bridge, and exports traces via OTLP.
Create src/instrumentation.ts:
ts
import { SpanListener } from "@reaatech/agent-budget-otel-bridge";
import { BudgetController } from "@reaatech/agent-budget-engine";
import { SpendStore } from "@reaatech/agent-budget-spend-tracker";
import { getTracingManager } from "@reaatech/agent-eval-harness-observability";
import { initOtel, logger, shutdownOtel } from "@reaatech/agent-mesh-observability";
import { initLogger } from "@reaatech/agent-runbook-observability";
import { NodeTracerProvider, BatchSpanProcessor } from "@opentelemetry/sdk-trace-node";
import { OTLPTraceExporter } from "@opentelemetry/exporter-trace-otlp-http";

let initialized = false;

export function createSpanProcessor(listener: SpanListener) {
  return {
    onStart(): void {
      // No-op required by SpanProcessor interface
    },
    onEnd(span: { attributes: Record<string, unknown> }): void {
      listener.onSpanEnd(span.attributes);
    },
    forceFlush: () => Promise.resolve(),
    shutdown: () => Promise.resolve(),
  };
}

export async function register(): Promise<void> {
  if (initialized) {
    return;
  }
  try {
    initOtel();
    const store = new SpendStore();
    const controller = new BudgetController({ spendTracker: store });
    const listener = new SpanListener({ controller });
    const provider = new NodeTracerProvider({
      spanProcessors: [
        new BatchSpanProcessor(new OTLPTraceExporter()),
        createSpanProcessor(listener),
      ],
    });
    provider.register();
    const tracingManager = getTracingManager();
    tracingManager.init();
    await initLogger({ level: "info", service: "langchain-observability-sm" });
    initialized = true;
  } catch (err) {
    logger.error("Instrumentation initialization failed", { err });
    throw err;
  }
}

export { shutdownOtel };
The register() function is the single entry point. It creates a SpanListener that feeds span attributes to the budget controller, which tracks spend per scope. A BatchSpanProcessor attached to an OTLPTraceExporter sends traces to the collector endpoint defined by the OTEL_EXPORTER_OTLP_ENDPOINT env var. The createSpanProcessor function wraps SpanListener as an OTel span processor so that every completed span also updates the in-memory spend tracker.
The function is idempotent — calling register() twice is safe because initialized guards against double-setup.
Step 7: Create the metrics aggregator
The /metrics endpoint needs to produce a JSON summary that answers the questions every SMB operator asks: how much have we spent, on which models, and on which chain? Create src/metrics.ts (the full source is in the downloadable artifact; a sketch of its core logic follows the description below):
The getMetrics function iterates every spend entry in the store and buckets costs by modelId and chainName. Token usage is summed across all entries. Latency percentiles come from the dashboard manager exposed by agent-eval-harness-observability. A 5-second cache prevents the metrics endpoint from re-aggregating on every request when traffic is high. resetMetrics clears the cache and is exported so tests can start from a clean slate.
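A stripped-down sketch of the cost bucketing and the 5-second cache might look like the following (the SpendEntry shape and the getEntries() call are assumptions about the SpendStore API, and the latency percentiles are omitted):

ts
import { SpendStore } from "@reaatech/agent-budget-spend-tracker";

// Assumed entry shape; the actual SpendStore fields may differ.
interface SpendEntry {
  modelId: string;
  chainName: string;
  costUsd: number;
  inputTokens: number;
  outputTokens: number;
}

interface MetricsSummary {
  totalCostUsd: number;
  costByModel: Record<string, number>;
  costByChain: Record<string, number>;
  totalInputTokens: number;
  totalOutputTokens: number;
}

const CACHE_TTL_MS = 5000;
let cached: { at: number; data: MetricsSummary } | undefined;

export function getMetrics(store: SpendStore): MetricsSummary {
  // Serve from the 5-second cache to avoid re-aggregating on every request.
  if (cached && Date.now() - cached.at < CACHE_TTL_MS) {
    return cached.data;
  }
  // getEntries() is an assumption about how SpendStore exposes its entries.
  const entries = (store as unknown as { getEntries: () => SpendEntry[] }).getEntries();
  const data: MetricsSummary = {
    totalCostUsd: 0,
    costByModel: {},
    costByChain: {},
    totalInputTokens: 0,
    totalOutputTokens: 0,
  };
  for (const e of entries) {
    data.totalCostUsd += e.costUsd;
    data.costByModel[e.modelId] = (data.costByModel[e.modelId] ?? 0) + e.costUsd;
    data.costByChain[e.chainName] = (data.costByChain[e.chainName] ?? 0) + e.costUsd;
    data.totalInputTokens += e.inputTokens;
    data.totalOutputTokens += e.outputTokens;
  }
  cached = { at: Date.now(), data };
  return data;
}

export function resetMetrics(): void {
  cached = undefined;
}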
Step 8: Create the LangChain callback handler
The callback handler wraps LangChain’s BaseCallbackHandler to capture every LLM, chain, and tool invocation as an OpenTelemetry span. Create src/langchain-callback.ts:
ts
import { BaseCallbackHandler } from "@langchain/core/callbacks/base";
import type { Serialized } from "@langchain/core/load/serializable";
import type { LLMResult } from "@langchain/core/outputs";
import type { ChainValues } from "@langchain/core/utils/types";
import { getTracingManager } from "@reaatech/agent-eval-harness-observability";
import { logger } from "@reaatech/agent-mesh-observability";
import { truncateString } from "./utils.js";

const MAX_ATTR_LEN = 4096;

interface CallbackConfig {
  listener?: {
    onSpanEnd: (
      attributes: Record<string, unknown>,
      overrides?: Record<string, unknown>,
    ) => boolean;
  };
}

function extractName(serialized: Serialized): string {
  const s = serialized as { id?: string[]; name?: string };
  if (s.name) {
    return s.name;
  }
  if (Array.isArray(s.id) && s.id.length > 0) {
    return s.id[s.id.length - 1] ?? "unknown";
  }
  return "unknown";
}

class ObservabilityCallbackHandler extends BaseCallbackHandler {
  name = "ObservabilityCallbackHandler";
  private spans = new Map<string, ReturnType<ReturnType<typeof getTracingManager>["startEvalRunSpan"]>>();

  // … the LLM, chain, and tool handlers continue in the downloadable artifact
}
Key details: every attribute value is truncated to 4096 characters so a single runaway prompt doesn’t blow up the span payload. The startEvalRunSpan method from agent-eval-harness-observability creates an OpenTelemetry span backed by the SDK internals. The onEnd handler passes span attributes to an optional listener — this is how the budget bridge’s SpanListener gets fed at runtime.
Step 9: Create the Express server
The server exposes two endpoints and wires up logging, graceful shutdown, and error handling. Create src/server.ts:
ts
import express, { type ExpressApp, type ExpressRequest, type ExpressResponse, type ExpressNext } from "express";
import { createChildLogger, logger, shutdownOtel } from "@reaatech/agent-mesh-observability";
import { getMetrics, resetMetrics } from "./metrics.js";
import { register } from "./instrumentation.js";
import { SpendStore } from "@reaatech/agent-budget-spend-tracker";
import { type Server } from "http";

const store = new SpendStore();
let server: Server | undefined;

export function buildApp(): ExpressApp {
  const app = express();
  app.use(express.json());
  app.use((_req: ExpressRequest, _res: ExpressResponse, next: ExpressNext) => {
    const requestId = crypto.randomUUID();
    const child = createChildLogger({ request_id: requestId });
    child.info("Request started");
    next();
  });
  app.get("/health", (_req: ExpressRequest, res: ExpressResponse) => {
    res.json({ status: "ok", uptime: process.uptime() });
  });
  app.get("/metrics", (_req: ExpressRequest, res: ExpressResponse) => {
    try {
      const data = getMetrics(store);
      res.setHeader("Content-Type", "application/json");
      res.json(data);
    } catch (err: unknown) {
      const message = err instanceof Error ? err.message : "Unknown error";
      res.status(503).json({ error: "Metrics collection failed", detail: message });
    }
  });
  return app;
}

export async function bootstrap(): Promise<Server | undefined> {
  await register();
  const application = buildApp();
  const requiredEnv = [
    "LANGFUSE_PUBLIC_KEY",
    "LANGFUSE_SECRET_KEY",
    "LANGFUSE_HOST",
    "OTEL_EXPORTER_OTLP_ENDPOINT",
  ];
  for (const key of requiredEnv) {
    if (!process.env[key]) {
      logger.warn(`Missing env var: ${key}`);
    }
  }
  const port = process.env.PORT ? Number(process.env.PORT) : 3000;
  server = application.listen(port, () => {
    logger.info(`Server listening on port ${String(port)}`);
  });
  return server;
}

export function setupShutdownHandlers(): void {
  process.on("SIGTERM", () => {
    if (server) {
      server.close(() => {
        void shutdownOtel().then(() => {
          process.exit(0);
        });
      });
    }
  });
  process.on("unhandledRejection", (reason) => {
    logger.error("Unhandled rejection", { err: reason });
  });
}

export { resetMetrics };
buildApp() returns the Express application without starting it — this is the pattern for testability. bootstrap() calls register() from the instrumentation module, then starts listening. The SpendStore is a module-level singleton so both the instrumentation layer (which writes spend entries via the SpanListener) and the metrics endpoint (which reads them) share the same store.
The SIGTERM handler drains the OTel exporter before exiting so no in-flight traces are lost.
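The recipe doesn't show an entry point, but assuming a small src/index.ts (a name chosen here for illustration), wiring it together takes a few lines:

ts
// src/index.ts (hypothetical entry point): register shutdown handlers, then start the server
import { bootstrap, setupShutdownHandlers } from "./server.js";

setupShutdownHandlers();
await bootstrap();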
Step 10: Run the tests
Create the Vitest config and a test setup file, then write the test suite.
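A vitest.config.ts consistent with the coverage thresholds and report files described in this step might look like the sketch below (the setup-file path is an assumption; the artifact has the exact config):

ts
import { defineConfig } from "vitest/config";

export default defineConfig({
  test: {
    setupFiles: ["./tests/setup.ts"], // assumed path for the setup file shown next
    coverage: {
      provider: "v8",
      thresholds: { lines: 90, branches: 90, functions: 90, statements: 90 },
    },
    reporters: ["default", "json"],
    outputFile: "vitest-report.json",
  },
});

The setup file itself is short: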
ts
import { vi } from "vitest";

// Stub env vars for test runs
vi.stubEnv("OTEL_EXPORTER_OTLP_ENDPOINT", "http://localhost:4318");
vi.stubEnv("LANGFUSE_PUBLIC_KEY", "pk-test");
vi.stubEnv("LANGFUSE_SECRET_KEY", "sk-test");
vi.stubEnv("LANGFUSE_HOST", "cloud.langfuse.com");
vi.stubEnv("PORT", "0");
The setup file stubs required environment variables so tests never need a real .env file. PORT=0 tells Express to pick an available port dynamically.
Now create the test files. Create tests/server.test.ts:
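The downloadable artifact contains the full file; a minimal sketch that exercises both endpoints with Supertest against buildApp() looks roughly like this:

ts
import { describe, expect, it } from "vitest";
import request from "supertest";
import { buildApp } from "../src/server.js";

describe("server endpoints", () => {
  it("reports health", async () => {
    const res = await request(buildApp()).get("/health");
    expect(res.status).toBe(200);
    expect(res.body.status).toBe("ok");
  });

  it("returns aggregated metrics as JSON", async () => {
    const res = await request(buildApp()).get("/metrics");
    expect(res.status).toBe(200);
    expect(res.headers["content-type"]).toContain("application/json");
  });
});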
The full test suite also covers the callback handler, instrumentation registration, metrics aggregation, utility functions, and end-to-end integration with MSW-mocked OTLP endpoints. Create the remaining test files (tests/utils.test.ts, tests/metrics.test.ts, tests/langchain-callback.test.ts, tests/integration.test.ts, tests/instrumentation.test.ts, and tests/mocks.ts) from the downloadable artifact.
Run the tests:
terminal
pnpm test
Expected output: Vitest runs all six spec files. Every test passes, and the coverage report shows each metric (lines, branches, functions, statements) at or above the 90% threshold. A vitest-report.json file is written alongside a coverage/ directory with detailed coverage data.
Next steps
Wire this callback handler into your own LangChain chains by passing createLangChainCallback() in the callbacks array of any LLMChain, RetrievalQAChain, or custom RunnableSequence (see the sketch after this list).
Add a /budget endpoint that exposes remaining budget per scope using BudgetController.getRemainingBudget(), letting you set hard caps and alert when a pipeline exceeds its allocation.
Deploy the Express sidecar alongside your application with Docker Compose — point the OTLP exporter at a self-hosted collector and visualize traces in Langfuse’s trace UI.
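For the first item, assuming langchain-callback.ts exports a createLangChainCallback() factory as described, wiring it into one of your own chains looks roughly like this (the chain, model, and import path are illustrative):

ts
import { ChatOpenAI } from "@langchain/openai";
import { ChatPromptTemplate } from "@langchain/core/prompts";
import { createLangChainCallback } from "./src/langchain-callback.js";

const chain = ChatPromptTemplate.fromTemplate("Summarize: {input}").pipe(
  new ChatOpenAI({ model: "gpt-4o-mini" }),
);

// Pass the handler per invocation so every LLM, chain, and tool event becomes a span.
const result = await chain.invoke(
  { input: "Quarterly support tickets..." },
  { callbacks: [createLangChainCallback()] },
);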