Cohere LLM Cost Observability for SMB Support Agents

Wrap every Cohere API call with cost telemetry and OTel spans so SMBs can see exactly where their LLM budget goes and stop cost overruns.

The problem

Small businesses running Cohere-powered support bots have no per-call cost visibility; a single verbose handling loop can silently triple the monthly bill.

Intro

This recipe wraps the Cohere TypeScript SDK (cohere-ai) with per-call cost telemetry, OpenTelemetry spans, and real-time budget tracking so small businesses running support bots can see exactly where their LLM budget goes. You’ll build an InstrumentedCohereClient that captures token counts and calculates costs on every chat() and chatStream() call, an in-memory spend dashboard with a Next.js API route, and a polling BudgetWatcher that fires Pino alerts when daily limits are breached.
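The core idea can be sketched without any dependencies. The following is a minimal illustration, not the recipe's actual InstrumentedCohereClient: the names InstrumentedChatClient, SpendTracker, costUsd, and PRICES are hypothetical, the prices are placeholders rather than real Cohere rates, and the response shape (usage.billedUnits with inputTokens and outputTokens) is an assumption that should be checked against the cohere-ai version you install.

```typescript
// Hypothetical per-million-token prices in USD. Placeholder values, not real Cohere rates.
interface ModelPrice { inputPerMTok: number; outputPerMTok: number }
const PRICES: Record<string, ModelPrice> = {
  "example-model": { inputPerMTok: 2.5, outputPerMTok: 10 },
};

interface TokenUsage { inputTokens: number; outputTokens: number }

// Assumed response shape: only the fields this sketch reads.
interface ChatResponseLike {
  usage?: { billedUnits?: { inputTokens?: number; outputTokens?: number } };
}

// Structural stand-in for the SDK client, so the sketch runs without cohere-ai installed.
interface ChatClientLike<Req extends { model: string }> {
  chat(request: Req): Promise<ChatResponseLike>;
}

// Convert token counts into a dollar cost for one call.
function costUsd(model: string, usage: TokenUsage): number {
  const price = PRICES[model];
  if (!price) throw new Error(`No price configured for model "${model}"`);
  const micros =
    usage.inputTokens * price.inputPerMTok +
    usage.outputTokens * price.outputPerMTok;
  return micros / 1_000_000;
}

// In-memory daily spend totals, keyed by UTC date (YYYY-MM-DD).
class SpendTracker {
  private totals = new Map<string, number>();

  record(cost: number, at: Date = new Date()): void {
    const day = at.toISOString().slice(0, 10);
    this.totals.set(day, (this.totals.get(day) ?? 0) + cost);
  }

  spentOn(at: Date = new Date()): number {
    return this.totals.get(at.toISOString().slice(0, 10)) ?? 0;
  }

  isOverBudget(dailyLimitUsd: number, at: Date = new Date()): boolean {
    return this.spentOn(at) > dailyLimitUsd;
  }
}

// Wraps chat() so every call records its cost before returning the response.
class InstrumentedChatClient<Req extends { model: string }> {
  private readonly inner: ChatClientLike<Req>;
  private readonly tracker: SpendTracker;

  constructor(inner: ChatClientLike<Req>, tracker: SpendTracker) {
    this.inner = inner;
    this.tracker = tracker;
  }

  async chat(request: Req): Promise<ChatResponseLike> {
    const response = await this.inner.chat(request);
    const billed = response.usage?.billedUnits;
    this.tracker.record(
      costUsd(request.model, {
        inputTokens: billed?.inputTokens ?? 0,
        outputTokens: billed?.outputTokens ?? 0,
      }),
    );
    return response;
  }
}
```

A budget watcher in this style would call isOverBudget() on a timer and log an alert when it returns true. Streaming calls are not covered here; they would need to read usage from whatever the stream reports when it finishes.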

Prerequisites

Node.js 22+ — runtime for the Next.js app
pnpm 10 — package manager (exact version: 10.0.0; npm or yarn won’t match the lockfile)
Cohere API key — get one at dashboard.cohere.com
Langfuse account — optional but recommended for the OTel dashboard; sign up at langfuse.com
Basic familiarity with Next.js App Router, TypeScript, and pnpm

Step 1: Scaffold the project

Create an empty directory and initialise a Next.js project. This recipe pins every dependency to an exact version so you don’t hit surprises on upgrades.

terminal

mkdir cohere-cost-observability
cd cohere-cost-observability

Create package.json with the full dependency list. The foundation is @reaatech/llm-cost-telemetry and its three companion packages: the calculator for per-model pricing, the observability layer for OTel tracing and Pino logging, and a third package that provides OpenTelemetry GenAI semantic convention types.

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

161 kB · 66 tests · 98.6% coverage · vitest passing

SHA-256: 20216bc5eaa2004fa1dce52dfc2e0e6b3f9f7f39ad00de8c0cae1e77463474d4

Recipe outline

Intro
Prerequisites
Step 1: Scaffold the project
Step 2: Create the Cohere config loader
Step 3: Define shared types
Step 4: Initialise telemetry with Langfuse
Step 5: Build the instrumented Cohere client
Step 6: Build the dashboard API route
Step 7: Create the budget watcher
Step 8: Wire up the Next.js instrumentation hook
Step 9: Export the public API surface
Step 10: Run the tests
Next steps