vLLM Agent Mesh for E-commerce Order Management

A multi-agent system that handles order inquiries, shipment tracking, and returns for SMB e‑commerce stores, powered by a vLLM‑hosted model and orchestrated with REAA agent-mesh.

vllm agent-mesh e-commerce order-management nextjs multi-agent reaatech

The problem

Small online retailers manually handle repetitive customer queries about order status, shipping updates, and return policies. Delegating these tasks to a single LLM agent leads to context‑lost handoffs and inconsistent responses.

Built from

Intro

This tutorial walks you through building a multi-agent e-commerce order management system powered by a vLLM-hosted LLM and orchestrated with the REAA agent-mesh framework. You’ll create a Next.js App Router application where specialist agents handle order inquiries, shipment tracking, and returns — with intent classification via @reaatech/agent-mesh-router, multi-turn session continuity backed by Upstash Redis, structured output repair for vLLM responses, and per-session cost telemetry.

Prerequisites

Node.js 22+ and pnpm 10+
A vLLM endpoint running an OpenAI-compatible model (e.g., Qwen 2.5 7B Instruct)
An Upstash Redis instance (for session storage)
Familiarity with TypeScript, Next.js App Router, and basic LLM concepts

Step 1: Scaffold the Next.js project and install dependencies

Create a new Next.js project with TypeScript and the App Router, then install the REAA orchestration packages:

terminal

pnpm create next-app@latest vllm-agent-mesh \
  --typescript --eslint --app --src-dir \
  --import-alias "@/*" --use-pnpm --no-turbopack

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

203 kB·165 tests·99.4% coverage·vitest passing

SHA-256d74f08411c5ca71d0ba266f2692732afe6b597a74914fb0914b5f3cda2223485

Book a conversation All solutions

Comments

Loading comments…

import { generateId, now, loadConfig, calculateCostFromTokens, getWindowStart, getWindowEnd, CostSpanSchema, type CostSpan, type BudgetStatus } from "@reaatech/llm-cost-telemetry"; class CostTelemetryService { private spans: CostSpan[] = []; recordCall( provider: string, model: string, inputTokens: number, outputTokens: number, costUsd?: number, ): CostSpan { const timestamp = now(); const totalCost = costUsd ?? calculateCostFromTokens(inputTokens, outputTokens); const span = CostSpanSchema.parse({ id: generateId(), provider, model, inputTokens, outputTokens, costUsd: totalCost, timestamp, }); this.spans.push(span); return span; } private spansInWindow(window: "day" | "month"): CostSpan[] { const windowStart = getWindowStart(now(), window); const windowEnd = getWindowEnd(now(), window); return this.spans.filter((s) => { if (!s.timestamp) { return false; } return s.timestamp >= windowStart && s.timestamp <= windowEnd; }); } getDailyCost(): number { return this.spansInWindow("day").reduce((sum, s) => sum + s.costUsd, 0); } getMonthlyCost(): number { return this.spansInWindow("month").reduce((sum, s) => sum + s.costUsd, 0); } checkBudget(): BudgetStatus { const config = loadConfig(); const dailyLimit = config.budget.global?.daily ?? 10; const monthlyLimit = config.budget.global?.monthly ?? 300; const dailyCost = this.getDailyCost(); const monthlyCost = this.getMonthlyCost(); const dailyPercentage = dailyLimit > 0 ? (dailyCost / dailyLimit) * 100 : 0; const monthlyPercentage = monthlyLimit > 0 ? (monthlyCost / monthlyLimit) * 100 : 0; return { tenant: "default", dailySpent: dailyCost, dailyBudget: dailyLimit, dailyLimit, dailyPercentage, dailyRemaining: Math.max(0, dailyLimit - dailyCost), dailyExceeded: dailyCost > dailyLimit, withinBudget: dailyCost <= dailyLimit && monthlyCost <= monthlyLimit, monthlySpent: monthlyCost, monthlyBudget: monthlyLimit, monthlyLimit, monthlyPercentage, monthlyRemaining: Math.max(0, monthlyLimit - monthlyCost), monthlyExceeded: monthlyCost > monthlyLimit, activeAlerts: [], }; } getSpans(): readonly CostSpan[] { return this.spans; } } const costTelemetry = new CostTelemetryService(); export { CostTelemetryService, costTelemetry };

import { withRetry, TypedEventEmitter } from "@reaatech/agent-handoff"; import { SessionNotFoundError, ConcurrencyError } from "@reaatech/session-continuity"; import { sessionManager } from "../lib/state.js"; interface HandoffLifecycleEvents { "handoff:started": { sessionId: string; fromAgentId: string; toAgentId: string }; "handoff:completed": { sessionId: string; toAgentId: string }; "handoff:failed": { sessionId: string; fromAgentId: string; toAgentId: string; error: Error }; } class HandoffManager { private emitter: TypedEventEmitter<HandoffLifecycleEvents>; constructor() { this.emitter = new TypedEventEmitter<HandoffLifecycleEvents>(); } get events(): TypedEventEmitter<HandoffLifecycleEvents> { return this.emitter; } async executeHandoff( sessionId: string, fromAgentId: string, toAgentId: string, ): Promise<void> { this.emitter.emit("handoff:started", { sessionId, fromAgentId, toAgentId }); try { await withRetry( async () => { await sessionManager.handoffToAgent(sessionId, toAgentId); }, { maxRetries: 3, backoff: "exponential", baseDelayMs: 100, maxDelayMs: 5000, shouldRetry: (err: unknown) => err instanceof Error, }, ); this.emitter.emit("handoff:completed", { sessionId, toAgentId }); } catch (cause) { const error = cause instanceof Error ? cause : new Error(String(cause)); if (cause instanceof SessionNotFoundError) { this.emitter.emit("handoff:failed", { sessionId, fromAgentId, toAgentId, error }); throw error; } if (cause instanceof ConcurrencyError) { try { await withRetry( async () => { await sessionManager.handoffToAgent(sessionId, toAgentId); }, { maxRetries: 1, backoff: "linear", baseDelayMs: 50, maxDelayMs: 1000, shouldRetry: (err: unknown) => err instanceof Error, }, ); this.emitter.emit("handoff:completed", { sessionId, toAgentId }); return; } catch { this.emitter.emit("handoff:failed", { sessionId, fromAgentId, toAgentId, error }); throw error; } } this.emitter.emit("handoff:failed", { sessionId, fromAgentId, toAgentId, error }); throw error; } } async compressForHandoff(sessionId: string): Promise<void> { await sessionManager.compressContext(sessionId); } } const handoffManager = new HandoffManager(); export { HandoffManager, handoffManager }; export type { HandoffLifecycleEvents };

vLLM Agent Mesh for E-commerce Order Management

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Step 2: Configure environment variables

Step 3: Define the domain types with Zod

Step 4: Create the vLLM client

Step 5: Build session state management with Upstash Redis

Step 6: Implement cost telemetry

Step 7: Create the agent registry

Step 8: Build the handoff manager

Step 9: Build the core agent router

Step 10: Wire the Chat API route

Step 11: Add the admin endpoint

Step 12: Create the Zustand store and chat component

Step 13: Wire up the home page and layout

Step 14: Create the barrel export file

Step 15: Configure Vitest and the test setup

Step 16: Run the tests

Next steps