vLLM Multi-Agent Handoff for E-commerce Support Routing

Route customer queries across product, order, and returns agents hosted on vLLM, with compressed context handoff so no conversation gets lost.

vllm multi-agent e-commerce support-routing langgraph agent-handoff express typescript

The problem

E-commerce support teams hosting cost-effective vLLM models find it hard to coordinate multiple specialist agents; misrouted questions cause customer frustration and agent loops.

Built from

Intro

This tutorial walks you through building a multi-agent e-commerce support routing system using vLLM for model serving, LangGraph for state machine orchestration, and the REAA agent-handoff protocol for compression and routing. You will create three specialist agents — Product, Order, and Returns — that share a single vLLM endpoint. Incoming customer messages go through a LangGraph workflow that routes to the best-fit agent, compresses context when it exceeds token limits, and persists conversation state in Upstash Redis with automatic retry on transient failures.

Prerequisites

Node.js 22+ and pnpm 10+ installed
An Upstash Redis account (free tier works) — you will need the REST URL and token
A running vLLM server with an OpenAI-compatible endpoint (or access to one) hosting at least one chat model
Basic familiarity with Next.js App Router, TypeScript, and LangGraph concepts

Step 1: Scaffold the project and install dependencies

Start from an empty directory and create the package.json with all dependencies exact-pinned. The project uses Next.js 16 with the App Router, Zod for configuration validation, LangGraph for orchestration, the AI SDK for vLLM communication, Upstash Redis for session persistence, p-retry for resilience, and four REAA agent-handoff packages for routing, compression, and protocol orchestration.

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

165 kB·56 tests·93.6% coverage·vitest passing

SHA-256c43d413bf6f2bc8c2c509386bab8ad5ff727124483814d8c10dc953445be9820

Book a conversation All solutions

Comments

Loading comments…

vLLM Multi-Agent Handoff for E-commerce Support Routing

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Step 2: Create the configuration schema and shared types

Step 3: Set up environment variables

Step 4: Register the three e-commerce specialist agents

Step 5: Create the vLLM client adapter

Step 6: Build the compression service

Step 7: Implement the Redis-backed session store

Step 8: Wire the confidence-based handoff router

Step 9: Build the HandoffManager orchestrator

Step 10: Create the LangGraph state machine

Step 11: Write the API route handler

Step 12: Set up the test infrastructure and run the suite

Next steps