OpenAI Lead Intake with Adaptive Routing

Intro

This recipe builds an Express server that receives inbound leads, parses attached PDF and DOCX files, classifies intent through a keyword matcher and OpenAI LLM, routes each lead by confidence score, hands off high-confidence leads to a webhook, and syncs contacts to HubSpot. You will wire together six services and end with a running server you can POST lead data against.

Prerequisites

Node.js 22 or later
pnpm installed (npm install -g pnpm)
An OpenAI API key with access to gpt-5.2-mini
A Langfuse project (self-hosted or cloud.langfuse.com)
A HubSpot private app access token
A webhook URL to receive routed leads

Step 1: Install dependencies

Start from the project root. Copy the .env.example file, fill in your keys, then install all packages:

terminal

cp .env.example .env
pnpm install

Expected output: pnpm prints resolution tables and concludes with Done.

The package.json pins all dependencies exactly, including the four REAA packages:

json

{
  "dependencies": {
    "@reaatech/confidence-router": "0.1.0",
    "@reaatech/confidence-router-classifiers": "0.1.0",
    "@reaatech/agent-handoff": "0.1.0",
    "@reaatech/agent-budget-engine": "0.1.0",
    "openai": "6.38.0",
    "express": "5.2.1",
    "langfuse": "3.38.20",
    "unpdf": "1.6.2",
    "mammoth": "1.12.0",
    "@hubspot/api-client": "13.5.0",
    "zod":

Step 2: Configure environment variables

Open .env and fill in the values:

env

NODE_ENV=development
OPENAI_API_KEY=<your-openai-key>
LANGFUSE_PUBLIC_KEY=<your-langfuse-public-key>
LANGFUSE_SECRET_KEY=<your-langfuse-secret-key>
LANGFUSE_BASE_URL=https://cloud.langfuse.com
HUBSPOT_ACCESS_TOKEN=<your-hubspot-private-app-token>
LEAD_HANDOFF_WEBHOOK_URL=<your-sales-rep-webhook-url>
PORT=3000
ALLOWED_ORIGINS=http://localhost:3000

The Express server reads process.env at startup. Every service that calls an external API reads its own key from this file — nothing is hardcoded.

Step 3: Define shared types

Create src/lib/types.ts. Every service in this recipe shares these interfaces, so TypeScript can verify the shape of data as it flows through the pipeline:

import { z } from "zod";
 
export interface FileAttachment {
  filename: string;
  mimeType: string;
  buffer:

The LeadRequest Zod schema validates incoming API payloads: text is required and capped at 10,000 characters, while email, firstName, lastName, company, and metadata are all optional.

Step 4: Build the document parser

Create src/lib/parser.ts. This module extracts text from PDF and DOCX attachments so their content feeds into the classifier alongside the form text:

import { extractText, getDocumentProxy } from "unpdf";
import mammoth from "mammoth";
import type { FileAttachment, ParsedDocument } from "./types.js";
 
export

parseFile dispatches on MIME type: PDF uses unpdf’s getDocumentProxy + extractText, DOCX uses mammoth’s extractRawText. Any other MIME type throws a ParserError with code unsupported_mime_type. parseFiles wraps each parse in Promise.allSettled so one bad attachment does not crash the whole batch — only the successful parses are returned.

Step 5: Create the in-memory spend store

Create src/services/spend-store.ts. The BudgetController from @reaatech/agent-budget-engine expects a SpendStore instance. This in-memory implementation satisfies that interface without external infrastructure:

import { SpendStore } from "@reaatech/agent-budget-spend-tracker";
 
export function createInMemorySpendStore(): SpendStore {
  return new SpendStore();
}

The SpendStore class from @reaatech/agent-budget-spend-tracker already implements the full record, getSpend, getAllScopes, and reset API surface. The factory just returns a fresh instance shared across all requests.

Step 6: Set up the confidence classifier

Create src/services/classifier.ts. This wires the two classifiers from @reaatech/confidence-router-classifiers into a registry: a keyword classifier runs first (fast, no API cost), then falls back to the OpenAI LLM classifier for ambiguous inputs:

import { ClassifierRegistry, KeywordClassifier, LLMClassifier } from "@reaatech/confidence-router-classifiers";
 
interface ClassificationResult {
  predictions: Array<{ label: string; confidence: number }>;
  metadata?: Record<

getFallbackChain tries each enabled classifier in priority order until one succeeds. If all fail, the safe fallback returns "other" at confidence 1.0 so the pipeline always has a result.

Step 7: Configure the OpenAI pricing provider and budget controller

Create src/services/pricing.ts first, then src/services/budget.ts.

src/services/pricing.ts implements the PricingProvider interface expected by BudgetController:

export class OpenAIPricingProvider {
  estimateCost(modelId: string, estimatedInputTokens: number, _provider?: string): number {
    void _provider;
    const inputTokens = Math.max(0, estimatedInputTokens);
 
    const rate = this.getRate(modelId);

src/services/budget.ts wraps the BudgetController with per-user budget operations:

registerBudgetEvents wires hard-stop and threshold-breach controller events into Langfuse so every budget event is traceable.

Step 8: Set up lead routing

Create src/services/router.ts. This wraps the ConfidenceRouter from @reaatech/confidence-router and maps its internal decision types to the RoutingOutcome shape used throughout the pipeline:

import { ConfidenceRouter } from "@reaatech/confidence-router";
import type { RoutingOutcome } from "../lib/types.js";
 
interface ClassificationResult {
  predictions: Array<{ label: string; confidence: number }>;
  metadata?: Record<string, unknown>;
}

The default thresholds are 0.7 for routing and 0.3 for falling back. Confidence between those bounds triggers a clarification prompt. Pass a custom config to createLeadRouter to adjust these values.

Step 9: Wire up the Express server

Create server.ts at the project root. This is the entry point — it creates all service singletons once, wires them into the LeadProcessor, and mounts the HTTP routes:

import express from "express";
import cors from "cors";
import { createLeadRouter }

Start the server with:

terminal

npx tsx server.ts

Expected output: Lead intake server listening on port 3000.

Send a test request:

terminal

curl -X POST http://localhost:3000/api/lead \
  -H "Content-Type: application/json" \
  -H "x-user-id: user-1" \
  -d '{"text": "I want to buy a demo of your enterprise plan"}'

A successful route returns HTTP 201 with:

json

{
  "id": "<uuid>",
  "status": "routed",
  "message": "Lead processed",
  "routingDecision": { "action": "route", "target": "sales", "confidence": 0.92 },
  "budgetState": { "scopeKey": "user-1", "spent": 0, "limit": 0, "remaining": 0, "state": "Active" }
}

Check the health endpoint:

terminal

curl http://localhost:3000/api/health

Expected output: {"status":"ok","uptime":3.45}.

Next steps

Replace the hardcoded defineLeadBudget call with a per-request budget check so each user gets their own spend limit from a database or JWT claim.
Add OpenTelemetry exports alongside Langfuse so traces flow into your existing observability platform.
Extend the ClassifierRegistry with a third classifier (e.g., a vector-search embedding classifier) to improve accuracy on ambiguous inputs before falling back to the LLM.
Configure express.raw with a verify function to validate multipart body size and reject oversized uploads before they reach the handler.

Intro

Prerequisites

Node.js 22 or later
pnpm installed (npm install -g pnpm)
An OpenAI API key with access to gpt-5.2-mini
A Langfuse project (self-hosted or cloud.langfuse.com)
A HubSpot private app access token
A webhook URL to receive routed leads

Step 1: Install dependencies

Start from the project root. Copy the .env.example file, fill in your keys, then install all packages:

terminal

cp .env.example .env
pnpm install

Expected output: pnpm prints resolution tables and concludes with Done.

The package.json pins all dependencies exactly, including the four REAA packages:

json

{
  "dependencies": {
    "@reaatech/confidence-router": "0.1.0",
    "@reaatech/confidence-router-classifiers": "0.1.0",
    "@reaatech/agent-handoff": "0.1.0",
    "@reaatech/agent-budget-engine": "0.1.0",
    "openai": "6.38.0",
    "express": "5.2.1",
    "langfuse": "3.38.20",
    "unpdf": "1.6.2",
    "mammoth": "1.12.0",
    "@hubspot/api-client": "13.5.0",
    "zod":

Step 2: Configure environment variables

Open .env and fill in the values:

env

NODE_ENV=development
OPENAI_API_KEY=<your-openai-key>
LANGFUSE_PUBLIC_KEY=<your-langfuse-public-key>
LANGFUSE_SECRET_KEY=<your-langfuse-secret-key>
LANGFUSE_BASE_URL=https://cloud.langfuse.com
HUBSPOT_ACCESS_TOKEN=<your-hubspot-private-app-token>
LEAD_HANDOFF_WEBHOOK_URL=<your-sales-rep-webhook-url>
PORT=3000
ALLOWED_ORIGINS=http://localhost:3000

The Express server reads process.env at startup. Every service that calls an external API reads its own key from this file — nothing is hardcoded.

Step 3: Define shared types

Create src/lib/types.ts. Every service in this recipe shares these interfaces, so TypeScript can verify the shape of data as it flows through the pipeline:

import { z } from "zod";
 
export interface FileAttachment {
  filename: string;
  mimeType: string;
  buffer:

The LeadRequest Zod schema validates incoming API payloads: text is required and capped at 10,000 characters, while email, firstName, lastName, company, and metadata are all optional.

Step 4: Build the document parser

Create src/lib/parser.ts. This module extracts text from PDF and DOCX attachments so their content feeds into the classifier alongside the form text:

import { extractText, getDocumentProxy } from "unpdf";
import mammoth from "mammoth";
import type { FileAttachment, ParsedDocument } from "./types.js";
 
export

Step 5: Create the in-memory spend store

import { SpendStore } from "@reaatech/agent-budget-spend-tracker";
 
export function createInMemorySpendStore(): SpendStore {
  return new SpendStore();
}

Step 6: Set up the confidence classifier

import { ClassifierRegistry, KeywordClassifier, LLMClassifier } from "@reaatech/confidence-router-classifiers";
 
interface ClassificationResult {
  predictions: Array<{ label: string; confidence: number }>;
  metadata?: Record<

getFallbackChain tries each enabled classifier in priority order until one succeeds. If all fail, the safe fallback returns "other" at confidence 1.0 so the pipeline always has a result.

Step 7: Configure the OpenAI pricing provider and budget controller

Create src/services/pricing.ts first, then src/services/budget.ts.

src/services/pricing.ts implements the PricingProvider interface expected by BudgetController:

export class OpenAIPricingProvider {
  estimateCost(modelId: string, estimatedInputTokens: number, _provider?: string): number {
    void _provider;
    const inputTokens = Math.max(0, estimatedInputTokens);
 
    const rate = this.getRate(modelId);

src/services/budget.ts wraps the BudgetController with per-user budget operations:

registerBudgetEvents wires hard-stop and threshold-breach controller events into Langfuse so every budget event is traceable.

Step 8: Set up lead routing

Create src/services/router.ts. This wraps the ConfidenceRouter from @reaatech/confidence-router and maps its internal decision types to the RoutingOutcome shape used throughout the pipeline:

import { ConfidenceRouter } from "@reaatech/confidence-router";
import type { RoutingOutcome } from "../lib/types.js";
 
interface ClassificationResult {
  predictions: Array<{ label: string; confidence: number }>;
  metadata?: Record<string, unknown>;
}

Step 9: Wire up the Express server

Create server.ts at the project root. This is the entry point — it creates all service singletons once, wires them into the LeadProcessor, and mounts the HTTP routes:

import express from "express";
import cors from "cors";
import { createLeadRouter }

Start the server with:

terminal

npx tsx server.ts

Expected output: Lead intake server listening on port 3000.

Send a test request:

terminal

curl -X POST http://localhost:3000/api/lead \
  -H "Content-Type: application/json" \
  -H "x-user-id: user-1" \
  -d '{"text": "I want to buy a demo of your enterprise plan"}'

A successful route returns HTTP 201 with:

json

{
  "id": "<uuid>",
  "status": "routed",
  "message": "Lead processed",
  "routingDecision": { "action": "route", "target": "sales", "confidence": 0.92 },
  "budgetState": { "scopeKey": "user-1", "spent": 0, "limit": 0, "remaining": 0, "state": "Active" }
}

Check the health endpoint:

terminal

curl http://localhost:3000/api/health

Expected output: {"status":"ok","uptime":3.45}.

Next steps

Replace the hardcoded defineLeadBudget call with a per-request budget check so each user gets their own spend limit from a database or JWT claim.
Add OpenTelemetry exports alongside Langfuse so traces flow into your existing observability platform.
Extend the ClassifierRegistry with a third classifier (e.g., a vector-search embedding classifier) to improve accuracy on ambiguous inputs before falling back to the LLM.
Configure express.raw with a verify function to validate multipart body size and reject oversized uploads before they reach the handler.

OpenAI Lead Intake with Adaptive Routing

The problem

Built from

Intro

Prerequisites

Step 1: Install dependencies

Step 2: Configure environment variables

Step 3: Define shared types

Step 4: Build the document parser

Step 5: Create the in-memory spend store

Step 6: Set up the confidence classifier

Step 7: Configure the OpenAI pricing provider and budget controller

Step 8: Set up lead routing

Step 9: Wire up the Express server

Next steps

Example artifact

Comments

Intro

Prerequisites

Step 1: Install dependencies

Step 2: Configure environment variables

Step 3: Define shared types

Step 4: Build the document parser

Step 5: Create the in-memory spend store

Step 6: Set up the confidence classifier

Step 7: Configure the OpenAI pricing provider and budget controller

Step 8: Set up lead routing

Step 9: Wire up the Express server

Next steps