Cohere Lead Intake Agent for HubSpot SMB Sales

An AI intake system that processes documents, emails, and chat messages to capture leads and auto-populate HubSpot CRM with accurate, categorized data.

cohere hubspot lead-intake smb nextjs slack langfuse budget-guardrails

The problem

SMB sales teams spend hours manually entering leads from various channels, leading to data entry errors, slow follow-up, and missed opportunities.

Built from

Intro

This recipe builds an AI lead intake agent that processes business cards, PDFs, and emails to capture leads and auto-populate HubSpot CRM. It uses Cohere’s NLP models for extraction and classification, routes high-confidence leads straight into HubSpot, and flags low-confidence ones for human review via Slack. Budget enforcement caps monthly API spend per organization so costs stay predictable for SMBs.

Prerequisites

Node.js 22+ and pnpm installed
Cohere API key (COHERE_API_KEY)
HubSpot private app access token (HUBSPOT_ACCESS_TOKEN)
Slack bot token (SLACK_TOKEN) and a channel for review notifications (SLACK_VERIFICATION_CHANNEL)
Langfuse keys (LANGFUSE_SECRET_KEY, LANGFUSE_PUBLIC_KEY) for observability
Basic familiarity with TypeScript and REST APIs

Step 1: Install dependencies and configure environment

Start from the project root. Copy the env example and fill in your keys:

terminal

cp .env.example .env

Your .env needs at minimum:

env

COHERE_API_KEY=
HUBSPOT_ACCESS_TOKEN=
SLACK_TOKEN=

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

114 tests·99.1% coverage·vitest passing

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js 22+ and pnpm installed
Cohere API key (COHERE_API_KEY)
HubSpot private app access token (HUBSPOT_ACCESS_TOKEN)
Slack bot token (SLACK_TOKEN) and a channel for review notifications (SLACK_VERIFICATION_CHANNEL)
Langfuse keys (LANGFUSE_SECRET_KEY, LANGFUSE_PUBLIC_KEY) for observability
Basic familiarity with TypeScript and REST APIs

Step 1: Install dependencies and configure environment

Start from the project root. Copy the env example and fill in your keys:

terminal

cp .env.example .env

Your .env needs at minimum:

env

COHERE_API_KEY=
HUBSPOT_ACCESS_TOKEN=
SLACK_TOKEN=

File	Responsibility
`src/lib/types.ts`	All TypeScript interfaces and Zod schemas for leads, pipeline results, and extracted fields
`src/lib/cohere-client.ts`	Singleton Cohere client (`CohereClientV2`) that reads `COHERE_API_KEY` automatically
`src/lib/document.ts`	Document text extraction (PDF via `pdf-parse`, images via `tesseract.js`), text chunking, and field extraction via Cohere chat
`src/lib/classifier.ts`	Intent classification via `@reaatech/agent-mesh-classifier` + confidence routing via `@reaatech/confidence-router`
`src/lib/notify.ts`	Slack notifications for manual review via `@slack/web-api`
`src/lib/hubspot.ts`	HubSpot contact/company CRUD via `@hubspot/api-client`
`src/lib/budget.ts`	Budget lifecycle management via `@reaatech/agent-budget-engine`
`src/lib/handoff.ts`	Agent handoff orchestration (escalation + retry) via `@reaatech/agent-handoff`
`src/lib/langfuse.ts`	Observability traces via `langfuse`
`src/lib/store.ts`	In-memory lead store during a pipeline run

Route	Purpose
`app/api/leads/route.ts`	`POST` ingests files or email webhooks; `GET` is a health check; `OPTIONS` handles CORS
`app/api/leads/[id]/route.ts`	`GET` returns lead status and fields by ID

import { z } from "zod"; export const LeadStatusValues = ["PENDING_REVIEW", "READY", "CREATED", "FAILED"] as const; export type LeadStatus = (typeof LeadStatusValues)[number]; export const LeadStatus = { PENDING_REVIEW: "PENDING_REVIEW" as const, READY: "READY" as const, CREATED: "CREATED" as const, FAILED: "FAILED" as const, }; export interface LeadInput { source: string; email: string; firstName: string; lastName: string; company?: string; phone?: string; metadata?: Record<string, unknown>; } export interface LeadRecord { id: string; email: string; firstName: string; lastName: string; company?: string; phone?: string; industry?: string; intent?: string; source: string; confidence: number; status: LeadStatus; hubspotContactId?: string; } export interface PipelineResult { success: boolean; leadId?: string; hubspotContactId?: string; error?: string; decisionType?: string; } export interface ExtractedFields { firstName?: string; lastName?: string; email: string; company?: string; phone?: string; industry?: string; intent?: string; notes?: string; } export const LeadInputSchema = z.object({ source: z.string(), email: z.email(), firstName: z.string(), lastName: z.string(), company: z.string().optional(), phone: z.string().optional(), metadata: z.record(z.string(), z.unknown()).optional(), }); export const LeadRecordSchema = z.object({ id: z.string(), email: z.email(), firstName: z.string(), lastName: z.string(), company: z.string().optional(), phone: z.string().optional(), industry: z.string().optional(), intent: z.string().optional(), source: z.string(), confidence: z.number().min(0).max(1), status: z.enum(LeadStatusValues), hubspotContactId: z.string().optional(), }); export const ExtractedFieldsSchema = z.object({ firstName: z.string().optional(), lastName: z.string().optional(), email: z.email(), company: z.string().optional(), phone: z.string().optional(), industry: z.string().optional(), intent: z.string().optional(), notes: z.string().optional(), });

import { classifierService } from "@reaatech/agent-mesh-classifier"; import { ConfidenceRouter } from "@reaatech/confidence-router"; const leadIntakeRegistry = [ { agent_id: "book_demo", display_name: "Book Demo", description: "Handles requests to schedule a product demo", endpoint: "lead-intake://book_demo", type: "mcp" as const, is_default: false, confidence_threshold: 0.7, clarification_required: false, examples: ["I want to see a demo", "Schedule a demo call"], }, { agent_id: "pricing_inquiry", display_name: "Pricing Inquiry", description: "Handles questions about pricing and plans", endpoint: "lead-intake://pricing_inquiry", type: "mcp" as const, is_default: false, confidence_threshold: 0.7, clarification_required: false, examples: ["How much does it cost?", "What are your pricing plans?"], }, { agent_id: "support_request", display_name: "Support Request", description: "Handles technical support and help requests", endpoint: "lead-intake://support_request", type: "mcp" as const, is_default: false, confidence_threshold: 0.7, clarification_required: false, examples: ["I need help with login", "My account is broken"], }, { agent_id: "general_contact", display_name: "General Contact", description: "Handles general inquiries and messages", endpoint: "lead-intake://general_contact", type: "mcp" as const, is_default: true, confidence_threshold: 0.7, clarification_required: false, examples: ["I want to speak to someone", "Contact me"], }, ]; export async function classifyLeadInput( rawText: string, ): Promise<{ agent_id: string; confidence: number; intent_summary: string }> { const output = await classifierService.classify(rawText, leadIntakeRegistry); return { agent_id: output.agent_id, confidence: output.confidence, intent_summary: output.intent_summary, }; } const router = new ConfidenceRouter({ routeThreshold: 0.8, fallbackThreshold: 0.3, clarificationEnabled: false, }); export function routeClassification( agent_id: string, confidence: number, ): { decisionType: string; target: string } { const decision = router.decide({ predictions: [{ label: agent_id, confidence }], }); return { decisionType: decision.type, target: decision.target ?? "", }; } export async function classifyAndRoute( rawText: string, ): Promise<{ classification: { agent_id: string; confidence: number; intent_summary: string; }; decision: { decisionType: string; target: string }; }> { const classification = await classifyLeadInput(rawText); const decision = routeClassification( classification.agent_id, classification.confidence, ); return { classification, decision }; }

import { createHandoffConfig, TypedEventEmitter, withRetry, HandoffError, } from "@reaatech/agent-handoff"; import type { HandoffConfig } from "@reaatech/agent-handoff"; import { LeadStatus } from "./types.js"; import type { LeadRecord, PipelineResult } from "./types.js"; import { sendManualReviewNotification } from "./notify.js"; import { pushLeadToHubspot } from "./hubspot.js"; const handoffEmitter = new TypedEventEmitter<{ "lead-routed": { leadId: string; route: string }; "lead-escalated": { leadId: string; reason: string }; }>(); export function createLeadHandoffConfig( overrides?: Partial<Record<string, unknown>>, ): HandoffConfig { const defaults = createHandoffConfig({ routing: { minConfidenceThreshold: 0.6 }, }); if (!overrides) { return defaults; } return { ...defaults, ...overrides }; } export async function escalateForManualReview( lead: LeadRecord, reason: string, ): Promise<void> { handoffEmitter.emit("lead-escalated", { leadId: lead.id, reason }); await sendManualReviewNotification(lead, reason); lead.status = LeadStatus.PENDING_REVIEW; } export async function routeToHubspot( lead: LeadRecord, ): Promise<PipelineResult> { handoffEmitter.emit("lead-routed", { leadId: lead.id, route: "hubspot" }); try { const result = await withRetry(() => pushLeadToHubspot(lead), { maxRetries: 3, backoff: "exponential", baseDelayMs: 100, maxDelayMs: 5000, shouldRetry: (err: unknown) => err instanceof Error, }); return { success: true, leadId: lead.id, hubspotContactId: result.contactId, }; } catch (e) { return handleHandoffError(e, lead); } } export function handleHandoffError( error: unknown, lead: LeadRecord, ): PipelineResult { if (error instanceof HandoffError) { return { success: false, leadId: lead.id, error: error.message, decisionType: error.code, }; } const message = error instanceof Error ? error.message : String(error); return { success: false, leadId: lead.id, error: message, decisionType: "HANDOFF_ERROR", }; }

Cohere Lead Intake Agent for HubSpot SMB Sales

The problem

Built from

Intro

Prerequisites

Step 1: Install dependencies and configure environment

Example artifact

Comments

Intro

Prerequisites

Step 1: Install dependencies and configure environment

Step 2: Explore the source layout

Step 3: Define types and schemas

Step 4: Initialize the Cohere client

Step 5: Build document extraction and field parsing

Step 6: Classify intent and route by confidence

Step 7: Send Slack notifications

Step 8: Push leads to HubSpot

Step 9: Enforce monthly budgets

Step 10: Orchestrate handoff and escalation

Step 11: Add observability with Langfuse traces

Step 12: Wire up the lead ingestion API route

Step 13: Add the lead status lookup route

Step 14: Set up instrumentation for Node.js startup

Step 15: Export public API from `src/index.ts`

Step 16: Add the in-memory lead store

Step 17: Run the tests

Next steps

Cohere Lead Intake Agent for HubSpot SMB Sales

The problem

Built from

Intro

Prerequisites

Step 1: Install dependencies and configure environment

Example artifact

Intro

Prerequisites

Step 1: Install dependencies and configure environment

Step 2: Explore the source layout

Step 3: Define types and schemas

Step 4: Initialize the Cohere client

Step 5: Build document extraction and field parsing

Step 6: Classify intent and route by confidence

Step 7: Send Slack notifications

Step 8: Push leads to HubSpot

Step 9: Enforce monthly budgets

Step 10: Orchestrate handoff and escalation

Step 11: Add observability with Langfuse traces

Step 12: Wire up the lead ingestion API route

Step 13: Add the lead status lookup route

Step 14: Set up instrumentation for Node.js startup

Step 15: Export public API from src/index.ts

Step 16: Add the in-memory lead store

Step 17: Run the tests

Next steps

Step 15: Export public API from `src/index.ts`