Intake Automation Agent for Solo Immigration Attorney

Slash intake time from 30 minutes to 5 with AI-powered client screening and conflict checks.

legal-intake immigration ai-agent nextjs fastify rag guardrails lead-intake

The problem

A solo immigration attorney spends over 30 minutes per new client manually gathering case details, checking for conflicts across years of paper files, and entering data into the case management system. This administrative drag means fewer billable hours and delayed responses to prospective clients. The attorney often loses leads because intake takes too long and the process feels impersonal. They need a way to automate the initial triage without sacrificing accuracy or compliance.

Built from

Intro

This tutorial builds an AI-powered intake automation agent for a solo immigration attorney. A prospective client sends a message through a web form, and the agent screens them, checks for conflicts against past cases stored in a Postgres + pgvector database, classifies their legal need, generates a structured case summary, and responds with empathy — all while scrubbing PII, enforcing compliance disclaimers, and tracking telemetry with Langfuse. By the end, intake drops from 30 minutes to 5.

You’ll wire up 6 REAA packages (agent-mesh, hybrid-rag, agent-memory, guardrail-chain, agent-handoff, llm-cache) into a Next.js 16 App Router project with Zod-validated env config, an LLM service built on the Vercel AI SDK (ai), a PDF/OCR document parser, and a full test suite with msw HTTP mocking.

Prerequisites

Node.js 22+ with pnpm 10 installed
An OpenAI API key with access to gpt-5.2 and text-embedding-3-small
A Langfuse account (free tier works) with public and secret keys
A PostgreSQL database with the pgvector extension enabled
Familiarity with Next.js App Router route handlers and TypeScript generics

Step 1: Scaffold the project and install dependencies

Start from an empty directory. Create the project with Next.js and install every dependency at exact pinned versions.

terminal

pnpm create next@16.2.7

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

168 kB·139 tests·99.1% coverage·vitest passing

SHA-2565fa8c22aef7820effbdf996d99dfa10507d27f9703a5311be36c6efe34bba145

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js 22+ with pnpm 10 installed
An OpenAI API key with access to gpt-5.2 and text-embedding-3-small
A Langfuse account (free tier works) with public and secret keys
A PostgreSQL database with the pgvector extension enabled
Familiarity with Next.js App Router route handlers and TypeScript generics

Step 1: Scaffold the project and install dependencies

Start from an empty directory. Create the project with Next.js and install every dependency at exact pinned versions.

terminal

pnpm create next@16.2.7

import { sql } from "@vercel/postgres"; import pgvector from "pgvector"; import type { IntakeSession, DocumentInfo } from "../lib/types.js"; export async function initDatabase() { await sql`CREATE EXTENSION IF NOT EXISTS vector`; await sql`CREATE TABLE IF NOT EXISTS intake_sessions (id UUID PRIMARY KEY DEFAULT gen_random_uuid(), client_name TEXT NOT NULL, client_email TEXT, client_phone TEXT, case_description TEXT, status TEXT DEFAULT 'active', conflict_status TEXT, created_at TIMESTAMPTZ DEFAULT NOW(), updated_at TIMESTAMPTZ DEFAULT NOW())`; await sql`CREATE TABLE IF NOT EXISTS case_documents (id UUID PRIMARY KEY DEFAULT gen_random_uuid(), session_id UUID REFERENCES intake_sessions(id), file_name TEXT, mime_type TEXT, text_content TEXT, embedding vector(1536), created_at TIMESTAMPTZ DEFAULT NOW())`; } export function toSql(vector: number[]) { return pgvector.toSql(vector); } export async function saveSession(session: IntakeSession) { await sql`INSERT INTO intake_sessions (id, client_name, client_email, case_description, status) VALUES (${session.sessionId}, ${session.clientName}, ${session.clientEmail}, ${session.caseDescription}, ${session.status}) ON CONFLICT (id) DO UPDATE SET client_name = ${session.clientName}, client_email = ${session.clientEmail}, case_description = ${session.caseDescription}, status = ${session.status}, updated_at = NOW()`; } export async function getSession(sessionId: string): Promise<IntakeSession | null> { const { rows } = await sql`SELECT * FROM intake_sessions WHERE id = ${sessionId}`; if (rows.length === 0) return null; const row = rows[0]; return { sessionId: String(row.id), clientName: String(row.client_name), clientEmail: String(row.client_email), caseDescription: String(row.case_description), status: String(row.status), createdAt: new Date(String(row.created_at)), }; } export async function saveDocument(doc: DocumentInfo, emb: number[]) { await sql`INSERT INTO case_documents (id, file_name, mime_type, text_content, embedding) VALUES (${doc.id}, ${doc.fileName}, ${doc.mimeType}, ${doc.textContent}, ${toSql(emb)}::vector)`; } export async function searchSimilarDocuments(embedding: number[], limit: number = 10) { const { rows } = await sql`SELECT id, session_id, file_name, text_content FROM case_documents ORDER BY embedding <-> ${toSql(embedding)}::vector LIMIT ${limit}`; return rows; }

import { ChainBuilder, type Guardrail, type GuardrailResult, type ChainContext, setLogger, ConsoleLogger, } from "@reaatech/guardrail-chain"; setLogger(new ConsoleLogger()); class PIIScrubGuardrail implements Guardrail<string, string> { id = "pii-scrub"; name = "PII Scrubber"; type = "input" as const; enabled = true; execute(input: string, _ctx: ChainContext): Promise<GuardrailResult<string>> { void _ctx; const redactedInput = input .replace(/\d{3}-\d{2}-\d{4}/g, "[REDACTED SSN]") .replace(/\+\d{1,3}[-.\s]?\d{3}[-.\s]?\d{3}[-.\s]?\d{4}/g, "[REDACTED PHONE]") .replace(/\S+@\S+\.\S+/g, "[REDACTED EMAIL]"); return Promise.resolve({ passed: true, output: redactedInput }); } } class ComplianceGuardrail implements Guardrail<string, string> { id = "compliance-check"; name = "Compliance Check"; type = "output" as const; enabled = true; execute(input: string, _ctx: ChainContext): Promise<GuardrailResult<string>> { void _ctx; const disclaimerMissing = !input.includes("not legal advice"); return Promise.resolve({ passed: !disclaimerMissing, output: input, metadata: { missingDisclaimer: disclaimerMissing, duration: 0 }, }); } } const BLOCKLIST: string[] = []; class ToxicityGuardrail implements Guardrail<string, string> { id = "toxicity-filter"; name = "Toxicity Filter"; type = "output" as const; enabled = true; execute(input: string, _ctx: ChainContext): Promise<GuardrailResult<string>> { void _ctx; const flaggedTerms = BLOCKLIST.filter((term) => input.toLowerCase().includes(term.toLowerCase())); if (flaggedTerms.length > 0) { return Promise.resolve({ passed: false, output: input, metadata: { flaggedTerms, duration: 0 } }); } return Promise.resolve({ passed: true, output: input }); } } let _chain: ReturnType<typeof ChainBuilder.prototype.build> | null = null; export function getGuardrailChain() { if (!_chain) { _chain = new ChainBuilder() .withBudget({ maxLatencyMs: 1000, maxTokens: 4000 }) .withGuardrail(new PIIScrubGuardrail()) .withGuardrail(new ComplianceGuardrail()) .withGuardrail(new ToxicityGuardrail()) .withSlowGuardrailSkipping(true) .withErrorHandling({ maxRetries: 2, retryDelayMs: 200 }) .build(); } return _chain; } export async function runInputGuardrails(input: string) { const chain = getGuardrailChain(); const result = await chain.execute(input); return { passed: result.success, output: typeof result.output === "string" ? result.output : input }; } export async function runOutputGuardrails(output: string) { const chain = getGuardrailChain(); const result = await chain.execute(output); return { passed: result.success, output: typeof result.output === "string" ? result.output : output }; }

import { CacheEngine, InMemoryAdapter, OpenAIEmbedder } from "@reaatech/llm-cache"; import { env } from "../lib/env.js"; export let _cache: CacheEngine | null = null; export function getCache(): CacheEngine { if (!_cache) { _cache = new CacheEngine({ storage: new InMemoryAdapter(), vectorStorage: new InMemoryAdapter(), embedder: new OpenAIEmbedder({ provider: "openai", model: "text-embedding-3-small", dimensions: 1536, apiKey: env.OPENAI_API_KEY, }), config: { storage: { adapter: "memory" }, vectorStorage: { adapter: "memory" }, embedding: { provider: "openai", model: "text-embedding-3-small", dimensions: 1536, batchSize: 100, maxRetries: 3, }, similarity: { threshold: env.LLM_CACHE_SIMILARITY_THRESHOLD, metric: "cosine", maxResults: 5, }, ttl: { default: 3600, factual: 1800, creative: 7200, analytical: 3600, sensitive: 600, byUseCase: {}, }, segmentation: { enabled: true, defaultUseCase: "intake", }, cost: { enabled: false, currency: "USD", }, observability: { metrics: false, tracing: false, logging: "error", }, }, }); } return _cache; } export async function getCachedOrGenerate( prompt: string, generateFn: () => Promise<string>, opts?: { model?: string; useCase?: string }, ): Promise<string> { const cache = getCache(); const model = opts?.model ?? env.DEFAULT_LLM_MODEL; const useCase = opts?.useCase ?? "intake"; const hit = await cache.get(prompt, { model, modelVersion: model, useCase }); if (hit.hit) { return String(hit.entry.response); } const result = await generateFn(); await cache.set(prompt, { answer: result }, { model, modelVersion: model, useCase }); return result; }

Intake Automation Agent for Solo Immigration Attorney

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Step 2: Configure environment variables with Zod

Step 3: Define the domain types and prompt constants

Step 4: Set up the database with pgvector

Step 5: Build the LLM service with the Vercel AI SDK

Step 6: Build the document parser with pdf-parse and Tesseract.js

Step 7: Implement the guardrail chain

Step 8: Add memory for client conversation context

Step 9: Set up the LLM response cache

Step 10: Build the conflict checker

Step 11: Create the intake agent orchestrator

Step 12: Wire up API routes

Step 13: Run the tests

Next steps