Azure AI Document Pipeline for SMB Medical Claim Processing

Ingest EOBs and insurance claims from PDFs, repair malformed LLM outputs, and route low-confidence extractions for human review—all with cost-aware caching.

azure-ai document-pipeline medical-claims nextjs azure-openai azure-document-intelligence structured-repair confidence-router llm-cache cost-telemetry

The problem

Small medical practices and billing companies manually extract data from Explanation of Benefits (EOB) documents and insurance claims, leading to errors, delays, and high administrative costs.

Built from

Intro

This recipe builds an automated document pipeline for medical claim processing. You’ll create a Next.js App Router API that ingests claim PDFs and images, extracts text with Azure Document Intelligence, converts it to structured JSON via Azure OpenAI, repairs malformed LLM outputs with @reaatech/structured-repair-core, routes low-confidence extractions to a Postgres-backed human review queue using @reaatech/confidence-router-core, and enforces daily spend limits with @reaatech/agent-budget-engine.

Prerequisites

Node.js >= 22 and pnpm >= 10
An Azure Document Intelligence resource (endpoint + key)
An Azure OpenAI resource with a GPT-4o deployment (endpoint + key + deployment name)
A Postgres database (local or remote) with a review_queue table
A Redis instance (optional — caching degrades gracefully if absent)
A Langfuse project (optional — instrumentation initializes Langfuse at startup if keys are set)
Familiarity with TypeScript, Next.js App Router, and basic Azure resource provisioning

Step 1: Configure environment variables

The pipeline reads its configuration from environment variables. Every variable has a placeholder in .env.example. Copy the file and fill in your real values:

terminal

cp .env.example .env.local

Open .env.local and replace the placeholders. Here’s the full set of variables the pipeline expects:

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

175 kB·98 tests·95.5% coverage·vitest passing

SHA-256eb1e56ffb52a984432ecd8e67ed7993595986a0e74f341385ab0b89043b960b1

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js >= 22 and pnpm >= 10
An Azure Document Intelligence resource (endpoint + key)
An Azure OpenAI resource with a GPT-4o deployment (endpoint + key + deployment name)
A Postgres database (local or remote) with a review_queue table
A Redis instance (optional — caching degrades gracefully if absent)
A Langfuse project (optional — instrumentation initializes Langfuse at startup if keys are set)
Familiarity with TypeScript, Next.js App Router, and basic Azure resource provisioning

Step 1: Configure environment variables

The pipeline reads its configuration from environment variables. Every variable has a placeholder in .env.example. Copy the file and fill in your real values:

terminal

cp .env.example .env.local

Open .env.local and replace the placeholders. Here’s the full set of variables the pipeline expects:

import "@azure/openai"; import pRetry from "p-retry"; import { ClaimSchema, type ExtractedClaim } from "../types/claim.js"; import type { ProcessingResult } from "../types/document.js"; const MAX_CHARS = 128_000; export async function extractClaimJson(rawText: string): Promise<string> { const truncated = rawText.length > MAX_CHARS ? rawText.slice(0, MAX_CHARS) : rawText; const run = async () => { const endpoint = (process.env.AZURE_OPENAI_ENDPOINT ?? "").replace(/\/$/, ""); const deployment = process.env.AZURE_OPENAI_DEPLOYMENT ?? ""; const url = `${endpoint}/openai/deployments/${deployment}/chat/completions?api-version=${String(2024)}-10-21`; const response = await fetch(url, { method: "POST", headers: { "Content-Type": "application/json", "api-key": process.env.AZURE_OPENAI_KEY ?? "" }, body: JSON.stringify({ messages: [ { role: "system", content: `You are a medical claim extraction assistant. Extract structured JSON from the following document text. The JSON must match this Zod schema exactly: { patientName: string, patientId: string, dateOfService: string, providerName: string, diagnosisCodes: string[], procedureCodes: string[], totalAmount: number, claimNumber: string, insuranceProvider: string } Return ONLY valid JSON without markdown fences or explanation.` }, { role: "user", content: truncated }, ], max_tokens: 4096, temperature: 0.1, }), }); if (!response.ok) { throw new Error(`Azure OpenAI API error ${String(response.status)}`); } const data = await response.json() as { choices?: Array<{ message?: { content?: string | null } }> }; return data.choices?.[0]?.message?.content ?? ""; }; return pRetry(run, { retries: 3 }); } export function mapToSchema(rawJson: string): ProcessingResult<ExtractedClaim> { try { return { success: true, data: ClaimSchema.parse(JSON.parse(rawJson)) }; } catch (err) { return { success: false, error: err instanceof Error ? err.message : "llm returned non-json" }; } }

import { CacheEngine, InMemoryAdapter, type EmbeddingProvider } from "@reaatech/llm-cache"; import { RedisAdapter } from "@reaatech/llm-cache-adapters-redis"; class InMemoryEmbedder implements EmbeddingProvider { embed(_text: string, _expectedDimensions?: number): Promise<number[]> { void _text; void _expectedDimensions; return Promise.resolve(new Array<number>(1536).fill(0)); } embedBatch(_texts: string[], _expectedDimensions?: number): Promise<number[][]> { void _texts; void _expectedDimensions; return Promise.resolve([new Array<number>(1536).fill(0)]); } } export async function createCacheEngine(redisUrl: string): Promise<CacheEngine> { const adapter = new RedisAdapter({ url: redisUrl }); try { await adapter.connect(); } catch { return new CacheEngine({ storage: new InMemoryAdapter(), vectorStorage: new InMemoryAdapter(), embedder: new InMemoryEmbedder(), config: { storage: { adapter: "memory" }, vectorStorage: { adapter: "memory" }, embedding: { provider: "openai", model: "none", dimensions: 1536, batchSize: 100, maxRetries: 3 }, similarity: { threshold: 0.85, metric: "cosine", maxResults: 5 }, ttl: { default: 3600, factual: 1800, creative: 7200, analytical: 3600, sensitive: 600, byUseCase: {} }, segmentation: { enabled: true, defaultUseCase: "claim-extraction" }, cost: { enabled: false, currency: "USD" }, observability: { metrics: false, tracing: false, logging: "error" }, }, }); } const cache = new CacheEngine({ storage: adapter, vectorStorage: new InMemoryAdapter(), embedder: new InMemoryEmbedder(), config: { storage: { adapter: "redis" }, vectorStorage: { adapter: "memory" }, embedding: { // The Zod schema for CacheEngineConfig only allows "openai" as the provider value // even though we use a custom InMemoryEmbedder. This is a known limitation of the type. provider: "openai", model: "none", dimensions: 1536, batchSize: 100, maxRetries: 3, }, similarity: { threshold: 0.85, metric: "cosine", maxResults: 5, }, ttl: { default: 3600, factual: 1800, creative: 7200, analytical: 3600, sensitive: 600, byUseCase: {}, }, segmentation: { enabled: true, defaultUseCase: "claim-extraction" }, cost: { enabled: false, currency: "USD" }, observability: { metrics: false, tracing: false, logging: "error" }, }, }); return cache; } export async function checkCache( cache: CacheEngine, prompt: string, modelVersion: string ) { return cache.get(prompt, { useCase: "claim-extraction", model: "azure-gpt-4o", modelVersion, }); } export async function storeCache( cache: CacheEngine, prompt: string, response: unknown, modelVersion: string ) { return cache.set(prompt, response, { useCase: "claim-extraction", model: "azure-gpt-4o", modelVersion, }); }

import { BudgetController } from "@reaatech/agent-budget-engine"; import { SpendStore } from "@reaatech/agent-budget-spend-tracker"; import { BudgetScope } from "@reaatech/agent-budget-types"; import { generateId } from "@reaatech/llm-cost-telemetry"; import { BudgetExceededError } from "../lib/errors.js"; const DAILY_BUDGET = Number(process.env.BUDGET_DAILY_LIMIT) || 10.0; class InMemorySpendStore extends SpendStore { private _map = new Map<string, number>(); record(entry: { scopeType: string; scopeKey: string; cost: number }): number { const key = `${entry.scopeType}:${entry.scopeKey}`; const current = this._map.get(key) ?? 0; this._map.set(key, current + entry.cost); return 1; } getSpend(scopeType: string, scopeKey: string): number { return this._map.get(`${scopeType}:${scopeKey}`) ?? 0; } getTotal(scopeType: string, scopeKey: string): number { return this._map.get(`${scopeType}:${scopeKey}`) ?? 0; } reset(scopeType: string, scopeKey: string): void { this._map.delete(`${scopeType}:${scopeKey}`); } } const store = new InMemorySpendStore(); const controller = new BudgetController({ spendTracker: store }); controller.defineBudget({ scopeType: BudgetScope.User, scopeKey: "claims-pipeline", limit: DAILY_BUDGET, policy: { softCap: 0.8, hardCap: 1.0 }, }); export function checkBudget(estimatedCost: number) { const result = controller.check({ scopeType: BudgetScope.User, scopeKey: "claims-pipeline", estimatedCost, modelId: "azure-gpt-4o", tools: [], }); if (!result.allowed) { throw new BudgetExceededError("budget exceeded"); } return result; } export function recordSpend(cost: number, inputTokens: number, outputTokens: number): void { controller.record({ requestId: generateId(), scopeType: BudgetScope.User, scopeKey: "claims-pipeline", cost, inputTokens, outputTokens, modelId: "azure-gpt-4o", provider: "azure", timestamp: new Date(), }); } export function getBudgetState() { return controller.getState(BudgetScope.User, "claims-pipeline") ?? { state: "Active" as const, spent: 0, remaining: DAILY_BUDGET }; }

Azure AI Document Pipeline for SMB Medical Claim Processing

The problem

Built from

Intro

Prerequisites

Step 1: Configure environment variables

Example artifact

Comments

Intro

Prerequisites

Step 1: Configure environment variables

Step 2: Install dependencies and verify the scaffold

Step 3: Define the claim schema and shared types

Step 4: Create utility modules

Step 5: Build the Azure Document Intelligence service

Step 6: Build the Azure OpenAI LLM service

Step 7: Build the claim repair service

Step 8: Build the confidence router

Step 9: Build the LLM cache service with Redis

Step 10: Build cost telemetry and budget enforcement

Step 11: Build the image preprocessor and review queue

Step 12: Build the pipeline orchestrator

Step 13: Create the API routes

Step 14: Configure instrumentation and Next.js

Step 15: Run the type checker and tests

Next steps