Google Gemini Medical Claim Extraction for SMB Practices

Automatically pull patient demographics, diagnosis codes, and billing line items from scanned claim forms and PDFs, with built-in PII redaction and audit.

google-gemini medical-claim-extraction document-pipeline pii-redaction express llm-cost-tracking

The problem

Small medical practices spend 5–8 hours per week manually rekeying data from faxed or scanned claim forms. Errors in insurance coding cause denials and delayed payments, while privacy regulations demand strict data handling.

Built from

Intro

This recipe builds an end-to-end medical claim extraction pipeline for small medical practices. You’ll create a Next.js API that accepts uploaded claim-form PDFs, extracts text via LlamaParse with an OCR fallback, sends the text to Google Gemini for structured data extraction, repairs and validates the JSON output against a Zod schema, redacts PII through a guardrail chain, tracks cost via LLM cost telemetry, enforces a daily budget cap, and maintains session state across batch processing. The pipeline is backed by BullMQ for async job processing and Supabase for storage.

This tutorial is for developers familiar with TypeScript and Next.js who want to see how multiple REAA packages snap together into a document pipeline.

Prerequisites

Node.js 22+ and pnpm 10+
A Google Gemini API key (get one at https://aistudio.google.com/apikey)
A Supabase project with storage and a claim_extractions table
A LlamaCloud API key (for LlamaParse PDF parsing)
Redis server running locally on port 6379 (for BullMQ)
Basic familiarity with Next.js App Router route handlers

Step 1: Set up environment variables

The scaffold ships a .env.example with placeholder entries. Copy it to .env.local and fill in your credentials:

terminal

cp .env.example .env.local

The file defines every variable the pipeline reads:

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

182 kB·96 tests·98.6% coverage·vitest passing

SHA-256a44a9f379f48b0ef8d42909dce054e53eef23f1a375aa8b84dd9faf8bd6c53a5

Book a conversation All solutions

Comments

Loading comments…

Intro

This tutorial is for developers familiar with TypeScript and Next.js who want to see how multiple REAA packages snap together into a document pipeline.

Prerequisites

Node.js 22+ and pnpm 10+
A Google Gemini API key (get one at https://aistudio.google.com/apikey)
A Supabase project with storage and a claim_extractions table
A LlamaCloud API key (for LlamaParse PDF parsing)
Redis server running locally on port 6379 (for BullMQ)
Basic familiarity with Next.js App Router route handlers

Step 1: Set up environment variables

The scaffold ships a .env.example with placeholder entries. Copy it to .env.local and fill in your credentials:

terminal

cp .env.example .env.local

The file defines every variable the pipeline reads:

import { extractWithOcrFallback } from "../services/pdf-ingest.js"; import { extractClaimWithRetry } from "../services/gemini-extractor.js"; import { repairClaimOutput } from "../services/repair-service.js"; import { redactPii } from "../services/guardrail-service.js"; import { createGeminiCostSpan, recordExtractionSpan } from "../services/cost-telemetry.js"; import { checkExtractionBudget, recordExtractionSpend } from "../services/budget-service.js"; import { createClaimBatchSession, trackClaimProgress, endClaimBatchSession } from "../services/session-service.js"; import { storeExtractionResult } from "../services/storage.js"; import type { ExtractionResult, ClaimForm } from "../schemas/claim.js"; function emptyClaimForm(): ClaimForm { return { patient: { firstName: "", lastName: "", dob: "", gender: "", insuranceId: "" }, diagnoses: [], lineItems: [], providerNpi: "", claimDate: "", totalCharges: 0, }; } export async function processSingleClaim(claimId: string, buffer: Uint8Array): Promise<ExtractionResult> { const budgetCheck = checkExtractionBudget(0.002, "gemini-2.5-flash"); if (!budgetCheck.allowed) { throw new Error("Budget exceeded"); } const { text: rawText } = await extractWithOcrFallback(buffer); const geminiOutput = await extractClaimWithRetry(rawText); const repairResult = repairClaimOutput(geminiOutput); const structuredData: ClaimForm = repairResult.data ?? emptyClaimForm(); const redactedJson = await redactPii(JSON.stringify(structuredData)); const estimatedInputTokens = Math.ceil(rawText.length / 4); const estimatedOutputTokens = Math.ceil(geminiOutput.length / 4); const span = createGeminiCostSpan(estimatedInputTokens, estimatedOutputTokens, "default", "claim-extraction"); await recordExtractionSpan(span); recordExtractionSpend("*", span.costUsd, span.inputTokens, span.outputTokens, "gemini-2.5-flash"); const result: ExtractionResult = { claimId, rawText, structured: JSON.parse(redactedJson) as ClaimForm, confidence: repairResult.success ? 0.9 : 0.5, repairSteps: repairResult.steps.map((s) => s.strategy), fieldErrors: repairResult.fieldErrors ?? [], }; await storeExtractionResult(claimId, result); return result; } export async function processClaimBatch( files: Array<{ claimId: string; buffer: Uint8Array }>, userId: string, ): Promise<ExtractionResult[]> { const session = await createClaimBatchSession(userId); const results: ExtractionResult[] = []; for (const [i, file] of files.entries()) { const { claimId, buffer } = file; const result = await processSingleClaim(claimId, buffer); results.push(result); await trackClaimProgress(session.id, i, files.length, claimId); } await endClaimBatchSession(session.id); return results; }

Google Gemini Medical Claim Extraction for SMB Practices

The problem

Built from

Intro

Prerequisites

Step 1: Set up environment variables

Example artifact

Comments

Intro

Prerequisites

Step 1: Set up environment variables

Step 2: Define the claim extraction schemas

Step 3: Create the PDF ingestion service

Step 4: Create the Google Gemini extraction service

Step 5: Wire up structured-repair-core for output repair

Step 6: Build the PII redaction guardrail chain

Step 7: Track extraction cost with llm-cost-telemetry

Step 8: Cap daily spend with the agent budget engine

Step 9: Track batch sessions with session-continuity

Step 10: Wire up the Supabase storage layer

Step 11: Wire up the BullMQ job queue

Step 12: Wire the ingestion pipeline

Step 13: Create the API route handlers

Step 14: Run the tests

Next steps