Vertex AI Contract Pay App Extraction for SMB Construction

Extract line items, retainage, and change orders from contractor payment applications using Vertex AI and automated repair.

vertex-ai document-pipeline construction pay-app express typescript pdf-extraction confidence-router

The problem

SMB construction project managers spend hours manually entering payment application data from PDFs into accounting software, leading to errors, delayed payments, and cash flow issues.

Built from

Intro

Construction project managers at small and medium businesses spend hours manually keying payment application data from contractor PDFs into accounting software. This recipe builds a Next.js app that automates that process: you upload a PDF, it extracts line items, retainage, and change orders using Vertex AI’s Gemini model, repairs noisy OCR output with structured repair, flags low-confidence fields for human review, and tracks cost per extraction — all through a single POST endpoint.

Prerequisites

Node.js 22+ and pnpm 10 installed on your machine
A Google Cloud Project with the Vertex AI API enabled
A service account key (JSON) for Vertex AI authentication; set GOOGLE_APPLICATION_CREDENTIALS to its path
Basic familiarity with TypeScript and Next.js App Router conventions

Step 1: Scaffold the project and install dependencies

Create a Next.js 16 project with the App Router. The scaffold gives you package.json, tsconfig.json, next.config.ts, and the app/ directory. After scaffolding, add all the dependencies.

terminal

npx create-next-app@latest vertex-ai-contract-pay-app-extraction --typescript --eslint --app --src-dir --import-alias "@/*" --use-pnpm

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

174 kB·63 tests·99.5% coverage·vitest passing

SHA-2566798ca3c27f17e9e91256646f8fb0af82178775f8c06999355a99e4db9443de2

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js 22+ and pnpm 10 installed on your machine
A Google Cloud Project with the Vertex AI API enabled
A service account key (JSON) for Vertex AI authentication; set GOOGLE_APPLICATION_CREDENTIALS to its path
Basic familiarity with TypeScript and Next.js App Router conventions

Step 1: Scaffold the project and install dependencies

Create a Next.js 16 project with the App Router. The scaffold gives you package.json, tsconfig.json, next.config.ts, and the app/ directory. After scaffolding, add all the dependencies.

terminal

npx create-next-app@latest vertex-ai-contract-pay-app-extraction --typescript --eslint --app --src-dir --import-alias "@/*" --use-pnpm

"use client"; import { useState, type FormEvent } from "react"; export default function Home() { const [file, setFile] = useState<File | null>(null); const [loading, setLoading] = useState(false); const [result, setResult] = useState<Record<string, unknown> | null>(null); const [error, setError] = useState<string | null>(null); async function handleSubmit(e: FormEvent<HTMLFormElement>) { e.preventDefault(); if (!file) return; setLoading(true); setError(null); setResult(null); try { const formData = new FormData(); formData.append("file", file); const res = await fetch("/api/upload", { method: "POST", body: formData }); const data: Record<string, unknown> = await res.json() as Record<string, unknown>; if (!res.ok) { setError(typeof data.error === "string" ? data.error : "Unknown error"); } else { setResult(data); } } catch (err) { setError(String(err)); } finally { setLoading(false); } } return ( <main style={{ maxWidth: 720, margin: "0 auto", padding: "2rem 1rem", fontFamily: "system-ui, sans-serif" }}> <h1>Contract Pay App Extraction</h1> <p style={{ color: "#555" }}> Upload a contractor payment application PDF to extract line items, retainage, and change orders. </p> <form onSubmit={(e: FormEvent<HTMLFormElement>) => { void handleSubmit(e); }} style={{ margin: "1.5rem 0" }}> <input type="file" accept=".pdf,application/pdf" onChange={(e) => { setFile(e.target.files?.[0] ?? null); }} style={{ display: "block", marginBottom: "0.75rem" }} /> <button type="submit" disabled={!file || loading} style={{ padding: "0.5rem 1.5rem", cursor: file ? "pointer" : "not-allowed", opacity: file ? 1 : 0.5, }} > {loading ? "Processing..." : "Extract"} </button> </form> {error && ( <div style={{ color: "red", padding: "1rem", background: "#ffeeee", borderRadius: 4 }}> {error} </div> )} {result && ( <div style={{ marginTop: "1.5rem" }}> <h2>Extraction Result</h2> <div> {(result.routingDecision as Record<string, unknown> | undefined)?.needsReview ? ( <div style={{ color: "#cc0000", padding: "0.75rem", background: "#fff0f0", borderRadius: 4, marginBottom: "1rem" }}> Needs Review — Some fields have low confidence scores. </div> ) : null} </div> <pre style={{ background: "#f5f5f5", padding: "1rem", borderRadius: 4, overflowX: "auto", fontSize: "0.85rem" }}> {JSON.stringify(result, null, 2)} </pre> </div> )} </main> ); }

import { describe, it, expect } from "vitest"; import { LineItemSchema, ChangeOrderSchema, PaymentApplicationSchema } from "../../src/services/payment-app-schema.js"; const validLineItem = { itemNumber: "001", description: "Foundation work", scheduledValue: 50000, workCompletedFromStart: 25000, workCompletedThisPeriod: 5000, materialsStored: 3000, totalCompletedAndStored: 28000, balanceToFinish: 22000, }; const validChangeOrder = { changeOrderNumber: "CO-001", description: "Extra foundation depth", amount: 5000, approved: true, }; const validPaymentApp = { id: "pa-001", projectId: "proj-123", contractorName: "ABC Construction", applicationDate: "2025-06-01", periodFrom: "2025-05-01", periodTo: "2025-05-31", lineItems: [validLineItem], retainagePercentage: 10, retainageAmount: 5000, changeOrders: [validChangeOrder], totalWorkCompleted: 50000, totalRetainage: 5000, netPaymentDue: 45000, status: "extracted" as const, confidence: 0.95, extractedAt: "2025-06-01T12:00:00Z", }; describe("LineItemSchema", () => { it("parses a valid line item", () => { const result = LineItemSchema.parse(validLineItem); expect(result.description).toBe("Foundation work"); expect(result.scheduledValue).toBe(50000); }); }); describe("ChangeOrderSchema", () => { it("parses a valid change order", () => { const result = ChangeOrderSchema.parse(validChangeOrder); expect(result.changeOrderNumber).toBe("CO-001"); expect(result.approved).toBe(true); }); }); describe("PaymentApplicationSchema", () => { it("parses a valid payment application", () => { const result = PaymentApplicationSchema.parse(validPaymentApp); expect(result.contractorName).toBe("ABC Construction"); expect(result.lineItems).toHaveLength(1); expect(result.changeOrders).toHaveLength(1); }); it("rejects missing contractorName", () => { const invalid = { ...validPaymentApp, contractorName: undefined }; expect(() => PaymentApplicationSchema.parse(invalid)).toThrow(); }); it("accepts empty lineItems array", () => { const result = PaymentApplicationSchema.parse({ ...validPaymentApp, lineItems: [] }); expect(result.lineItems).toHaveLength(0); }); it("rejects retainagePercentage over 100", () => { const invalid = { ...validPaymentApp, retainagePercentage: 150 }; expect(() => PaymentApplicationSchema.parse(invalid)).toThrow(); }); });

Vertex AI Contract Pay App Extraction for SMB Construction

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Step 2: Define the payment application data schema

Step 3: Build the PDF text extractor

Step 4: Create the Vertex AI provider

Step 5: Implement storage adapters for session continuity

Step 6: Create the token counter and session manager

Step 7: Build the cost tracker

Step 8: Set up the confidence router

Step 9: Wire the extraction pipeline

Step 10: Create the API route and upload form

Step 11: Write and run the tests

Next steps