Auto-Takeoff Agent for Small GC Bid Prep

Convert plan sets to BOM + sub RFPs in minutes, not days.

construction-estimating document-pipeline bill-of-materials request-for-proposal nextjs fastify typescript openai ocr takeoff-agent

The problem

A GC estimator spends days manually measuring plan sets and spec docs to produce a bill of materials and subcontractor RFPs. Errors in takeoff lead to underbid losses or overbid rejections. The estimator needs a way to automate extraction of quantities, materials, and specs from PDFs and images, then generate structured RFPs for subs.

Built from

Intro

This tutorial builds an Auto-Takeoff Agent — a system that automates the construction estimating workflow. Given a set of architectural plan documents (PDFs, images), it runs OCR to extract text and tables, uses an LLM to interpret the data and produce a structured Bill of Materials (BOM), groups BOM line items by trade, and generates subcontractor Request for Proposal (RFP) documents. It also enforces per-job and per-user spending budgets and caches LLM results to save time and cost on repeated inputs.

You’ll use a Next.js App Router frontend and API layer backed by six REAA packages that each handle one piece of the pipeline: document extraction, LLM caching, budget enforcement, task persistence, agent mesh types, and markdown validation. You’ll also add a Fastify server alongside Next.js for the heavy pipeline endpoints. By the end you’ll have a working system you can test with a single POST request.

Prerequisites

Node.js 22+ and pnpm 10
An OpenAI API key (set as OPENAI_API_KEY in .env)
Basic familiarity with Next.js App Router, TypeScript, and Zod schemas

Step 1: Scaffold the project

Start with an empty directory and create the project skeleton. This is a Next.js 16+ App Router project with TypeScript. Create package.json with exact-pinned dependencies:

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

188 kB·127 tests·96.5% coverage·vitest passing

SHA-256c33a0b304bc0753273d844dc108829652e27e7b4e6ed9537714e1ca28e1f9684

Book a conversation All solutions

Comments

Loading comments…

import { createDocumentExtractionOperations } from "@reaatech/media-pipeline-mcp-doc-extraction"; export class DocumentExtractionError extends Error { constructor( message: string, public readonly artifactId: string, public readonly operation: string, cause?: unknown, ) { super(message); this.name = "DocumentExtractionError"; if (cause instanceof Error) { this.cause = cause; } } } export interface OcrOptions { format?: "plain-text" | "structured-json" | "markdown"; language?: string; } export interface TableOptions { outputFormat?: "markdown" | "json"; } export interface FieldSchema { name: string; type: "string" | "number" | "date" | "boolean" | "array"; description?: string; } export interface SummarizeOptions { length?: "short" | "medium" | "long"; style?: "bullet-points" | "paragraph" | "executive"; } export function createDocumentExtractionService( artifactRegistry: never, storage: never, ) { return createDocumentExtractionOperations(artifactRegistry, storage); } export async function extractDocumentText( ops: ReturnType<typeof createDocumentExtractionOperations>, artifactId: string, options?: OcrOptions, ) { try { return await ops.ocr({ artifactId, format: options?.format ?? "markdown", language: options?.language, }); } catch (err) { throw new DocumentExtractionError( `OCR failed for ${artifactId}`, artifactId, "ocr", err, ); } } export async function extractTables( ops: ReturnType<typeof createDocumentExtractionOperations>, artifactId: string, options?: TableOptions, ) { try { return await ops.extractTables({ artifactId, outputFormat: options?.outputFormat ?? "json", }); } catch (err) { throw new DocumentExtractionError( `Table extraction failed for ${artifactId}`, artifactId, "extractTables", err, ); } } export async function extractStructuredFields( ops: ReturnType<typeof createDocumentExtractionOperations>, artifactId: string, fields: FieldSchema[], ) { try { return await ops.extractFields({ artifactId, fields }); } catch (err) { throw new DocumentExtractionError( `Field extraction failed for ${artifactId}`, artifactId, "extractFields", err, ); } } export async function summarizeContent( ops: ReturnType<typeof createDocumentExtractionOperations>, artifactId: string, options?: SummarizeOptions, ) { try { return await ops.summarize({ artifactId, length: options?.length ?? "medium", style: options?.style ?? "paragraph", }); } catch (err) { throw new DocumentExtractionError( `Summarization failed for ${artifactId}`, artifactId, "summarize", err, ); } }

import { BudgetController } from "@reaatech/agent-budget-engine"; import { SpendStore } from "@reaatech/agent-budget-spend-tracker"; import { BudgetScope } from "@reaatech/agent-budget-types"; export interface SpendRecord { requestId: string; scopeType: BudgetScope; scopeKey: string; cost: number; inputTokens: number; outputTokens: number; modelId: string; provider: string; } export function createBudgetManager() { const store = new SpendStore(); const controller = new BudgetController({ spendTracker: store }); controller.defineBudget({ scopeType: BudgetScope.User, scopeKey: "*", limit: 5.0, policy: { softCap: 0.8, hardCap: 1.0 }, }); controller.defineBudget({ scopeType: BudgetScope.User, scopeKey: "plan-set", limit: 1.0, policy: { softCap: 0.8, hardCap: 1.0 }, }); controller.on("hard-stop", (event: { scopeType: string; scopeKey: string; spent: number; limit: number }) => { console.error( `Budget hard-stop: ${event.scopeType}:${event.scopeKey} exhausted (${String(event.spent)}/${String(event.limit)})`, ); }); controller.on("threshold-breach", (event: { scopeType: string; scopeKey: string; threshold: number }) => { const pct = Math.round(event.threshold * 100); if (pct >= 80) { console.warn( `Budget at ${String(pct)}% for ${event.scopeType}:${event.scopeKey}`, ); } }); return controller; } export function preflightCheck( controller: BudgetController, scope: { type: BudgetScope; key: string }, estimatedCost: number, modelId: string, ) { return controller.check({ scopeType: scope.type, scopeKey: scope.key, estimatedCost, modelId, tools: [], }); } export function recordSpend( controller: BudgetController, entry: SpendRecord, ) { controller.record({ requestId: entry.requestId, scopeType: entry.scopeType, scopeKey: entry.scopeKey, cost: entry.cost, inputTokens: entry.inputTokens, outputTokens: entry.outputTokens, modelId: entry.modelId, provider: entry.provider, timestamp: new Date(), }); } export function getBudgetStatus( controller: BudgetController, scopeType: BudgetScope, scopeKey: string, ) { const state = controller.getState(scopeType, scopeKey); return { spent: state?.spent ?? 0, remaining: state?.remaining ?? 0, state: state?.state ?? "unknown", }; }

import type { BillOfMaterials } from "../types/bom.js"; import type { MaterialItem } from "../types/material.js"; import type { TradeScope } from "../types/trade-scope.js"; import type { RfpDocument } from "../types/rfp.js"; import { generateRfp as generateRfpLlm } from "./llm-processor.js"; import { createJob } from "./task-persistence.js"; import { BudgetScope } from "@reaatech/agent-budget-types"; export interface RfpGenerationServices { taskStore: { create: (task: { id: string; status: string; rfp?: unknown; }) => Promise<void>; }; budgetController?: { record: (entry: { requestId: string; scopeType: string; scopeKey: string; cost: number; inputTokens: number; outputTokens: number; modelId: string; provider: string; timestamp: Date; }) => Promise<void>; }; } export interface RfpGenerationResult { rfps: RfpDocument[]; summary: { totalTrades: number; totalItems: number; estimatedValue: number; }; } export async function generateSubcontractorRfps( bom: BillOfMaterials, projectContext: Record<string, string> | undefined, services: RfpGenerationServices, ): Promise<RfpGenerationResult> { if (bom.items.length === 0) { return { rfps: [], summary: { totalTrades: 0, totalItems: 0, estimatedValue: 0 }, }; } const tradeGroups = new Map<string, MaterialItem[]>(); for (const item of bom.items) { const existing = tradeGroups.get(item.category) ?? []; existing.push(item); tradeGroups.set(item.category, existing); } const rfps: RfpDocument[] = []; for (const [trade, items] of tradeGroups) { const tradeScope: TradeScope = { trade, lineItems: items, }; const scopeOfWork = await generateRfpLlm(tradeScope, projectContext); void services.budgetController?.record({ requestId: `rfp-${trade.toLowerCase().replace(/\s+/g, "-")}-${String(Date.now())}`, scopeType: BudgetScope.User, scopeKey: trade, cost: 0.001, inputTokens: 100, outputTokens: 30, modelId: "gpt-5.2", provider: "openai", timestamp: new Date(), }); const rfp: RfpDocument = { trade, scopeOfWork, bidDueBy: projectContext?.bidDueBy ? new Date(projectContext.bidDueBy) : new Date(Date.now() + 14 * 24 * 60 * 60 * 1000), status: "pending", }; rfps.push(rfp); await createJob(services.taskStore as never, { id: `rfp-${trade.toLowerCase().replace(/\s+/g, "-")}-${String(Date.now())}`, status: "pending", rfp, }); } const totalItems = bom.items.length; return { rfps, summary: { totalTrades: tradeGroups.size, totalItems, estimatedValue: 0, }, }; }

Auto-Takeoff Agent for Small GC Bid Prep

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the project

Step 2: Define the domain types with Zod schemas

Step 3: Build the document extraction service

Step 4: Add LLM caching with cache-manager

Step 5: Wire up the budget engine

Step 6: Create task persistence

Step 7: Implement the LLM processor

Step 8: Orchestrate the takeoff pipeline

Step 9: Generate subcontractor RFPs from the BOM

Step 10: Wire Next.js API routes

Step 11: Add the home page and Fastify server

Step 12: Run the tests and verify

Next steps