Field Punch-List Agent for GC Superintendents

Capture photo + voice memo punch items that sync directly to PM tools.

construction voice-agent punch-list nextjs hono deepgram openai field-capture

The problem

A superintendent on a jobsite finds defects but has to manually photograph, write notes, and later transcribe into project management software. Items get lost or delayed, causing rework and owner frustration. The superintendent needs a mobile-first agent that ingests photos and voice memos, extracts actionable items, and syncs them to the PM system with status tracking.

Built from

Intro

This tutorial walks you through building a Field Punch-List Agent for general contractor superintendents. You’ll create a Next.js 16 + Hono voice-agent that accepts audio recordings and photos from a jobsite, transcribes them with Deepgram, extracts actionable punch items using an LLM, runs them through content guardrails, stores them in agent memory, and syncs them to an external project management tool via webhook. By the end, you’ll have a fully tested API that a mobile field app can call to capture and track construction defects in real time.

Prerequisites

Node.js 22+ and pnpm 10 installed on your machine
A Deepgram API key for speech-to-text transcription (Nova-2 model)
An OpenAI API key for punch-item extraction via generateText from the Vercel AI SDK
A Langfuse account (free tier works) for observability tracing
A webhook URL from your project management tool (or a placeholder for testing)
Familiarity with TypeScript, Next.js App Router, and basic Express/Hono patterns

Step 1: Scaffold the Next.js project and install dependencies

Start with a fresh Next.js 16 project using the App Router. Create the directory and initialize it:

terminal

npx create-next-app@16 --typescript --app --import-alias "@/*" --use-pnpm agnostic-punch-list-field-capture

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

206 kB·135 tests·95.5% coverage·vitest passing

SHA-2563e9dedd9e9727deef3d887cee384114c71991ec5f8fc13260ca5478f94f7772c

Book a conversation All solutions

Comments

Loading comments…

import { DeepgramProvider } from "@reaatech/media-pipeline-mcp-deepgram"; import { ValidationError, TranscriptionError } from "../lib/errors.js"; let provider: DeepgramProvider | null = null; export function createTranscriptionProvider(): DeepgramProvider { const apiKey = process.env.DEEPGRAM_API_KEY; if (!apiKey) { throw new ValidationError("DEEPGRAM_API_KEY is required"); } provider = new DeepgramProvider({ apiKey, models: { stt: "nova-2" } }); return provider; } export function getTranscriptionProvider(): DeepgramProvider { if (!provider) { return createTranscriptionProvider(); } return provider; } interface DeepgramSttParsed { transcript?: string; confidence?: number; segments?: unknown[]; error?: string; } interface TranscriptResult { transcript: string; confidence: number; segments: unknown[]; } export async function transcribeAudio( audioBuffer: Buffer, language = "en", ): Promise<TranscriptResult> { if (audioBuffer.length === 0) { throw new ValidationError("Audio buffer is empty"); } const prov = getTranscriptionProvider(); const result = await prov.execute({ operation: "audio.stt", params: { audio_data: audioBuffer, language, diarize: true }, config: {}, }); if (!Buffer.isBuffer(result.data)) { throw new TranscriptionError("Expected Buffer response from Deepgram"); } const parsed = JSON.parse(result.data.toString()) as DeepgramSttParsed; if (parsed.error) { throw new TranscriptionError(`Deepgram API error: ${parsed.error}`); } return { transcript: parsed.transcript ?? "", confidence: parsed.confidence ?? 0, segments: parsed.segments ?? [], }; } interface DeepgramDiarizationParsed { speakers?: number; segments?: Array<{ speaker: number; text: string; start: number; end: number; confidence: number; }>; } interface DiarizationResult { speakers: number; segments: Array<{ speaker: number; text: string; start: number; end: number; confidence: number; }>; } export async function transcribeWithDiarization( audioBuffer: Buffer, ): Promise<DiarizationResult> { const prov = getTranscriptionProvider(); const result = await prov.execute({ operation: "audio.diarize", params: { audio_data: audioBuffer, language: "en" }, config: {}, }); if (!Buffer.isBuffer(result.data)) { throw new TranscriptionError("Expected Buffer response from Deepgram"); } const parsed = JSON.parse(result.data.toString()) as DeepgramDiarizationParsed; return { speakers: parsed.speakers ?? 0, segments: parsed.segments ?? [], }; } export async function checkProviderHealth(): Promise<{ healthy: boolean; latency: number }> { const prov = getTranscriptionProvider(); const health = await prov.healthCheck(); return { healthy: health.healthy, latency: health.latency ?? 0 }; }

export async function extractPunchItemsFromPhotoDescription( photoAnalysisText: string, ): Promise<{ items: PunchItem[]; summary: string }> { if (!photoAnalysisText || photoAnalysisText.trim().length === 0) { return { items: [], summary: "" }; } try { const result = await generateText({ model: "openai/gpt-5.2", output: Output.object({ schema: PunchListExtractionSchema }), system: PHOTO_SYSTEM_PROMPT, prompt: photoAnalysisText, }); const parsed = PunchListExtractionSchema.safeParse(result.output); let rawItems: Array<{ title: string; description: string; location?: string; severity: "low" | "medium" | "high" | "critical"; category?: string }> = []; let summary = ""; if (parsed.success) { rawItems = parsed.data.items; summary = parsed.data.summary; } else { const raw = result.output; if (Array.isArray(raw.items)) { const filtered = raw.items .map(item => PunchItemSchema.safeParse(item)) .filter((r): r is { success: true; data: { title: string; description: string; location?: string; severity: "low" | "medium" | "high" | "critical"; category?: string } } => r.success) .map(r => r.data); const failedCount = raw.items.length - filtered.length; if (failedCount > 0) { console.error(`Filtered out ${String(failedCount)} malformed punch items during photo extraction`); } rawItems = filtered; } summary = typeof raw.summary === "string" ? raw.summary : ""; } const timestamp = new Date().toISOString(); const items: PunchItem[] = rawItems.map(item => ({ id: crypto.randomUUID(), title: item.title, description: item.description, location: item.location, severity: item.severity, status: "open" as const, photoUrls: [], projectId: "default", createdAt: timestamp, updatedAt: timestamp, })); return { items, summary }; } catch (err) { throw new PipelineError( `Photo extraction failed: ${err instanceof Error ? err.message : "unknown error"}`, ); } } export async function classifyPunchItemSeverity( description: string, ): Promise<"low" | "medium" | "high" | "critical"> { const severitySchema = z.object({ severity: z.enum(["low", "medium", "high", "critical"]) }); try { const result = await generateText({ model: "openai/gpt-5.2", output: Output.object({ schema: severitySchema }), system: SEVERITY_SYSTEM_PROMPT, prompt: description, }); const parsed = severitySchema.parse(result.output); return parsed.severity; } catch { return "medium"; } }

import { NextRequest, NextResponse } from "next/server"; import { z } from "zod"; import type { PunchItem } from "../../../src/types.js"; import { AppError } from "../../../src/lib/errors.js"; import { getAllPunchItems, createPunchItem } from "../../../src/lib/punch-item-store.js"; import { createMemoryService, storePunchItemMemory } from "../../../src/services/memory-service.js"; const CreatePunchItemSchema = z.object({ projectId: z.string().min(1), title: z.string().min(1), description: z.string().min(1), location: z.string().optional(), severity: z.enum(["low", "medium", "high", "critical"]).optional(), photoUrls: z.array(z.string()).optional(), }); export function GET(req: NextRequest) { try { const projectId = req.nextUrl.searchParams.get("projectId"); const status = req.nextUrl.searchParams.get("status"); const items = getAllPunchItems(); const filtered = items.filter(item => { if (projectId && item.projectId !== projectId) return false; if (status && item.status !== status) return false; return true; }); return NextResponse.json(filtered); } catch (err) { if (err instanceof AppError) { return NextResponse.json({ error: err.code, message: err.message }, { status: err.statusCode }); } return NextResponse.json({ error: "internal_error", message: "Internal server error" }, { status: 500 }); } } export async function POST(req: NextRequest) { try { const body: unknown = await req.json(); const parsed = CreatePunchItemSchema.safeParse(body); if (!parsed.success) { const details = parsed.error.issues.map((i) => ({ path: i.path.join("."), message: i.message })); return NextResponse.json({ error: "validation_error", details }, { status: 400 }); } const now = new Date().toISOString(); const item: PunchItem = { id: crypto.randomUUID(), title: parsed.data.title, description: parsed.data.description, location: parsed.data.location, severity: parsed.data.severity ?? "medium", status: "open", photoUrls: parsed.data.photoUrls ?? [], projectId: parsed.data.projectId, createdAt: now, updatedAt: now, }; createPunchItem(item); createMemoryService(); await storePunchItemMemory(`project-${item.projectId}`, item); return NextResponse.json(item, { status: 201 }); } catch (err) { if (err instanceof AppError) { return NextResponse.json({ error: err.code, message: err.message }, { status: err.statusCode }); } if (err instanceof SyntaxError) { return NextResponse.json({ error: "validation_error", message: "Invalid JSON body" }, { status: 400 }); } return NextResponse.json({ error: "internal_error", message: "Internal server error" }, { status: 500 }); } }

import { NextRequest, NextResponse } from "next/server"; import { z } from "zod"; import { AppError } from "../../../../src/lib/errors.js"; import { getPunchItemById, updatePunchItem } from "../../../../src/lib/punch-item-store.js"; const UpdatePunchItemSchema = z.object({ title: z.string().min(1).optional(), description: z.string().min(1).optional(), location: z.string().optional(), severity: z.enum(["low", "medium", "high", "critical"]).optional(), status: z.enum(["open", "in_progress", "resolved", "flagged"]).optional(), photoUrls: z.array(z.string()).optional(), }); export async function GET( _req: NextRequest, { params }: { params: Promise<{ id: string }> }, ) { try { const { id } = await params; const item = getPunchItemById(id); if (!item) { return NextResponse.json({ error: "not_found", message: "Punch item not found" }, { status: 404 }); } return NextResponse.json(item); } catch (err) { if (err instanceof AppError) { return NextResponse.json({ error: err.code, message: err.message }, { status: err.statusCode }); } return NextResponse.json({ error: "internal_error", message: "Internal server error" }, { status: 500 }); } } export async function PATCH( req: NextRequest, { params }: { params: Promise<{ id: string }> }, ) { try { const { id } = await params; const body: unknown = await req.json(); const parsed = UpdatePunchItemSchema.safeParse(body); if (!parsed.success) { const details = parsed.error.issues.map((i) => ({ path: i.path.join("."), message: i.message })); return NextResponse.json({ error: "validation_error", details }, { status: 400 }); } const existing = getPunchItemById(id); if (!existing) { return NextResponse.json({ error: "not_found", message: "Punch item not found" }, { status: 404 }); } const updatedItem = updatePunchItem(id, parsed.data); return NextResponse.json(updatedItem); } catch (err) { if (err instanceof AppError) { return NextResponse.json({ error: err.code, message: err.message }, { status: err.statusCode }); } if (err instanceof SyntaxError) { return NextResponse.json({ error: "validation_error", message: "Invalid JSON body" }, { status: 400 }); } return NextResponse.json({ error: "internal_error", message: "Internal server error" }, { status: 500 }); } }

Field Punch-List Agent for GC Superintendents

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Step 2: Configure environment variables with Zod validation

Step 3: Define the domain types and error hierarchy

Step 4: Create the Deepgram transcription service

Step 5: Build the LLM-powered punch-list extractor

Step 6: Set up the agent memory service

Step 7: Implement content guardrails and API auth

Step 8: Build the webhook sync service with retry and dedup

Step 9: Wire the voice pipeline

Step 10: Create the observability service

Step 11: Create the Hono mobile API server

Step 12: Add the Next.js REST API routes for item management

Step 13: Set up Next.js instrumentation

Step 14: Run the tests

Next steps