Construction SMBs store technical specs in scattered PDFs and in employees' heads, forcing builders to pause work and call the office. A simple RAG system would give them hands‑free answers, but off‑the‑shelf tools return shallow results from dense technical documents.
You will build a hybrid RAG (Retrieval-Augmented Generation) pipeline that lets construction field crews query building specifications, codes, and submittals stored as PDFs in S3. The system uses Voyage AI embeddings, Qdrant as the vector store, BM25 keyword search, a Cohere cross-encoder reranker, and Claude Haiku to generate answers constrained to the source documents.
By the end you will have a Next.js API with two endpoints: one to ingest PDFs from S3 into Qdrant, and one to query the knowledge base with natural language.
Prerequisites
Node.js 22+ installed
pnpm 10+ installed (the project uses pnpm)
A Qdrant instance running (easiest with Docker: docker run -p 6333:6333 qdrant/qdrant)
An S3 bucket with at least one PDF file (or a bucket you can upload test PDFs to)
API keys for: Anthropic, Voyage AI, Cohere, and AWS S3
A Langfuse account for observability (optional — the code degrades gracefully without it)
Step 1: Clone the project and install dependencies
The project already has a scaffolded Next.js 16 (App Router) structure. You do not need to create it from scratch.
terminal
cd /home/rick/solutions-worker/builds/24186da5-ecbc-4687-b235-c3dac0a40bf3
pnpm install
Expected output: pnpm lists all installed packages and finishes without errors. Peer dependency warnings, if any, are safe to ignore.
Step 2: Configure environment variables
Copy the example env file and fill in your API keys.
terminal
cp .env.example .env
Open .env and set the values. Here is what each variable controls:
env
# Qdrant connection
QDRANT_URL=http://localhost:6333
QDRANT_API_KEY=   # leave blank if Qdrant has no auth

# Anthropic Claude
ANTHROPIC_API_KEY=

# Voyage AI embeddings
VOYAGE_API_KEY=

# Cohere reranker
COHERE_API_KEY=

# AWS S3
AWS_REGION=us-east-1
AWS_ACCESS_KEY_ID=
AWS_SECRET_ACCESS_KEY=
S3_BUCKET_NAME=   # your bucket name, e.g. my-construction-specs

# Protect the ingest endpoint
INGEST_API_KEY=change-me-to-a-secret-value

# Langfuse observability (optional — leave blank to disable)
LANGFUSE_PUBLIC_KEY=
LANGFUSE_SECRET_KEY=
Step 3: Explore the source layout
Here is what was built for you:
text
app/
  api/
    chat/route.ts       # POST /api/chat — query the knowledge base
    ingest/route.ts     # POST /api/ingest — trigger PDF ingestion
src/
  lib/
    embedding.ts        # VoyageEmbeddingAdapter wraps voyageai client
    ingestion.ts        # IngestionService: S3 → pdf-parse → chunk → Qdrant
    retrieval.ts        # HybridRetriever wiring + VoyageEmbeddingBridge
    generation.ts       # Claude Haiku answer generation (sync + streaming)
    observability.ts    # Langfuse tracing (no-op when keys are absent)
  scripts/
    ingest.ts           # CLI: pnpm ingest [prefix]
  types/
    index.ts            # ChatRequest, ChatResponse, IngestionResult types
Step 4: Run the test suite
The project ships with a full test suite covering every module. Run it to verify the build is healthy.
terminal
pnpm test
Expected output: vitest reports all tests passing, with a vitest-report.json showing numFailedTests: 0 and coverage thresholds above 90% for lines, branches, functions, and statements on the src/ and app/api/ paths.
Step 5: Type-check and lint
Two independent quality gates guard the codebase.
terminal
pnpm typecheck
pnpm lint
Expected output: both commands exit 0 with no errors or warnings.
Step 6: Ingest PDF specs from S3
With your S3 bucket set up and credentials in .env, run the CLI ingestion script.
terminal
pnpm ingest
You can optionally narrow the scope to a prefix:
terminal
pnpm ingest "specs/2024/"
Expected output: the script prints a summary such as Ingested 3 files, 47 chunks created, 0 errors.
What happens inside the script (src/scripts/ingest.ts): the IngestionService class (src/lib/ingestion.ts) lists PDF objects from the bucket, downloads each one, extracts text with pdf-parse, splits the text into overlapping chunks, generates embeddings, and upserts each chunk into Qdrant.
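The chunking step can be sketched as a small pure function. This is an illustration of the overlapping-chunk idea, not the project's actual implementation; the chunk size and overlap values are assumptions.

```typescript
// Illustrative overlapping chunker. The 800/200 defaults are assumptions,
// not the values IngestionService actually uses.
export function chunkText(text: string, chunkSize = 800, overlap = 200): string[] {
  if (overlap >= chunkSize) throw new Error("overlap must be smaller than chunkSize");
  const chunks: string[] = [];
  let start = 0;
  while (start < text.length) {
    const end = Math.min(start + chunkSize, text.length);
    chunks.push(text.slice(start, end));
    if (end === text.length) break;
    start = end - overlap; // step back so consecutive chunks share `overlap` characters
  }
  return chunks;
}
```

Overlap matters for retrieval quality: a spec clause split across a chunk boundary still appears intact in at least one chunk.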
Step 7: Query the knowledge base
Start the dev server in one terminal:
terminal
pnpm dev
In another terminal, send a natural-language query to the chat endpoint:
terminal
curl -X POST http://localhost:3000/api/chat \
  -H "Content-Type: application/json" \
  -d '{"query": "What is the concrete compressive strength requirement?"}'
Expected output:
json
{
  "answer": "The concrete compressive strength is 3000 psi at 28 days (per Section 3.2 of doc-001).",
  "sources": [
    {
      "content": "Concrete compressive strength: 3000 psi minimum at 28 days per ASTM C39.",
      "documentId": "specs/2024/section-3.pdf",
      "score": 0.94,
      "source": "vector"
    }
  ]
}
The POST handler in app/api/chat/route.ts validates the request body against a Zod schema, calls getRetriever() to get the lazy singleton HybridRetriever, invokes retriever.retrieve(query, { retrievalMode, topK }), maps the results, and passes them to generateAnswer. If the Accept header includes text/event-stream, the route returns a streaming response instead.
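In simplified form, the handler's two gatekeeping checks look like this. A plain type guard stands in for the actual Zod schema, and the optional retrievalMode/topK fields are assumptions about the request shape.

```typescript
// Assumed request shape; the real Zod schema in app/api/chat/route.ts may differ.
type ChatRequest = {
  query: string;
  retrievalMode?: "hybrid" | "vector" | "keyword";
  topK?: number;
};

// Hand-rolled stand-in for the route's Zod validation: the body must be an
// object with a non-empty string `query`.
export function isChatRequest(body: unknown): body is ChatRequest {
  if (typeof body !== "object" || body === null) return false;
  const b = body as Record<string, unknown>;
  return typeof b.query === "string" && b.query.length > 0;
}

// The streaming branch keys off the Accept header.
export function wantsStream(accept: string | null): boolean {
  return accept !== null && accept.includes("text/event-stream");
}
```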
Step 8: Inspect the embedding adapter
The VoyageEmbeddingAdapter in src/lib/embedding.ts wraps the raw VoyageAIClient from the voyageai package. It adds cost tracking and automatic batching with 429 retry handling.
Expected output: calling adapter.embed("concrete specs") returns a 1024-dimensional vector for voyage-3-lite (or 2048 for voyage-3). Calling adapter.embedBatch with 250 texts makes exactly 3 API calls (100 + 100 + 50).
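The batching arithmetic behind that 100 + 100 + 50 split can be sketched as a standalone helper (retry and backoff handling omitted; the 100-item batch size comes from the step text):

```typescript
// Split a list into fixed-size batches: 250 items at batchSize 100
// yields batches of 100, 100, and 50, i.e. three API calls.
export function toBatches<T>(items: T[], batchSize = 100): T[][] {
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += batchSize) {
    batches.push(items.slice(i, i + batchSize));
  }
  return batches;
}
```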
Step 9: Understand the hybrid retriever
The VoyageEmbeddingBridge in src/lib/retrieval.ts extends the abstract EmbeddingService from @reaatech/hybrid-rag-embedding. The provider: 'vertex' config is the extension point for plugging in a custom embedder.
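In outline, the bridge is a thin delegation layer. The abstract class below is a local stand-in that only mirrors the shape of the library's EmbeddingService; the real class from @reaatech/hybrid-rag-embedding is richer, and the method name embedQuery is an assumption.

```typescript
// Local stand-in for the abstract class exported by @reaatech/hybrid-rag-embedding.
abstract class EmbeddingService {
  abstract embedQuery(text: string): Promise<number[]>;
}

// The bridge delegates to whatever embed function it is constructed with,
// which is how the VoyageEmbeddingAdapter plugs into the hybrid retriever.
class VoyageEmbeddingBridge extends EmbeddingService {
  constructor(private readonly embed: (text: string) => Promise<number[]>) {
    super();
  }
  embedQuery(text: string): Promise<number[]> {
    return this.embed(text);
  }
}
```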
To receive answers token-by-token instead of waiting for the full response, send the Accept: text/event-stream header:
terminal
curl -X POST http://localhost:3000/api/chat \
  -H "Content-Type: application/json" \
  -H "Accept: text/event-stream" \
  -d '{"query": "What is the rebar spacing spec?"}' \
  --no-buffer
Expected output: the response streams back as Server-Sent Events. Each chunk is a plain text fragment that accumulates into the final answer.
The streaming implementation in src/lib/generation.ts uses the Anthropic SDK's messages.stream() and forwards only content_block_delta events whose delta type is text_delta.
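That event filter can be sketched with local stand-in types. These simplified shapes only mirror the SDK's stream events; the real types in @anthropic-ai/sdk carry more fields (block indexes, usage, and so on).

```typescript
// Simplified stand-ins for the Anthropic SDK's stream event shapes.
type TextDeltaEvent = {
  type: "content_block_delta";
  delta: { type: "text_delta"; text: string };
};
type StreamEvent = TextDeltaEvent | { type: "message_start" } | { type: "message_stop" };

// Yield only the text fragments, mirroring the filter the streaming
// implementation applies to messages.stream() output.
export async function* textDeltas(events: AsyncIterable<StreamEvent>): AsyncGenerator<string> {
  for await (const event of events) {
    if (event.type === "content_block_delta" && event.delta.type === "text_delta") {
      yield event.delta.text;
    }
  }
}
```

Each yielded fragment is what the route writes out as a Server-Sent Event, so the client accumulates the answer token by token.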