A conversational AI that answers employee and customer questions by searching Notion workspaces, using Perplexity's search and REAA's hybrid RAG for context-rich, cited responses.
Small businesses store institutional knowledge in Notion but struggle to find answers quickly across scattered pages, leading to repetitive questions and lost productivity.
This recipe builds a conversational AI that answers employee and customer questions by searching your Notion workspace. When a question comes in, the pipeline retrieves the most relevant Notion page chunks using Qdrant vector search, then hands the query and context to Perplexity’s pplx-70b-online model to generate a cited answer — all while keeping your combined Perplexity and embedding costs under a configurable budget.
Perplexity API key — sign up at https://perplexity.ai and create a key from your dashboard
Qdrant instance — use the cloud service or run locally with docker run -p 6333:6333 qdrant/qdrant
Step 1: Clone the scaffold and install dependencies
The project starts from a Next.js scaffold with all packages pre-installed. Before you begin, verify that the dependencies in your package.json match the versions pinned in the scaffold.
Step 2: Configure environment variables

Copy .env.example to .env.local and fill in the values. The four required variables are NOTION_TOKEN, NOTION_DATABASE_ID, PERPLEXITY_API_KEY, and QDRANT_URL. If any required variable is absent, the config loader in src/lib/config.ts throws a descriptive error naming the missing variable. Optional variables (prefixed with <optional-...> in .env.example) are safe to leave blank.
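The loader itself is not shown in this step, so here is a minimal sketch of the pattern, assuming a `loadConfig` helper and the variable names above (the real src/lib/config.ts may differ in shape):

```typescript
// Hypothetical sketch of src/lib/config.ts: validate required env vars up front.
const REQUIRED = [
  "NOTION_TOKEN",
  "NOTION_DATABASE_ID",
  "PERPLEXITY_API_KEY",
  "QDRANT_URL",
] as const;

export interface EnvConfig {
  NOTION_TOKEN: string;
  NOTION_DATABASE_ID: string;
  PERPLEXITY_API_KEY: string;
  QDRANT_URL: string;
  QDRANT_API_KEY?: string; // optional
  BUDGET_LIMIT_USD?: number; // optional
}

export function loadConfig(env: Record<string, string | undefined> = process.env): EnvConfig {
  for (const key of REQUIRED) {
    if (!env[key]) {
      // Name the missing variable so the failure is actionable.
      throw new Error(`Missing required environment variable: ${key}`);
    }
  }
  return {
    NOTION_TOKEN: env.NOTION_TOKEN!,
    NOTION_DATABASE_ID: env.NOTION_DATABASE_ID!,
    PERPLEXITY_API_KEY: env.PERPLEXITY_API_KEY!,
    QDRANT_URL: env.QDRANT_URL!,
    QDRANT_API_KEY: env.QDRANT_API_KEY,
    BUDGET_LIMIT_USD: env.BUDGET_LIMIT_USD ? Number(env.BUDGET_LIMIT_USD) : undefined,
  };
}
```

Failing fast at startup, rather than on first use, means a missing key surfaces as one clear error instead of a confusing runtime failure deep in the pipeline.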
Step 3: Create the Notion indexer
The src/lib/notion-indexer.ts module fetches pages from your Notion database, extracts their text content, and splits them into overlapping chunks for embedding.
Import the Notion client and the pagination utilities:
ts
import {
  Client,
  collectPaginatedAPI,
  iteratePaginatedAPI,
  isFullPageOrDataSource,
  APIResponseError,
} from "@notionhq/client";
import type { RichTextItemResponse } from "@notionhq/client";
The createNotionClient factory returns a configured Client instance:
ts
export function createNotionClient(token: string): Client {
  return new Client({ auth: token });
}
fetchAllDataSourcePages collects all pages from your database using collectPaginatedAPI:
extractPageContent walks the block tree of each page and concatenates plain text from paragraphs, headings, lists, and other text-bearing blocks:
ts
const TEXT_BLOCK_TYPES = new Set([
  "paragraph",
  "heading_1",
  "heading_2",
  "heading_3",
  "bulleted_list_item",
  "numbered_list_item",
  "to_do",
  "toggle",
  "quote",
  "callout",
  "code",
]);

export async function extractPageContent(client: Client, pageId: string): Promise<string> {
  const textParts: string[] = [];
  for await (const block of iteratePaginatedAPI(client.blocks.children.list, { block_id: pageId })) {
    const blockData = block as Record<string, unknown>;
    const blockType = blockData.type as string | undefined;
    if (!blockType || !TEXT_BLOCK_TYPES.has(blockType)) continue;
    const content = blockData[blockType] as Record<string, unknown> | undefined;
    if (!content) continue;
    const richText = content.rich_text as Array<Record<string, unknown>> | undefined;
    if (!richText) continue;
    const text = richText
      .map((r) => {
        const pt = r.plain_text;
        return typeof pt === "string" ? pt : "";
      })
      .join("");
    if (text) textParts.push(text);
  }
  return textParts.join("\n\n");
}
splitIntoChunks splits page text by paragraph boundaries and merges paragraphs until the chunk reaches chunkSize characters (default 1000), keeping overlap characters from the previous chunk’s tail for context continuity:
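The function itself is not shown above, so here is a minimal sketch consistent with that description (paragraph-boundary splitting, a default chunkSize of 1000, and a tail carried forward as overlap); the real implementation may differ in detail:

```typescript
// Hypothetical sketch of splitIntoChunks: merge paragraphs up to chunkSize
// characters, carrying `overlap` trailing characters into the next chunk.
export function splitIntoChunks(
  text: string,
  chunkSize = 1000,
  overlap = 100,
): string[] {
  const paragraphs = text.split(/\n\n+/).filter((p) => p.trim().length > 0);
  const chunks: string[] = [];
  let current = "";
  for (const para of paragraphs) {
    // Note: a single paragraph longer than chunkSize passes through uncut.
    if (current && current.length + para.length + 2 > chunkSize) {
      chunks.push(current);
      // Seed the next chunk with the tail of the previous one for continuity.
      current = current.slice(-overlap) + "\n\n" + para;
    } else {
      current = current ? current + "\n\n" + para : para;
    }
  }
  if (current) chunks.push(current);
  return chunks;
}
```

The overlap matters at retrieval time: a sentence that straddles a chunk boundary still appears intact in at least one chunk, so its embedding is not split across two vectors.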
Step 4: Create the embedding module

The src/lib/embedding.ts module wraps FastEmbed’s FlagEmbedding class with the BGEBaseEN model (768-dimensional embeddings). The model downloads once on first call and is cached in a module-level variable.
Import and initialize the model:
ts
import { EmbeddingModel, FlagEmbedding } from "fastembed";

let model: FlagEmbedding | null = null;

export async function createEmbeddingModel(): Promise<FlagEmbedding> {
  try {
    const m = await FlagEmbedding.init({ model: EmbeddingModel.BGEBaseEN });
    model = m;
    return m;
  } catch (error) {
    const msg = error instanceof Error ? error.message : String(error);
    console.error("[embedding] FlagEmbedding.init failed:", msg);
    throw error;
  }
}

export function getEmbeddingModel(): FlagEmbedding {
  if (!model) throw new Error("Embedding model not initialized. Call createEmbeddingModel() first.");
  return model;
}
generateEmbedding produces a single embedding for a query string. generateChunkEmbeddings batches passage embeddings through an async generator:
ts
export async function generateEmbedding(text: string): Promise<number[]> {
  const m = model ?? (await createEmbeddingModel());
  return m.queryEmbed(text);
}

export async function generateChunkEmbeddings(chunks: string[], batchSize?: number): Promise<number[][]> {
  const m = model ?? (await createEmbeddingModel());
  const bs = batchSize ?? 256;
  const generator = m.passageEmbed(chunks, bs);
  const result: number[][] = [];
  for await (const batch of generator) {
    result.push(...batch);
  }
  return result;
}
Step 5: Create the Qdrant vector store wrapper
The src/lib/qdrant-store.ts module wraps @reaatech/hybrid-rag-qdrant’s QdrantClientWrapper to handle collection creation, batch upserts, and vector search.
Create the wrapper with Cosine distance (recommended for text embeddings) and 768-dimensional vectors matching BGEBaseEN:
ts
import { QdrantClientWrapper } from "@reaatech/hybrid-rag-qdrant";
import type { RetrievalResult as HybridRetrievalResult } from "@reaatech/hybrid-rag";
import type { EnvConfig, NotionChunk, RetrievalResult } from "./types.js";

export async function createQdrantWrapper(config: EnvConfig): Promise<QdrantClientWrapper> {
  const wrapper = new QdrantClientWrapper({
    url: config.QDRANT_URL,
    apiKey: config.QDRANT_API_KEY,
    collectionName: "notion_chunks",
    vectorSize: 768,
    distance: "Cosine",
  });
  try {
    await wrapper.initialize();
  } catch (err) {
    throw new Error(`Qdrant initialization failed: ${String(err)}`);
  }
  return wrapper;
}
upsertChunksToQdrant maps each NotionChunk to a Qdrant point and calls upsertBatch. Empty arrays are a no-op:
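The mapping can be sketched as follows. The chunk fields and the upsertBatch point shape are assumptions inferred from the surrounding steps, not the library's documented API, so structural types stand in for NotionChunk and QdrantClientWrapper:

```typescript
// Hypothetical sketch of upsertChunksToQdrant.
interface ChunkLike {
  id: string;
  pageId: string;
  pageTitle: string;
  content: string;
  chunkIndex: number;
}

interface BatchUpserter {
  upsertBatch(
    points: Array<{ id: string; vector: number[]; payload: Record<string, unknown> }>,
  ): Promise<void>;
}

export async function upsertChunksToQdrant(
  wrapper: BatchUpserter,
  chunks: ChunkLike[],
  embeddings: number[][],
): Promise<void> {
  if (chunks.length === 0) return; // empty input is a no-op
  const points = chunks.map((chunk, i) => ({
    id: chunk.id,
    vector: embeddings[i],
    payload: {
      pageId: chunk.pageId,
      pageTitle: chunk.pageTitle,
      content: chunk.content,
      chunkIndex: chunk.chunkIndex,
    },
  }));
  await wrapper.upsertBatch(points);
}
```

Storing the chunk text and page metadata in the payload means a search hit carries everything needed to build a citation without a second lookup.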
Step 6: Create the session store

The src/lib/session-store.ts module provides the InMemorySessionStore that the answer generator uses to keep per-session conversation history. The Memory.content field stores a JSON string encoding the speaker and text, which the Perplexity client parses back out when building conversation history.
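A minimal in-memory store consistent with that description might look like this; the Memory shape and method names are assumptions beyond the JSON-encoded content field described above:

```typescript
// Hypothetical sketch of src/lib/session-store.ts.
import { randomUUID } from "node:crypto";

export interface Memory {
  content: string; // JSON: { speaker: "user" | "assistant", text: string }
  createdAt: number;
}

export class InMemorySessionStore {
  private sessions = new Map<string, Memory[]>();

  createSession(): string {
    const id = randomUUID();
    this.sessions.set(id, []);
    return id;
  }

  addMemory(sessionId: string, speaker: "user" | "assistant", text: string): void {
    const memories = this.sessions.get(sessionId) ?? [];
    memories.push({ content: JSON.stringify({ speaker, text }), createdAt: Date.now() });
    this.sessions.set(sessionId, memories);
  }

  getHistory(sessionId: string): Memory[] {
    return this.sessions.get(sessionId) ?? [];
  }
}
```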
Step 7: Create the budget guard
The src/services/budget-guard.ts module sets up the BudgetController from @reaatech/agent-budget-engine with a custom PerplexityPricingProvider that maps model names to USD-per-million-token rates.
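The pricing provider described above can be sketched as a small rate table plus a cost function. The interface shape and the per-million-token rates below are placeholders, not Perplexity's actual price list; substitute current pricing:

```typescript
// Hypothetical sketch of PerplexityPricingProvider. Rates are placeholders.
interface ModelRate {
  inputUsdPerMillionTokens: number;
  outputUsdPerMillionTokens: number;
}

const RATES: Record<string, ModelRate> = {
  "pplx-70b-online": { inputUsdPerMillionTokens: 1.0, outputUsdPerMillionTokens: 1.0 },
};

export class PerplexityPricingProvider {
  // Returns the USD cost of one call given the model and token counts.
  getCost(model: string, inputTokens: number, outputTokens: number): number {
    const rate = RATES[model];
    if (!rate) throw new Error(`Unknown model for pricing: ${model}`);
    return (
      (inputTokens / 1_000_000) * rate.inputUsdPerMillionTokens +
      (outputTokens / 1_000_000) * rate.outputUsdPerMillionTokens
    );
  }
}
```

Throwing on an unknown model is deliberate: a silent zero-cost default would let unpriced calls slip past the budget entirely.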
createBudgetController wires the pricing provider and a SpendStore into a BudgetController and logs hard-stop events; defineDefaultBudget registers the global spending limit:
ts
import { BudgetController } from "@reaatech/agent-budget-engine";
import type { PricingProvider as EnginePricingProvider } from "@reaatech/agent-budget-engine";
import { BudgetScope } from "@reaatech/agent-budget-types";
import { SpendStore } from "@reaatech/agent-budget-spend-tracker";
import { randomUUID } from "node:crypto";

export { BudgetScope, SpendStore };

export function createBudgetController(pricingProvider: EnginePricingProvider): BudgetController {
  const controller = new BudgetController({
    spendTracker: new SpendStore(),
    pricing: pricingProvider,
  });
  controller.on("hard-stop", (event: unknown) => {
    console.warn("[budget] hard-stop triggered", event);
  });
  return controller;
}

export function defineDefaultBudget(controller: BudgetController, limitUsd: number): void {
  controller.defineBudget({
    scopeType: BudgetScope.User,
    scopeKey: "*",
    limit: limitUsd,
    policy: { softCap: 0.8, hardCap: 1.0, autoDowngrade: [], disableTools: [] },
  });
}
checkAndRecordBudget performs a pre-flight check using controller.check() and conditionally records the actual spend with controller.record():
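That check-then-record flow can be sketched as follows, with structural types standing in for the engine's real check() and record() signatures (which are assumptions here):

```typescript
// Hypothetical sketch of checkAndRecordBudget.
interface BudgetControllerLike {
  check(req: { scopeKey: string; estimatedCostUsd: number }): Promise<{ allowed: boolean }>;
  record(req: { scopeKey: string; costUsd: number }): Promise<void>;
}

export async function checkAndRecordBudget(
  controller: BudgetControllerLike,
  scopeKey: string,
  estimatedCostUsd: number,
  runCall: () => Promise<{ actualCostUsd: number }>,
): Promise<{ ok: boolean; actualCostUsd?: number }> {
  // Pre-flight: refuse the call if the estimated spend would breach the budget.
  const verdict = await controller.check({ scopeKey, estimatedCostUsd });
  if (!verdict.allowed) return { ok: false };
  const { actualCostUsd } = await runCall();
  // Record what was actually spent, not the estimate.
  await controller.record({ scopeKey, costUsd: actualCostUsd });
  return { ok: true, actualCostUsd };
}
```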
Step 8: Create the Perplexity client

The src/lib/perplexity-client.ts module calls the Perplexity chat completions endpoint with the retrieved context chunks and session history injected as messages.
Create the client:
ts
import Perplexity, { ChatCompletionsPostRequestModelEnum, ChatCompletionsPostRequest } from "perplexity-sdk";
import type { RetrievalResult } from "./types.js";

export function createPerplexityClient(apiKey: string) {
  return new Perplexity({ apiKey }).client();
}
generateAnswer builds a system prompt with the context chunks, prepends the parsed session history, and appends the user’s query:
ts
export async function generateAnswer(
  client: ReturnType<typeof createPerplexityClient>,
  query: string,
  contextChunks: RetrievalResult[],
  sessionHistory?: Array<{ content: string }>,
): Promise<{
  answer: string;
  citations: Array<{ sourceTitle: string; excerpt: string; relevanceScore: number }>;
  usage: { inputTokens: number; outputTokens: number };
}> {
  const contextBlock =
    contextChunks.length > 0
      ? "Context chunks:\n" + contextChunks.map((c, i) => `${String(i + 1)}. ${c.content}`).join("\n")
      : "";
  const systemPrompt =
    "You are a knowledge agent. Answer the user's question using the context chunks below. " +
    "Cite each source by its title and include a short excerpt. Output valid JSON with an " +
    "'answer' field and a 'citations' array of {sourceTitle, excerpt, relevanceScore}.";
  const systemContent = contextBlock ? `${systemPrompt}\n\n${contextBlock}` : systemPrompt;
  const rawMessages: Array<{ role: string; content: string }> = [
    { role: "system", content: systemContent },
  ];
  if (sessionHistory) {
    for (const mem of sessionHistory) {
      try {
        const parsed = JSON.parse(mem.content) as { speaker: string; text: string };
        rawMessages.push({
          role: parsed.speaker === "user" ? "user" : "assistant",
          content: parsed.text,
        });
      } catch {
        rawMessages.push({ role: "user", content: mem.content });
      }
    }
  }
  rawMessages.push({ role: "user", content: query });

  const request = new ChatCompletionsPostRequest();
  request.model = ChatCompletionsPostRequestModelEnum.Pplx70bOnline;
  request.messages = rawMessages;

  try {
    const result = await client.chatCompletionsPost(request);
    const inputTokens = result.usage?.promptTokens ?? 0;
    const outputTokens = result.usage?.completionTokens ?? 0;
    const rawContent = result.choices?.[0]?.message?.content ?? "";
    let answer: string;
    let citations: Array<{ sourceTitle: string; excerpt: string; relevanceScore: number }> = [];
    try {
      const parsed: unknown = JSON.parse(rawContent);
      const parsedObj = parsed as Record<string, unknown>;
      // Prefer the parsed answer field; fall back to the raw content if absent.
      answer = typeof parsedObj.answer === "string" ? parsedObj.answer : rawContent;
      if (Array.isArray(parsedObj.citations)) {
        citations = parsedObj.citations.map((c: unknown) => {
          const cit = c as Record<string, unknown>;
          const st = cit.sourceTitle;
          const ex = cit.excerpt;
          const rs = cit.relevanceScore;
          return {
            sourceTitle: typeof st === "string" ? st : "",
            excerpt: typeof ex === "string" ? ex : "",
            relevanceScore: typeof rs === "number" ? rs : 0,
          };
        });
      }
    } catch {
      answer = rawContent;
    }
    return { answer, citations, usage: { inputTokens, outputTokens } };
  } catch (err) {
    throw new Error(`Perplexity API error: ${String(err)}`);
  }
}
Step 9: Add Zod validation and JSON repair
The src/lib/schemas.ts module defines request/response schemas using Zod and provides a validateAndRepairJson function that handles malformed JSON from the model.
Define the schemas:
ts
import { z } from "zod";
import type { Citation } from "./types.js";

export const CitationSchema = z.object({
  sourceTitle: z.string(),
  excerpt: z.string(),
  relevanceScore: z.number().min(0).max(1),
});

export const PerplexityJsonResponseSchema = z.object({
  answer: z.string(),
  citations: z.array(CitationSchema),
});

export const ChatRequestSchema = z.object({
  query: z.string().min(1).max(4000),
  sessionId: z.string().optional(),
  maxTokens: z.number().int().min(1).max(4096).optional(),
});
validateAndRepairJson attempts direct parse, then applies three repair strategies — strip trailing commas, escape unescaped quotes, and close open braces — before falling back to regex extraction of the answer field:
ts
function tryParse(candidate: string): { answer: string; citations: Citation[] } | null {
  try {
    const parsed: unknown = JSON.parse(candidate);
    return PerplexityJsonResponseSchema.parse(parsed);
  } catch {
    return null;
  }
}

// Each strategy produces an independent candidate; chaining them would let one
// strategy (e.g. quote escaping) corrupt another's otherwise-valid repair.
function repairCandidates(raw: string): string[] {
  const candidates: string[] = [];
  // Strategy 1: strip trailing commas before ] or }.
  const noTrailingCommas = raw.replace(/,(\s*[\]}])/g, "$1");
  candidates.push(noTrailingCommas);
  // Strategy 2: escape quotes that leave the string with unbalanced pairing.
  candidates.push(noTrailingCommas.replace(/(?<!\\)"(?=(?:[^"]*"[^"]*")*[^"]*$)/g, '\\"'));
  // Strategy 3: close any unclosed braces.
  let openBraces = 0;
  for (const ch of noTrailingCommas) {
    if (ch === "{") openBraces++;
    if (ch === "}") openBraces--;
  }
  if (openBraces > 0) candidates.push(noTrailingCommas + "}".repeat(openBraces));
  return candidates;
}

export function validateAndRepairJson(raw: string): {
  answer: string;
  citations: Citation[];
} {
  const direct = tryParse(raw);
  if (direct) return direct;
  for (const candidate of repairCandidates(raw)) {
    const parsed = tryParse(candidate);
    if (parsed) return parsed;
  }
  // Last resort: pull the answer field out with a regex.
  const match = raw.match(/"answer"\s*:\s*"([^"]*)"/);
  return { answer: match ? match[1] : raw, citations: [] };
}
Step 10: Assemble the answer generator service
The src/services/answer-generator.ts module orchestrates the full pipeline: validation, session loading, embedding, retrieval, budget pre-flight, Perplexity call, JSON repair, spend recording, and response assembly.
The AnswerGenerator class holds all dependencies as constructor arguments:
ts
import type { QdrantClientWrapper } from "@reaatech/hybrid-rag-qdrant";
import { config } from "../lib/config.js";
import { ChatRequestSchema, validateAndRepairJson } from "../lib/schemas.js";
import { generateEmbedding } from "../lib/embedding.js";
import { createQdrantWrapper, searchSimilarChunks } from "../lib/qdrant-store.js";
import { InMemorySessionStore } from "../lib/session-store.js";
import type { BudgetController } from "@reaatech/agent-budget-engine";
import { createPerplexityClient, generateAnswer } from "../lib/perplexity-client.js";
import {
  createBudgetController,
  defineDefaultBudget,
  checkAndRecordBudget,
  PerplexityPricingProvider,
  DEFAULT_PERPLEXITY_MODEL,
} from "./budget-guard.js";
import { createTrace } from "./observability.js";
import type { ChatResponse } from "../lib/types.js";

export class AnswerGenerator {
  private qdrantWrapper: QdrantClientWrapper;
  private sessionStore: InMemorySessionStore;
  private perplexityClient: ReturnType<typeof createPerplexityClient>;
  private pricingProvider: PerplexityPricingProvider;
  private budgetController: BudgetController;
  private initialized: boolean = false;

  constructor(deps: {
    qdrantWrapper: QdrantClientWrapper;
    sessionStore: InMemorySessionStore;
    perplexityClient: ReturnType<typeof createPerplexityClient>;
    pricingProvider: PerplexityPricingProvider;
    budgetController: BudgetController;
  }) {
    this.qdrantWrapper = deps.qdrantWrapper;
    this.sessionStore = deps.sessionStore;
    this.perplexityClient = deps.perplexityClient;
    this.pricingProvider = deps.pricingProvider;
    this.budgetController = deps.budgetController;
  }
The generateAnswer method runs the full pipeline. On budget exhaustion, it returns early with a specific message rather than hitting the API:
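The control flow can be condensed into a sketch where every dependency is injected as a function, which makes the budget early-return visible on its own. All names, shapes, and the placeholder cost estimate below are assumptions, not the module's real signatures:

```typescript
// Hypothetical, condensed sketch of the pipeline inside generateAnswer.
export async function runPipeline(
  deps: {
    validate: (body: unknown) => { query: string; sessionId?: string };
    loadHistory: (sessionId: string) => Array<{ content: string }>;
    embed: (q: string) => Promise<number[]>;
    retrieve: (v: number[]) => Promise<Array<{ content: string }>>;
    budgetCheck: (estimatedUsd: number) => Promise<boolean>;
    callModel: (
      q: string,
      ctx: Array<{ content: string }>,
      history: Array<{ content: string }>,
    ) => Promise<{ answer: string; costUsd: number }>;
    recordSpend: (usd: number) => Promise<void>;
  },
  body: unknown,
): Promise<{ answer: string }> {
  const { query, sessionId } = deps.validate(body);
  const history = sessionId ? deps.loadHistory(sessionId) : [];
  const vector = await deps.embed(query);
  const context = await deps.retrieve(vector);
  // Budget pre-flight: return early instead of hitting the API.
  // 0.01 is a placeholder per-call estimate.
  if (!(await deps.budgetCheck(0.01))) {
    return { answer: "Budget exhausted. Please try again later." };
  }
  const { answer, costUsd } = await deps.callModel(query, context, history);
  await deps.recordSpend(costUsd);
  return { answer };
}
```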
The module exports an async singleton getter that lazy-initializes all dependencies on first use:
ts
let singleton: AnswerGenerator | null = null;

export async function getAnswerGenerator(): Promise<AnswerGenerator> {
  if (!singleton) {
    const pricingProvider = new PerplexityPricingProvider();
    const budgetController = createBudgetController(pricingProvider);
    const qdrantWrapper = await createQdrantWrapper(config);
    const sessionStore = new InMemorySessionStore();
    const perplexityClient = createPerplexityClient(config.PERPLEXITY_API_KEY);
    singleton = new AnswerGenerator({
      qdrantWrapper,
      sessionStore,
      perplexityClient,
      pricingProvider,
      budgetController,
    });
  }
  return singleton;
}
Step 11: Create the chat API route
The src/app/api/chat/route.ts module exposes the POST /api/chat endpoint using Next.js App Router conventions (route handlers live under the app directory). It validates the request body with Zod, calls the AnswerGenerator, and returns the response with an optional sessionId.
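A sketch of the handler's logic follows. The generator call is injected and validation is inlined so the example stands alone; the real route imports ChatRequestSchema and getAnswerGenerator from the earlier steps, and App Router handlers may return standard Response objects as shown:

```typescript
// Hypothetical sketch of the POST handler for the chat route. In the real
// route file, `generate` is `(await getAnswerGenerator()).generateAnswer`.
export function makeChatHandler(
  generate: (body: { query: string; sessionId?: string }) => Promise<unknown>,
) {
  return async function POST(request: Request): Promise<Response> {
    let body: unknown;
    try {
      body = await request.json();
    } catch {
      return Response.json({ error: "Invalid JSON body" }, { status: 400 });
    }
    const { query, sessionId } = body as { query?: unknown; sessionId?: unknown };
    // Mirrors the Zod constraints from Step 9: non-empty, at most 4000 chars.
    if (typeof query !== "string" || query.length === 0 || query.length > 4000) {
      return Response.json({ error: "query must be a non-empty string" }, { status: 400 });
    }
    const result = await generate({
      query,
      sessionId: typeof sessionId === "string" ? sessionId : undefined,
    });
    return Response.json(result);
  };
}
```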
Step 12: Initialize observability

The src/services/observability.ts module initializes Langfuse if the public and secret keys are present in the environment, otherwise it stores a no-op stub:
Step 13: Create the Notion indexing job

The src/cron/index.ts module exports runNotionIndexingJob, which fetches all Notion pages, generates embeddings, and upserts them into Qdrant. The job runs on server startup via instrumentation.ts:
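The job's shape can be sketched with its dependencies injected, which keeps the loop visible without the real modules; in src/cron/index.ts the dependencies are the functions built in Steps 3 to 5, and the point id format here is an illustration only:

```typescript
// Hypothetical, dependency-injected sketch of runNotionIndexingJob.
export async function runNotionIndexingJob(deps: {
  listPageIds: () => Promise<string[]>;
  extractText: (pageId: string) => Promise<string>;
  split: (text: string) => string[];
  embed: (chunks: string[]) => Promise<number[][]>;
  upsert: (
    points: Array<{ id: string; pageId: string; content: string; chunkIndex: number }>,
    embeddings: number[][],
  ) => Promise<void>;
}): Promise<number> {
  let indexed = 0;
  for (const pageId of await deps.listPageIds()) {
    const text = await deps.extractText(pageId);
    const chunks = deps.split(text);
    if (chunks.length === 0) continue; // nothing to index on empty pages
    const embeddings = await deps.embed(chunks);
    await deps.upsert(
      chunks.map((content, i) => ({ id: `${pageId}:${i}`, pageId, content, chunkIndex: i })),
      embeddings,
    );
    indexed += chunks.length;
  }
  console.info(`[cron] indexing complete: ${indexed} chunks`);
  return indexed;
}
```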
Step 14: Configure Next.js instrumentation for startup
The src/instrumentation.ts file runs once when the Next.js server starts in Node.js environments. It guards against Edge runtime and uses dynamic imports to keep Node-only modules out of the Edge bundle:
next.config.ts must have experimental.instrumentationHook: true for this file to be picked up. The scaffold already has this configured:
ts
import type { NextConfig } from "next";

const nextConfig = {
  experimental: {
    instrumentationHook: true,
  },
} as NextConfig;

export default nextConfig;
Step 15: Run the application
Start the development server:
terminal
pnpm dev
On boot, instrumentation.ts runs initObservability() and then calls runNotionIndexingJob() to perform the initial indexing pass — your Notion pages are fetched, chunked, embedded, and stored in Qdrant before the server begins accepting requests.
Send a question to the chat API:
terminal
curl -X POST http://localhost:3000/api/chat \
  -H "Content-Type: application/json" \
  -d '{"query": "What is our PTO policy?"}'
Expected output: A JSON response with answer, citations, confidence, sessionId, and usage fields:
json
{
  "answer": "According to the HR Handbook...",
  "citations": [
    {
      "sourceTitle": "HR Handbook 2024",
      "excerpt": "All full-time employees receive 20 days of paid time off per year...",
      "relevanceScore": 0.89
    }
  ],
  "confidence": 0.89,
  "sessionId": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "usage": {
    "perplexityTokens": 324,
    "embeddingTokens": 12,
    "totalCost": 0.000324
  }
}
Next steps
Add a recurring cron schedule using the CRON_SCHEDULE env var to re-index pages daily as your Notion workspace changes.
Connect the observability module to a hosted Langfuse instance for production tracing by setting LANGFUSE_PUBLIC_KEY and LANGFUSE_SECRET_KEY.
Replace the in-memory session store with a Redis-backed adapter from @reaatech/agent-memory-storage to persist conversations across server restarts.
Tune the budget limit by adjusting BUDGET_LIMIT_USD — the default is $5.00 with an 80% soft cap and a hard cap at 100%.