Cohere Knowledge Agent for PostgreSQL-Backed Financial Analytics

A conversational knowledge agent that answers financial questions by querying a PostgreSQL-backed knowledge base with semantic caching.

cohere knowledge-agent financial-analytics postgresql semantic-caching express rag nextjs

The problem

SMB finance teams waste hours querying SQL databases manually for transaction details, cash flow patterns, and expense trends. Existing BI tools require technical expertise, leaving non-technical staff reliant on ad-hoc data pulls.

Built from

Intro

This tutorial builds a conversational knowledge agent powered by Cohere that answers financial questions by querying a PostgreSQL-backed knowledge base with semantic caching. The agent understands natural language questions about transactions, expenses, and cash flow — letting small business owners ask “How much did I spend on office supplies last month?” and get an answer from their financial data without writing SQL.

You’ll build it inside a Next.js project shell, with an Express API server backed by pgvector for vector search, fastembed for embeddings, and several REAA packages that handle multi-turn memory, confidence-based routing, semantic caching, and structured output repair. A simple Next.js chat UI at app/page.tsx ships with the scaffold so you can try the API right away.
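
The pieces named above can be pictured as one request path: check the semantic cache, embed the question, retrieve similar transactions, generate an answer, then cache it. The following is only a rough sketch of that flow. `AgentDeps`, `AgentReply`, and `answerQuestion` are hypothetical names introduced here for illustration, and the sketch leaves out multi-turn memory, confidence routing, and output repair.

```typescript
// Hypothetical dependency shapes for illustration only; the tutorial's real
// services (cache, embeddings, database, Cohere client) are separate modules.
interface AgentDeps {
  cacheGet(query: string): Promise<string | undefined>;
  cacheSet(query: string, answer: string): Promise<void>;
  embed(text: string): Promise<number[]>;
  search(embedding: number[], limit: number): Promise<string[]>;
  generate(context: string, question: string): Promise<string>;
}

interface AgentReply {
  answer: string;
  fromCache: boolean;
}

async function answerQuestion(
  deps: AgentDeps,
  question: string,
): Promise<AgentReply> {
  // 1. A cache hit skips retrieval and generation entirely.
  const cached = await deps.cacheGet(question);
  if (cached !== undefined) {
    return { answer: cached, fromCache: true };
  }

  // 2. Embed the question and fetch the closest stored transactions.
  const embedding = await deps.embed(question);
  const rows = await deps.search(embedding, 5);

  // 3. Generate an answer grounded in the retrieved rows, then cache it.
  const answer = await deps.generate(rows.join("\n"), question);
  await deps.cacheSet(question, answer);
  return { answer, fromCache: false };
}
```

Passing the dependencies in as an object keeps the flow testable with in-memory fakes, which is one reasonable way to exercise it without a database or API key.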

Prerequisites

Node.js 22+ and pnpm 10+
A Cohere API key (from the Cohere dashboard)
A PostgreSQL instance with the pgvector extension enabled
A Langfuse account (optional, for observability)
Familiarity with TypeScript and Express middleware patterns
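
The connection string and Langfuse keys from this list usually reach the app through environment variables. Below is a minimal sketch of a loader that validates them once at startup. The field names mirror the `config.*` properties used by the code listings in this tutorial, but `loadConfig`, `numberFrom`, and both default values (0.85 and 3600) are assumptions made for illustration, not the tutorial's actual configuration module. The Cohere key is not handled here.

```typescript
// Field names follow the config.* properties referenced elsewhere in this
// tutorial; the defaults below are illustrative assumptions.
interface AppConfig {
  DATABASE_URL: string;
  SIMILARITY_THRESHOLD: number;
  CACHE_TTL_DEFAULT: number;
  LANGFUSE_PUBLIC_KEY: string | undefined;
  LANGFUSE_SECRET_KEY: string | undefined;
  LANGFUSE_HOST: string | undefined;
}

type Env = Record<string, string | undefined>;

// Parse an optional numeric variable, falling back when it is unset.
function numberFrom(
  raw: string | undefined,
  fallback: number,
  name: string,
): number {
  if (raw === undefined || raw === "") return fallback;
  const value = Number(raw);
  if (!Number.isFinite(value)) {
    throw new Error(`${name} must be a number, got "${raw}"`);
  }
  return value;
}

function loadConfig(env: Env): AppConfig {
  const databaseUrl = env["DATABASE_URL"];
  if (!databaseUrl) {
    throw new Error("DATABASE_URL is required");
  }
  return {
    DATABASE_URL: databaseUrl,
    // Assumed default: cosine similarity needed for a cache hit.
    SIMILARITY_THRESHOLD: numberFrom(
      env["SIMILARITY_THRESHOLD"],
      0.85,
      "SIMILARITY_THRESHOLD",
    ),
    // Assumed default and unit (seconds).
    CACHE_TTL_DEFAULT: numberFrom(
      env["CACHE_TTL_DEFAULT"],
      3600,
      "CACHE_TTL_DEFAULT",
    ),
    // Langfuse is optional, so these may stay undefined.
    LANGFUSE_PUBLIC_KEY: env["LANGFUSE_PUBLIC_KEY"],
    LANGFUSE_SECRET_KEY: env["LANGFUSE_SECRET_KEY"],
    LANGFUSE_HOST: env["LANGFUSE_HOST"],
  };
}
```

Taking the environment as a parameter rather than reading `process.env` inside the function makes the loader easy to test with plain objects.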

Step 1: Scaffold the project and install dependencies

The project starts from a Next.js 16 scaffold with the App Router. Create the project structure and install all dependencies:

terminal

mkdir cohere-financial-agent && cd cohere-financial-agent
 
# Initialize package.json
pnpm init

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.


185 kB·88 tests·98.6% coverage·vitest passing

SHA-256: 1577d2790a7e5121894db598ad33515aeff6eb5cbdf36c7a8d02d6ded7f9ec45


import pg from "pg";
import pgvector from "pgvector/pg";
import { drizzle } from "drizzle-orm/node-postgres";
import {
  pgTable,
  serial,
  text,
  decimal,
  timestamp,
  vector,
} from "drizzle-orm/pg-core";
import { l2Distance, eq } from "drizzle-orm";
import { withRetry } from "@reaatech/agent-memory-core";
import { config } from "../lib/config.js";

export const financialTransactions = pgTable("financial_transactions", {
  id: serial("id").primaryKey(),
  description: text("description"),
  amount: decimal("amount"),
  category: text("category"),
  date: timestamp("date"),
  embedding: vector("embedding", { dimensions: 384 }),
});

export const conversationHistory = pgTable("conversation_history", {
  id: serial("id").primaryKey(),
  sessionId: text("session_id").notNull(),
  role: text("role").notNull(),
  content: text("content").notNull(),
  createdAt: timestamp("created_at").defaultNow(),
});

const pool = new pg.Pool({
  connectionString: config.DATABASE_URL,
  onConnect(client) {
    void pgvector.registerTypes(client);
  },
});

export const db = drizzle(pool);

export async function searchSimilarTransactions(
  embedding: number[],
  limit?: number,
) {
  return withRetry(() =>
    db
      .select()
      .from(financialTransactions)
      .orderBy(l2Distance(financialTransactions.embedding, embedding))
      .limit(limit ?? 5),
  );
}

export async function storeConversation(
  sessionId: string,
  role: string,
  content: string,
) {
  await withRetry(() =>
    db.insert(conversationHistory).values({ sessionId, role, content }),
  );
}

export async function getConversationHistory(
  sessionId: string,
  limit?: number,
) {
  return withRetry(() =>
    db
      .select()
      .from(conversationHistory)
      .where(eq(conversationHistory.sessionId, sessionId))
      .orderBy(conversationHistory.createdAt)
      .limit(limit ?? 20),
  );
}

export async function insertTransaction(record: {
  description?: string;
  amount?: string;
  category?: string;
  date?: Date;
  embedding?: number[];
}) {
  await withRetry(() => db.insert(financialTransactions).values(record));
}

export { pool };
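
To make the `l2Distance` ordering in `searchSimilarTransactions` concrete, here is a small in-memory sketch of the same idea: compute the Euclidean (L2) distance from the query vector to each stored vector and keep the closest few. `EmbeddedRow`, `l2`, and `nearest` are hypothetical names, and the two-dimensional vectors in the usage note stand in for the 384-dimensional embeddings. In the real module this ranking happens inside PostgreSQL through pgvector, not in application code.

```typescript
// Hypothetical row shape for illustration only.
interface EmbeddedRow {
  id: number;
  description: string;
  embedding: number[];
}

// Euclidean (L2) distance between two vectors of equal length.
function l2(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("vectors must have the same number of dimensions");
  }
  let sum = 0;
  a.forEach((value, i) => {
    const diff = value - (b[i] ?? 0);
    sum += diff * diff;
  });
  return Math.sqrt(sum);
}

// Return the `limit` rows closest to `query`, nearest first.
function nearest(
  rows: EmbeddedRow[],
  query: number[],
  limit = 5,
): EmbeddedRow[] {
  return [...rows]
    .sort((x, y) => l2(x.embedding, query) - l2(y.embedding, query))
    .slice(0, limit);
}
```

For example, with rows at `[3, 4]`, `[1, 0]`, and `[0, 2]` and a query of `[0, 0]`, `nearest(rows, [0, 0], 2)` returns the rows at `[1, 0]` and `[0, 2]`, in that order, because their distances (1 and 2) are smaller than 5.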

import {
  Memory,
  MemoryType,
  MemoryImportance,
  MemorySource,
  MemoryLifecycle,
  setLogger,
  getLogger,
  type ConversationTurn,
} from "@reaatech/agent-memory-core";
import {
  storeConversation,
  getConversationHistory,
} from "../services/database.js";

export { setLogger, getLogger };

export async function getSessionContext(
  sessionId: string,
): Promise<ConversationTurn[]> {
  const rows = await getConversationHistory(sessionId);
  return rows.map((row) => ({
    speaker: row.role === "user" ? ("user" as const) : ("agent" as const),
    content: row.content,
    timestamp: row.createdAt ?? new Date(),
  }));
}

export async function addTurn(
  sessionId: string,
  role: string,
  content: string,
): Promise<void> {
  await storeConversation(sessionId, role, content);
  const memory: Memory = {
    id: crypto.randomUUID(),
    tenantId: "default",
    ownerId: sessionId,
    content,
    type: MemoryType.EPISODIC,
    category: "conversation",
    source:
      role === "user"
        ? MemorySource.USER_STATEMENT
        : MemorySource.AGENT_INFERENCE,
    importance: MemoryImportance.MEDIUM,
    confidence: 1,
    tags: ["conversation", role],
    lifecycle: MemoryLifecycle.ACTIVE,
    createdAt: new Date(),
    updatedAt: new Date(),
    lastAccessedAt: new Date(),
    embeddings: {
      vector: [],
      model: "BGEBaseEN",
      dimensions: 384,
    },
    version: 1,
    history: [],
  };
  getLogger().info(`Memory turn added for session ${sessionId}`, {
    role,
    memoryId: memory.id,
  });
}

export async function clearSession(sessionId: string): Promise<void> {
  const logger = getLogger();
  logger.info(`Clearing session ${sessionId}`);
  const turns = await getConversationHistory(sessionId);
  for (const turn of turns) {
    const memory: Memory = {
      id: crypto.randomUUID(),
      tenantId: "default",
      ownerId: sessionId,
      content: turn.content,
      type: MemoryType.EPISODIC,
      category: "conversation",
      source:
        turn.role === "user"
          ? MemorySource.USER_STATEMENT
          : MemorySource.AGENT_INFERENCE,
      importance: MemoryImportance.LOW,
      confidence: 1,
      tags: ["conversation", turn.role],
      lifecycle: MemoryLifecycle.FORGOTTEN,
      createdAt: turn.createdAt ?? new Date(),
      updatedAt: new Date(),
      lastAccessedAt: new Date(),
      embeddings: {
        vector: [],
        model: "BGEBaseEN",
        dimensions: 384,
      },
      version: 1,
      history: [],
    };
    logger.info(`Memory marked as forgotten for session ${sessionId}`, {
      memoryId: memory.id,
    });
  }
}
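
One plausible use of the turns returned by `getSessionContext` is to fold the most recent ones into the context string handed to the model. The sketch below assumes that usage. `Turn` is a local stand-in that mirrors only the `speaker`, `content`, and `timestamp` fields shown above, and `buildHistoryContext` is a hypothetical helper, not part of `@reaatech/agent-memory-core`.

```typescript
// Local stand-in mirroring the fields produced by getSessionContext above.
interface Turn {
  speaker: "user" | "agent";
  content: string;
  timestamp: Date;
}

// Format the last `maxTurns` turns as a plain-text transcript, oldest first.
function buildHistoryContext(turns: Turn[], maxTurns = 6): string {
  if (maxTurns <= 0) return "";
  return turns
    .slice(-maxTurns)
    .map((turn) => {
      const label = turn.speaker === "user" ? "User" : "Agent";
      return `${label}: ${turn.content}`;
    })
    .join("\n");
}
```

Capping the number of turns keeps the prompt bounded as a session grows. The cap of six is an arbitrary illustrative value.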

import {
  CacheEngine,
  InMemoryAdapter,
  buildPromptHash,
  buildCacheFingerprint,
  buildExactMatchKey,
  type CacheResult,
  type CacheEntry,
  type InvalidateResult,
} from "@reaatech/llm-cache";
import { embedBatch } from "./embedding.js";
import { config } from "../lib/config.js";

class FastembedEmbedder {
  async embed(text: string): Promise<number[]> {
    const results = await embedBatch([text]);
    return results[0];
  }

  async embedBatch(texts: string[]): Promise<number[][]> {
    return embedBatch(texts);
  }
}

const cache = new CacheEngine({
  storage: new InMemoryAdapter(),
  vectorStorage: new InMemoryAdapter(),
  embedder: new FastembedEmbedder(),
  config: {
    storage: { adapter: "memory" },
    vectorStorage: { adapter: "memory" },
    embedding: {
      provider: "openai",
      model: "BGEBaseEN",
      dimensions: 384,
      batchSize: 50,
      maxRetries: 3,
    },
    similarity: {
      threshold: config.SIMILARITY_THRESHOLD,
      metric: "cosine" as const,
      maxResults: 10,
    },
    ttl: {
      default: config.CACHE_TTL_DEFAULT,
      factual: config.CACHE_TTL_DEFAULT,
      creative: config.CACHE_TTL_DEFAULT,
      analytical: config.CACHE_TTL_DEFAULT,
      sensitive: config.CACHE_TTL_DEFAULT,
      byUseCase: {},
    },
    segmentation: { enabled: true, defaultUseCase: "general" },
    cost: { enabled: false, currency: "USD" },
    observability: { metrics: false, tracing: false, logging: "info" as const },
  },
});

const cacheOptions = {
  useCase: "financial-analytics" as const,
  model: "command-a-03-2025" as const,
  modelVersion: "command-a-03-2025" as const,
};

export async function cacheGet(query: string): Promise<CacheResult> {
  return cache.get(query, cacheOptions);
}

export async function cacheSet(
  query: string,
  response: unknown,
): Promise<CacheEntry> {
  return cache.set(query, response, cacheOptions);
}

export async function cacheInvalidate(
  olderThan?: Date,
): Promise<InvalidateResult> {
  return cache.invalidate({ olderThan });
}

export { buildPromptHash, buildCacheFingerprint, buildExactMatchKey };
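
The `similarity.threshold` and `ttl` settings above imply a lookup rule along these lines: a stored answer is reused when the new query's embedding is close enough (by cosine similarity) to a cached query's embedding and the entry has not expired. The sketch below illustrates that rule in isolation. It is not how `@reaatech/llm-cache` is implemented; `TinySemanticCache`, `cosine`, and the millisecond TTL unit are assumptions made for the example.

```typescript
interface SemanticEntry {
  embedding: number[];
  response: string;
  expiresAt: number; // epoch milliseconds
}

// Cosine similarity; returns 0 when either vector has zero length.
function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  a.forEach((value, i) => {
    const other = b[i] ?? 0;
    dot += value * other;
    normA += value * value;
    normB += other * other;
  });
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

class TinySemanticCache {
  private readonly entries: SemanticEntry[] = [];
  private readonly threshold: number;
  private readonly ttlMs: number;

  constructor(threshold: number, ttlMs: number) {
    this.threshold = threshold;
    this.ttlMs = ttlMs;
  }

  // `now` is passed in explicitly so behaviour is deterministic in tests.
  set(embedding: number[], response: string, now: number): void {
    this.entries.push({ embedding, response, expiresAt: now + this.ttlMs });
  }

  // Return the best unexpired match at or above the threshold, if any.
  get(embedding: number[], now: number): string | undefined {
    let best: SemanticEntry | undefined;
    let bestScore = this.threshold;
    for (const entry of this.entries) {
      if (entry.expiresAt <= now) continue;
      const score = cosine(entry.embedding, embedding);
      if (score >= bestScore) {
        best = entry;
        bestScore = score;
      }
    }
    return best?.response;
  }
}
```

A higher threshold trades hit rate for precision: near-duplicate phrasings still match, while questions that merely share a topic fall through to retrieval and generation.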

import { CohereClientV2, CohereError, CohereTimeoutError } from "cohere-ai";

const cohere = new CohereClientV2({});

// Return the text of the first content item, or the string content as-is.
function extractContent(
  content: string | { type?: string; text?: string }[] | null | undefined,
): string {
  if (typeof content === "string") return content;
  if (Array.isArray(content) && content.length > 0) {
    const first = content[0];
    return first.text ?? "";
  }
  return "";
}

export async function generateAnswer(
  context: string,
  question: string,
): Promise<string> {
  try {
    const response = await cohere.chat({
      model: "command-a-03-2025",
      messages: [
        {
          role: "system",
          content: `You are a financial analytics assistant. Use the following context to answer the user's question accurately. Context: ${context}`,
        },
        {
          role: "user",
          content: question,
        },
      ],
    });
    const message = response.message;
    const msgContent = message.content;
    return extractContent(msgContent);
  } catch (cause) {
    if (cause instanceof CohereTimeoutError) {
      throw new Error(`Cohere request timed out: ${cause.message}`);
    }
    if (cause instanceof CohereError) {
      throw new Error(
        `Cohere API error (status ${String(cause.statusCode ?? "unknown")}): ${cause.message}`,
      );
    }
    throw cause;
  }
}

export async function* generateAnswerStream(
  context: string,
  question: string,
): AsyncGenerator<string, void, undefined> {
  let stream;
  try {
    stream = await cohere.chatStream({
      model: "command-a-03-2025",
      messages: [
        {
          role: "system",
          content: `You are a financial analytics assistant. Use the following context to answer the user's question accurately. Context: ${context}`,
        },
        {
          role: "user",
          content: question,
        },
      ],
    });
  } catch (cause) {
    if (cause instanceof CohereTimeoutError) {
      throw new Error(`Cohere request timed out: ${cause.message}`);
    }
    if (cause instanceof CohereError) {
      throw new Error(
        `Cohere API error (status ${String(cause.statusCode ?? "unknown")}): ${cause.message}`,
      );
    }
    throw cause;
  }
  for await (const chatEvent of stream) {
    if (chatEvent.type === "content-delta") {
      // In the v2 streaming API, delta.message is an object; the streamed
      // text sits at delta.message.content.text.
      const text = chatEvent.delta?.message?.content?.text;
      if (typeof text === "string") {
        yield text;
      }
    }
  }
}

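A caller consumes `generateAnswerStream` with `for await`, forwarding each chunk as it arrives and usually keeping the full text for caching or tracing. The sketch below shows that pattern against a stand-in generator so it runs without an API key. `collectStream` and `fakeAnswerStream` are hypothetical helpers, and the streamed text is made-up sample output.

```typescript
// Drain any async stream of text chunks, optionally forwarding each chunk
// (for example to an HTTP response), and return the concatenated text.
async function collectStream(
  chunks: AsyncIterable<string>,
  onChunk?: (chunk: string) => void,
): Promise<string> {
  let full = "";
  for await (const chunk of chunks) {
    onChunk?.(chunk);
    full += chunk;
  }
  return full;
}

// Stand-in with the same generator signature as generateAnswerStream;
// the content is invented sample text.
async function* fakeAnswerStream(): AsyncGenerator<string, void, undefined> {
  yield "You spent ";
  yield "the most on ";
  yield "office supplies.";
}
```

In a route handler the `onChunk` callback would typically write to the response, while the returned string is what gets stored in the cache once the stream completes.
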
import { Langfuse } from "langfuse";
import { config } from "../lib/config.js";

let langfuse: Langfuse | null = null;

function getLangfuse(): Langfuse | null {
  if (langfuse !== null) return langfuse;
  if (config.LANGFUSE_PUBLIC_KEY && config.LANGFUSE_SECRET_KEY) {
    langfuse = new Langfuse({
      publicKey: config.LANGFUSE_PUBLIC_KEY,
      secretKey: config.LANGFUSE_SECRET_KEY,
      baseUrl: config.LANGFUSE_HOST,
    });
  } else {
    langfuse = null;
  }
  return langfuse;
}

export async function traceGeneration(params: {
  sessionId: string;
  query: string;
  response: string;
  model?: string;
  tokens?: number;
}): Promise<void> {
  const lf = getLangfuse();
  if (!lf) return;
  const trace = lf.trace({
    id: `gen-${params.sessionId}-${String(Date.now())}`,
    name: "chat-generation",
    sessionId: params.sessionId,
    metadata: { model: params.model ?? "command-a-03-2025" },
  });
  trace.generation({
    name: "cohere-chat",
    model: params.model ?? "command-a-03-2025",
    input: params.query,
    output: params.response,
    usage: params.tokens ? { output: params.tokens } : undefined,
  });
  await lf.flushAsync();
}

export function traceEmbedding(params: {
  sessionId: string;
  model?: string;
  tokenCount?: number;
}): void {
  const lf = getLangfuse();
  if (!lf) return;
  const trace = lf.trace({
    id: `emb-${params.sessionId}-${String(Date.now())}`,
    name: "embedding",
    sessionId: params.sessionId,
  });
  trace.generation({
    name: "embedding-generation",
    model: params.model ?? "BGEBaseEN",
    usage: { output: params.tokenCount ?? 0 },
  });
}

export function traceCacheHit(params: {
  sessionId: string;
  hitType: string;
  cachedAt?: Date;
}): void {
  const lf = getLangfuse();
  if (!lf) return;
  const trace = lf.trace({
    id: `cache-${params.sessionId}-${String(Date.now())}`,
    name: "cache-hit",
    sessionId: params.sessionId,
    metadata: {
      hitType: params.hitType,
      cachedAt: params.cachedAt?.toISOString(),
    },
  });
  trace.event({
    name: "cache-result",
    input: { hitType: params.hitType },
    output: { cachedAt: params.cachedAt },
  });
}

export function traceClarification(params: {
  sessionId: string;
  query: string;
  decisionType: string;
}): void {
  const lf = getLangfuse();
  if (!lf) return;
  const trace = lf.trace({
    id: `clarify-${params.sessionId}-${String(Date.now())}`,
    name: "clarification",
    sessionId: params.sessionId,
    metadata: { decisionType: params.decisionType },
  });
  trace.event({
    name: "clarification-requested",
    input: params.query,
    output: { decisionType: params.decisionType },
  });
}
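
Because observability is optional, it can be worth making sure a tracing failure never fails a chat request. One way to do that, sketched here as an assumption rather than something the tutorial's code does, is to route trace calls through a small guard. `safeTrace` is a hypothetical helper, and the `report` callback defaults to `console.warn` purely for illustration.

```typescript
// Run a tracing step and swallow any error it throws, reporting it instead.
// Resolves to true when the step succeeded and false when it failed.
async function safeTrace(
  label: string,
  trace: () => void | Promise<void>,
  report: (message: string) => void = console.warn,
): Promise<boolean> {
  try {
    await trace();
    return true;
  } catch (cause) {
    const reason = cause instanceof Error ? cause.message : String(cause);
    report(`tracing step "${label}" failed: ${reason}`);
    return false;
  }
}
```

A route handler could then call, for instance, `await safeTrace("generation", () => traceGeneration({ sessionId, query, response }))`, where `sessionId`, `query`, and `response` are whatever the handler already has in scope.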

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Step 2: Configure environment variables

Step 3: Build the configuration module

Step 4: Set up the database schema with pgvector

Step 5: Create the embedding service

Step 6: Add multi-turn conversation memory

Step 7: Implement confidence routing

Step 8: Build the semantic cache

Step 9: Connect the Cohere API

Step 10: Add structured output repair

Step 11: Add Langfuse observability

Step 12: Create the chat API route

Step 13: Wire the Express server entry point

Step 14: Run the tests

Next steps