Mistral AI Invoice Reconciliation for Stripe

Automatically extracts structured data from PDF invoices and reconciles them against Stripe transactions, flagging discrepancies for your finance team.

typescript express mistral stripe invoice-reconciliation document-pipeline pdf-extraction

The problem

Accountants manually match dozens of emailed PDF invoices to Stripe payment records, leading to errors and hours of tedious verification.

Intro

In this tutorial you’ll build an Express API that accepts PDF invoices, extracts structured data with Mistral AI, matches the extracted totals against Stripe payment intents, and emails a discrepancy report to your finance team. By the end you’ll understand how the pipeline chains together PDF text extraction, document classification via a confidence router, structured JSON extraction, Stripe reconciliation, and per-file budget enforcement.

Prerequisites

Node.js >= 22
pnpm 10.x
Mistral AI API key
Stripe secret key
SMTP email account (for sending reports)

Step 1: Scaffold the project

Create the directory structure and initialize a TypeScript Node.js project.

terminal

mkdir mistral-ai-invoice-reconciliation
cd mistral-ai-invoice-reconciliation
pnpm init

Set the project to use ES modules and pin the package manager version.

terminal

pnpm pkg set type="module"
pnpm pkg set packageManager="pnpm@10.0.0"

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

75 tests · 97.5% coverage · vitest passing

import { z } from "zod";

export const InvoiceLineItemSchema = z.object({
  description: z.string(),
  quantity: z.number(),
  unitPrice: z.number(),
  totalPrice: z.number(),
});

export const InvoiceSchema = z.object({
  invoiceNumber: z.string(),
  date: z.string(),
  dueDate: z.string().optional(),
  vendor: z.string(),
  customer: z.string().optional(),
  lineItems: z.array(InvoiceLineItemSchema),
  subtotal: z.number(),
  tax: z.number().optional(),
  total: z.number(),
  currency: z.string(),
});

export const ReceiptSchema = z.object({
  date: z.string(),
  vendor: z.string(),
  total: z.number(),
  currency: z.string(),
  items: z.array(z.object({ description: z.string(), amount: z.number() })).optional(),
});

export const CreditNoteSchema = z.object({
  invoiceNumber: z.string(),
  date: z.string(),
  vendor: z.string(),
  total: z.number(),
  currency: z.string(),
  reason: z.string().optional(),
});

export const ExtractionResultSchema = z.discriminatedUnion("documentType", [
  z.object({ documentType: z.literal("invoice"), data: InvoiceSchema }),
  z.object({ documentType: z.literal("receipt"), data: ReceiptSchema }),
  z.object({ documentType: z.literal("credit_note"), data: CreditNoteSchema }),
]);

export const DiscrepancyReportSchema = z.object({
  invoiceNumber: z.string(),
  invoiceDate: z.string(),
  invoiceTotal: z.number(),
  stripeTotal: z.number().nullable(),
  matchedTransactionId: z.string().nullable(),
  differences: z.array(
    z.object({ field: z.string(), invoiceValue: z.string(), stripeValue: z.string() })
  ),
  status: z.enum(["matched", "mismatch", "unmatched", "error"]),
});

export type InvoiceLineItem = z.infer<typeof InvoiceLineItemSchema>;
export type Invoice = z.infer<typeof InvoiceSchema>;
export type Receipt = z.infer<typeof ReceiptSchema>;
export type CreditNote = z.infer<typeof CreditNoteSchema>;
export type ExtractionResult = z.infer<typeof ExtractionResultSchema>;
export type DiscrepancyReport = z.infer<typeof DiscrepancyReportSchema>;

export type ClassificationLabel = "invoice" | "receipt" | "credit_note" | "unknown";

export interface RoutingDecision {
  type: "ROUTE" | "CLARIFY" | "FALLBACK";
  target?: string | undefined;
  prompt?: string | undefined;
}

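The RoutingDecision type above hints at how the confidence router mentioned in the intro might behave. The classifier itself is not shown in this section, so what follows is only a minimal sketch under stated assumptions: the routeByConfidence helper and both thresholds (0.8 and 0.5) are hypothetical and chosen for illustration, not taken from the tutorial's implementation.

```typescript
// Hypothetical sketch: the tutorial's real classifier is not shown here.
type ClassificationLabel = "invoice" | "receipt" | "credit_note" | "unknown";

interface RoutingDecision {
  type: "ROUTE" | "CLARIFY" | "FALLBACK";
  target?: string | undefined;
  prompt?: string | undefined;
}

// Assumed thresholds, for illustration only.
const ROUTE_THRESHOLD = 0.8;
const CLARIFY_THRESHOLD = 0.5;

// Map a label and a confidence score in [0, 1] to one of three outcomes.
function routeByConfidence(label: ClassificationLabel, confidence: number): RoutingDecision {
  if (label !== "unknown" && confidence >= ROUTE_THRESHOLD) {
    // Confident enough to send the text straight to the matching extractor.
    return { type: "ROUTE", target: label };
  }
  if (label !== "unknown" && confidence >= CLARIFY_THRESHOLD) {
    // Plausible guess, but ask the caller to confirm before spending tokens.
    return {
      type: "CLARIFY",
      prompt: `This looks like a ${label}, but the classifier is not confident. Please confirm the document type.`,
    };
  }
  // Unknown label or low confidence: give up on automatic routing.
  return { type: "FALLBACK" };
}

console.log(routeByConfidence("invoice", 0.93).type);
console.log(routeByConfidence("receipt", 0.6).type);
console.log(routeByConfidence("unknown", 0.99).type);
```

With these assumed thresholds the three calls produce ROUTE, CLARIFY, and FALLBACK in that order, which lines up with the three branches the upload route handles.
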
import { Mistral } from "@mistralai/mistralai";
import { jsonrepair } from "jsonrepair";
import { config } from "./config.js";
import { checkBudget, recordSpend } from "./budget.js";
import { InvoiceSchema, ReceiptSchema, CreditNoteSchema } from "./types.js";
import type { ExtractionResult } from "./types.js";

const mistral = new Mistral({ apiKey: config.MISTRAL_API_KEY });

// Carries the raw model output so callers can log what failed to parse.
export class ExtractionError extends Error {
  constructor(message: string, public readonly rawOutput: string) {
    super(message);
    this.name = "ExtractionError";
  }
}

function buildPrompt(documentType: string, text: string): string {
  const system = `You are a document extraction assistant. Extract structured data from the provided text and output ONLY valid JSON matching the schema for a ${documentType}.`;
  return `${system}\n\nText:\n${text}`;
}

// Repair, parse, and validate the model output against the schema for documentType.
function parseAndValidate(documentType: string, rawText: string): ExtractionResult {
  let repaired: string;
  try {
    repaired = jsonrepair(rawText);
  } catch {
    throw new ExtractionError("Failed to repair JSON", rawText);
  }

  let parsed: unknown;
  try {
    parsed = JSON.parse(repaired);
  } catch {
    throw new ExtractionError("Failed to parse repaired JSON", rawText);
  }

  try {
    if (documentType === "invoice") {
      const data = InvoiceSchema.parse(parsed);
      return { documentType: "invoice", data };
    }
    if (documentType === "receipt") {
      const data = ReceiptSchema.parse(parsed);
      return { documentType: "receipt", data };
    }
    if (documentType === "credit_note") {
      const data = CreditNoteSchema.parse(parsed);
      return { documentType: "credit_note", data };
    }
  } catch {
    throw new ExtractionError("Schema validation failed", rawText);
  }

  throw new ExtractionError(`Unknown document type: ${documentType}`, rawText);
}

export async function extractFromText(
  documentType: string,
  text: string,
  budgetScope: string
): Promise<ExtractionResult> {
  // Token counts are estimates (about four characters per token), not API-reported usage.
  const estimatedTokens = Math.ceil(text.length / 4) + 500;
  // Worst case: assume the model uses the full 2048-token output allowance.
  const estimatedCost = (estimatedTokens / 1000) * 0.003 + (2048 / 1000) * 0.009;
  await checkBudget(budgetScope, estimatedCost, config.MISTRAL_MODEL);

  const response = await mistral.chat.complete({
    model: config.MISTRAL_MODEL,
    messages: [{ role: "user", content: buildPrompt(documentType, text) }],
    maxTokens: 2048,
  });

  const rawContent = response.choices[0]?.message?.content ?? "";
  const contentString = typeof rawContent === "string" ? rawContent : "";
  const result = parseAndValidate(documentType, contentString);

  const inputTokens = estimatedTokens;
  // Round up once so the recorded cost and the recorded token count agree.
  const outputTokens = Math.ceil(contentString.length / 4);
  const actualCost = (inputTokens / 1000) * 0.003 + (outputTokens / 1000) * 0.009;
  recordSpend(
    `req-${budgetScope}`,
    budgetScope,
    actualCost,
    inputTokens,
    outputTokens,
    config.MISTRAL_MODEL
  );

  return result;
}
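
To make the budget arithmetic in extractFromText concrete, here is a small worked sketch. The rates (0.003 and 0.009 per 1,000 tokens), the 500-token overhead, and the 2048-token output cap are copied from the code above; reading the 500 as prompt overhead is an interpretation, and fitsBudget is a hypothetical stand-in for checkBudget, whose real implementation lives in the budget module that is not shown here.

```typescript
// Constants copied from extractFromText; "per 1,000 tokens" follows from the division by 1000.
const INPUT_RATE_PER_1K = 0.003;
const OUTPUT_RATE_PER_1K = 0.009;
const MAX_OUTPUT_TOKENS = 2048;
const PROMPT_OVERHEAD_TOKENS = 500; // assumed meaning of the fixed +500

// Roughly four characters per token, plus the fixed overhead.
function estimateInputTokens(text: string): number {
  return Math.ceil(text.length / 4) + PROMPT_OVERHEAD_TOKENS;
}

// Worst case: the model uses its entire output allowance.
function estimateWorstCaseCost(text: string): number {
  const inputCost = (estimateInputTokens(text) / 1000) * INPUT_RATE_PER_1K;
  const outputCost = (MAX_OUTPUT_TOKENS / 1000) * OUTPUT_RATE_PER_1K;
  return inputCost + outputCost;
}

// Hypothetical stand-in for checkBudget: would this call stay within the limit?
function fitsBudget(limit: number, spentSoFar: number, estimate: number): boolean {
  return spentSoFar + estimate <= limit;
}

const sampleText = "x".repeat(4000);             // a 4,000-character document
const tokens = estimateInputTokens(sampleText);  // 4000 / 4 + 500 = 1500
const cost = estimateWorstCaseCost(sampleText);  // 0.0045 + 0.018432, about 0.022932

console.log(tokens, cost.toFixed(6));
console.log(fitsBudget(0.05, 0, cost));    // first call fits a 0.05 limit
console.log(fitsBudget(0.05, 0.03, cost)); // 0.03 already spent: this call would not fit
```

Note that the output allowance dominates the estimate for short documents: here it accounts for about 80% of the reserved cost, so lowering maxTokens is the most direct way to shrink the up-front reservation.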

import Stripe from "stripe";
import { config } from "./config.js";
import { DiscrepancyReportSchema } from "./types.js";
import type { Invoice, DiscrepancyReport } from "./types.js";

const stripe = new Stripe(config.STRIPE_SECRET_KEY, { apiVersion: "2026-04-22.dahlia" });

// Stripe amounts are integers in the smallest currency unit.
// Assumes a two-decimal currency, as the rest of this module does.
function toCents(amount: number): number {
  return Math.round(amount * 100);
}

// Find payment intents within 5% of the invoice total and 2 days of the invoice date.
export async function searchPayments(invoice: Invoice): Promise<Stripe.PaymentIntent[]> {
  const minAmount = Math.round(invoice.total * 0.95 * 100);
  const maxAmount = Math.round(invoice.total * 1.05 * 100);
  const invoiceDate = new Date(invoice.date);
  const start = Math.floor(new Date(invoiceDate.getTime() - 2 * 86400000).getTime() / 1000);
  const end = Math.floor(new Date(invoiceDate.getTime() + 2 * 86400000).getTime() / 1000);

  try {
    const result = await stripe.paymentIntents.search({
      query: `amount>=${minAmount} AND amount<=${maxAmount} AND created>=${start} AND created<=${end}`,
    });
    return result.data;
  } catch {
    // A failed search is reported downstream as "unmatched", not as an error.
    return [];
  }
}

export function reconcile(invoice: Invoice, payments: Stripe.PaymentIntent[]): DiscrepancyReport {
  try {
    if (payments.length === 0) {
      return DiscrepancyReportSchema.parse({
        invoiceNumber: invoice.invoiceNumber,
        invoiceDate: invoice.date,
        invoiceTotal: invoice.total,
        stripeTotal: null,
        matchedTransactionId: null,
        differences: [],
        status: "unmatched",
      });
    }

    // Compare in integer cents so floating-point noise cannot flip the result.
    const targetCents = toCents(invoice.total);
    const closest = payments.reduce((best, p) => {
      const bestDiff = Math.abs(best.amount - targetCents);
      const currDiff = Math.abs(p.amount - targetCents);
      return currDiff < bestDiff ? p : best;
    });

    const differences: Array<{ field: string; invoiceValue: string; stripeValue: string }> = [];
    const stripeTotal = closest.amount / 100;

    // Tolerate a difference of up to one cent.
    if (Math.abs(closest.amount - targetCents) > 1) {
      differences.push({
        field: "total",
        invoiceValue: invoice.total.toFixed(2),
        stripeValue: stripeTotal.toFixed(2),
      });
    }

    const status = differences.length === 0 ? "matched" : "mismatch";
    return DiscrepancyReportSchema.parse({
      invoiceNumber: invoice.invoiceNumber,
      invoiceDate: invoice.date,
      invoiceTotal: invoice.total,
      stripeTotal,
      matchedTransactionId: closest.id,
      differences,
      status,
    });
  } catch {
    return DiscrepancyReportSchema.parse({
      invoiceNumber: invoice.invoiceNumber,
      invoiceDate: invoice.date,
      invoiceTotal: invoice.total,
      stripeTotal: null,
      matchedTransactionId: null,
      differences: [],
      status: "error",
    });
  }
}

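The matching rules in the reconciliation module can be tried without a Stripe account. The sketch below mirrors the 5% search window, the closest-amount selection, and the one-cent tolerance using plain objects. PaymentLike, the helper names, and the pi_a/pi_b/pi_c identifiers are made up for illustration, and the comparison is done in integer cents as a deliberate choice to sidestep floating-point rounding.

```typescript
// Hypothetical stand-in for the two PaymentIntent fields the matching logic reads.
interface PaymentLike {
  id: string;
  amount: number; // smallest currency unit (cents), as in Stripe
}

type MatchStatus = "matched" | "mismatch" | "unmatched";

// The 5% window used to narrow the search, expressed in cents.
function amountWindowCents(total: number): { min: number; max: number } {
  return {
    min: Math.round(total * 0.95 * 100),
    max: Math.round(total * 1.05 * 100),
  };
}

// Pick the payment whose amount is nearest to the invoice total.
function pickClosest(totalCents: number, payments: PaymentLike[]): PaymentLike | null {
  if (payments.length === 0) return null;
  return payments.reduce((best, p) =>
    Math.abs(p.amount - totalCents) < Math.abs(best.amount - totalCents) ? p : best
  );
}

// Classify the result, tolerating a difference of up to one cent.
function matchStatus(total: number, payments: PaymentLike[]): MatchStatus {
  const totalCents = Math.round(total * 100);
  const closest = pickClosest(totalCents, payments);
  if (closest === null) return "unmatched";
  return Math.abs(closest.amount - totalCents) > 1 ? "mismatch" : "matched";
}

const candidates: PaymentLike[] = [
  { id: "pi_a", amount: 11800 },
  { id: "pi_b", amount: 12000 },
  { id: "pi_c", amount: 12500 },
];

console.log(amountWindowCents(120));                          // window for a 120.00 invoice
console.log(pickClosest(12000, candidates)?.id);              // nearest candidate
console.log(matchStatus(120, candidates));                    // exact match exists
console.log(matchStatus(120, [{ id: "pi_a", amount: 11800 }])); // only a 118.00 payment
console.log(matchStatus(120, []));                            // nothing found
```

For a 120.00 invoice the window is 11400 to 12600 cents, pi_b is the nearest candidate, and the three status calls yield matched, mismatch, and unmatched. One design consequence worth noting: because selection is by amount only, two payments of the same amount inside the date window are indistinguishable, and the first one encountered wins.
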
import { Router } from "express";
import multer from "multer";
import { config } from "../lib/config.js";
import { extractTextFromPdf } from "../lib/pdf-processor.js";
import { classifyDocument } from "../classifier.js";
import { extractFromText, ExtractionError } from "../lib/mistral.js";
import { searchPayments, reconcile } from "../lib/stripe-reconciliation.js";
import { sendDiscrepancyReport } from "../lib/email.js";
import { defineFileBudget } from "../lib/budget.js";

const upload = multer({
  storage: multer.memoryStorage(),
  limits: { fileSize: config.UPLOAD_MAX_SIZE, files: 1 },
});

export const uploadRouter = Router();

uploadRouter.post("/", upload.single("file"), async (req, res) => {
  const start = Date.now();
  try {
    if (!req.file) {
      res.status(400).json({ success: false, error: "Missing file" });
      return;
    }

    const allowedTypes = ["application/pdf", "image/png", "image/jpeg"];
    if (!allowedTypes.includes(req.file.mimetype)) {
      res.status(400).json({ success: false, error: "Unsupported file type" });
      return;
    }

    const uploadId = `upload-${Date.now()}-${Math.random().toString(36).slice(2)}`;
    defineFileBudget(uploadId);

    const { text } = await extractTextFromPdf(req.file.buffer);
    if (text.length < 50) {
      res.status(422).json({ success: false, error: "No readable text could be extracted" });
      return;
    }

    const decision = await classifyDocument(text);
    if (decision.type === "CLARIFY") {
      res.status(422).json({ success: false, error: decision.prompt });
      return;
    }
    if (decision.type === "FALLBACK") {
      res.status(422).json({ success: false, error: "Unable to classify document" });
      return;
    }

    const target = decision.target ?? "unknown";
    const extraction = await extractFromText(target, text, uploadId);

    let reconciliation = null;
    if (target === "invoice" && extraction.documentType === "invoice") {
      const payments = await searchPayments(extraction.data);
      reconciliation = reconcile(extraction.data, payments);

      const body = req.body as { email?: string };
      const recipient = typeof body.email === "string" ? body.email : config.EMAIL_FROM;
      await sendDiscrepancyReport(reconciliation, recipient);
    }

    const duration = Date.now() - start;
    console.log(`Processed ${uploadId} in ${String(duration)}ms`);

    res.json({
      success: true,
      data: {
        classification: target,
        extraction,
        reconciliation,
      },
    });
  } catch (err) {
    console.error("Upload error:", err);
    if (err instanceof ExtractionError) {
      res.status(500).json({ success: false, error: "Extraction failed", details: err.message });
      return;
    }
    res.status(500).json({ success: false, error: "Internal server error" });
  }
});

Tutorial outline

Intro
Prerequisites
Step 1: Scaffold the project
Step 2: Install dependencies
Step 3: Configure TypeScript
Step 4: Configure vitest
Step 5: Set up environment variables
Step 6: Write the config module
Step 7: Write the Zod schemas
Step 8: Write the budget engine
Step 9: Write the Mistral pricing provider
Step 10: Write the PDF text extractor
Step 11: Write the document classifier
Step 12: Write the Mistral extraction module
Step 13: Write the Stripe reconciliation module
Step 14: Write the email module
Step 15: Write the upload route
Step 16: Write the Express server
Step 17: Write the test setup
Step 18: Write the upload integration tests
Step 19: Add npm scripts
Step 20: Run the tests
Step 21: Start the server
Step 22: Upload an invoice
Next steps