Google Gemini Security Guardrails for SMB Healthcare PII Redaction

Automatically detect and redact PHI from patient messages before they reach the LLM, helping small clinics stay HIPAA‑compliant.

google-gemini hipaa pii-redaction security-guardrails healthcare-ai nextjs presidio reaatech

The problem

Small healthcare providers using AI chatbots for patient intake risk exposing protected health information (PHI) to LLM APIs—a HIPAA violation that can result in severe fines and loss of trust.

Built from

Intro

Small healthcare clinics and SMBs are using AI chatbots for patient intake, but every message sent to a Gemini API risks exposing protected health information (PHI) — a direct HIPAA violation. This tutorial walks you through building a guardrail pipeline that detects, redacts, and re-identifies PHI across multi-turn conversations. You’ll wire together PII detection from @presidio-dev/hai-guardrails, deterministic redaction via @reaatech/guardrail-chain, a tool-use firewall from @reaatech/tool-use-firewall-core, session persistence from @reaatech/session-continuity, and token-level cost tracking from @reaatech/llm-cost-telemetry — all inside a single Next.js App Router endpoint at POST /api/chat.

Prerequisites

Node.js 22+ and pnpm 10 installed globally
A Google Gemini API key — get one free at Google AI Studio
Basic familiarity with Next.js App Router route handlers and TypeScript

Step 1: Scaffold the Next.js project and install dependencies

Create a Next.js App Router project and install every dependency at exact pinned versions.

terminal

npx create-next-app@latest google-gemini-pii-guardrails --typescript --app --use-pnpm --no-tailwind

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

160 kB·72 tests·99.6% coverage·vitest passing

SHA-2567708aba1b117c428bc7d8a12de5dfaef00aa2a87d867abf8c9cad8c9fe9e925f

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js 22+ and pnpm 10 installed globally
A Google Gemini API key — get one free at Google AI Studio
Basic familiarity with Next.js App Router route handlers and TypeScript

Step 1: Scaffold the Next.js project and install dependencies

Create a Next.js App Router project and install every dependency at exact pinned versions.

terminal

npx create-next-app@latest google-gemini-pii-guardrails --typescript --app --use-pnpm --no-tailwind

// src/services/entity-mapper.ts import type { PIIEntity, EntityMap } from '../types.js'; export class EntityMapper { buildEntityMap(entities: PIIEntity[], existingMap?: EntityMap): EntityMap { const map: Map<string, { original: string; redacted: string }> = new Map(existingMap); if (entities.length === 0) { return map; } let nextIndex = map.size + 1; for (const entity of entities) { let found = false; for (const [, entry] of map) { if (entry.original === entity.originalText) { found = true; break; } } if (!found) { const token = `[PHI-${String(nextIndex)}]`; map.set(token, { original: entity.originalText, redacted: token }); nextIndex++; } } return map; } applyRedaction(text: string, entities: PIIEntity[]): string { if (entities.length === 0) { return text; } const sorted = [...entities].sort((a, b) => { const lenDiff = b.originalText.length - a.originalText.length; if (lenDiff !== 0) return lenDiff; return a.startIndex - b.startIndex; }); const covered = new Set<number>(); const filtered: PIIEntity[] = []; for (const entity of sorted) { let isCovered = false; for (let i = entity.startIndex; i < entity.endIndex; i++) { if (covered.has(i)) { isCovered = true; break; } } if (!isCovered) { for (let i = entity.startIndex; i < entity.endIndex; i++) { covered.add(i); } filtered.push(entity); } } const rightToLeft = [...filtered].sort((a, b) => b.startIndex - a.startIndex); let result = text; for (const entity of rightToLeft) { const before = result.slice(0, entity.startIndex); const after = result.slice(entity.endIndex); result = before + entity.redactedToken + after; } return result; } reconstructOriginal(text: string, entityMap: EntityMap): string { if (entityMap.size === 0) { return text; } let result = text; for (const [token, entry] of entityMap) { if (token.startsWith('[PHI-') && token.endsWith(']')) { result = result.split(token).join(entry.original); } } return result; } scanForUnredactedTerms(text: string, entityMap: EntityMap): string[] { const leaked: string[] = []; for (const [, entry] of entityMap) { if (text.includes(entry.original)) { leaked.push(entry.original); } } return leaked; } } export const entityMapper = new EntityMapper();

// src/services/cost-tracker.ts import { generateId, now, calculateCostFromTokens, loadConfig, type CostSpan, CostSpanSchema } from '@reaatech/llm-cost-telemetry'; const INPUT_PRICE_PER_MILLION = 0.15; const OUTPUT_PRICE_PER_MILLION = 0.60; class CostTracker { private config: ReturnType<typeof loadConfig>; private spans: Map<string, CostSpan[]>; constructor() { this.config = loadConfig(); this.spans = new Map<string, CostSpan[]>(); } reset(): void { this.spans = new Map<string, CostSpan[]>(); } recordCall(params: { provider: string; model: string; inputTokens: number; outputTokens: number; sessionId: string }): CostSpan { const span: CostSpan = { id: generateId(), provider: params.provider as CostSpan['provider'], model: params.model, inputTokens: params.inputTokens, outputTokens: params.outputTokens, costUsd: calculateCostFromTokens(params.inputTokens, INPUT_PRICE_PER_MILLION) + calculateCostFromTokens(params.outputTokens, OUTPUT_PRICE_PER_MILLION), timestamp: now(), tenant: params.sessionId, }; const validated = CostSpanSchema.parse(span); const existing = this.spans.get(params.sessionId) ?? []; existing.push(validated); this.spans.set(params.sessionId, existing); return validated; } getSessionCost(sessionId: string): Promise<number> { const sessionSpans = this.spans.get(sessionId); if (!sessionSpans || sessionSpans.length === 0) { return Promise.resolve(0); } return Promise.resolve(sessionSpans.reduce((sum, span) => sum + span.costUsd, 0)); } async checkBudget(sessionId: string): Promise<{ withinBudget: boolean; spent: number; limit: number }> { const spent = await this.getSessionCost(sessionId); const configAny = this.config as { budget?: { global?: { daily?: number } } }; const limit = configAny.budget?.global?.daily ?? Infinity; return { withinBudget: spent <= limit, spent, limit }; } } export const costTracker = new CostTracker();

// app/api/chat/route.ts import { type NextRequest, NextResponse } from 'next/server'; import { ChatRequestSchema } from '../../../src/types.js'; import { piiDetector } from '../../../src/services/pii-detector.js'; import { sessionService } from '../../../src/services/session-service.js'; import { entityMapper } from '../../../src/services/entity-mapper.js'; import { guardrailChainService } from '../../../src/services/guardrail-chain-service.js'; import { geminiClient } from '../../../src/services/llm-client.js'; import { firewall } from '../../../src/services/firewall-service.js'; import { costTracker } from '../../../src/services/cost-tracker.js'; export async function POST(req: NextRequest): Promise<NextResponse> { try { let body: unknown; try { body = await req.json(); } catch { return NextResponse.json({ error: 'Malformed JSON' }, { status: 400 }); } let parsed: { message: string; sessionId?: string; userId?: string }; try { parsed = ChatRequestSchema.parse(body); } catch { return NextResponse.json({ error: 'Invalid request body' }, { status: 400 }); } const { message, sessionId: sessionIdParam, userId } = parsed; const entities = await piiDetector.detect(message); const { sessionId, entityMap } = await sessionService.getOrCreateSession(sessionIdParam, userId); const mergedMap = entityMapper.buildEntityMap(entities, entityMap); const redacted = await guardrailChainService.redact(message); const llmResult = await geminiClient.generate(redacted.output); const firewallResult = await firewall.inspectToolCall('generate', { response: llmResult.text }, mergedMap); if (!firewallResult.allowed) { return NextResponse.json({ error: 'Blocked: tool call references PHI' }, { status: 403 }); } const replyText = entityMapper.reconstructOriginal(llmResult.text, mergedMap); const span = costTracker.recordCall({ provider: 'google', model: 'gemini-2.5-flash', inputTokens: llmResult.inputTokens, outputTokens: llmResult.outputTokens, sessionId, }); await sessionService.storeEntityMap(sessionId, mergedMap); await sessionService.addMessages(sessionId, message, replyText); return NextResponse.json({ reply: replyText, sessionId, redacted: entities.length > 0, costUsd: span.costUsd, }); } catch { return NextResponse.json({ error: 'Internal server error' }, { status: 500 }); } }

// tests/services/entity-mapper.test.ts import { describe, it, expect } from 'vitest'; import { entityMapper } from '../../src/services/entity-mapper.js'; import type { PIIEntity, EntityMap } from '../../src/types.js'; describe('entityMapper', () => { it('buildEntityMap with empty entities returns empty map', () => { const result = entityMapper.buildEntityMap([]); expect(result.size).toBe(0); }); it('buildEntityMap skips duplicate entities already in existingMap', () => { const existing: EntityMap = new Map([['[PHI-1]', { original: 'john@test.com', redacted: '[PHI-1]' }]]); const result = entityMapper.buildEntityMap( [{ type: 'EMAIL', originalText: 'john@test.com', redactedToken: '[MASKED]', startIndex: 0, endIndex: 14 }], existing, ); expect(result.size).toBe(1); expect(result.get('[PHI-1]')?.original).toBe('john@test.com'); }); it('applyRedaction replaces phone number with [PHI-1]', () => { const entities: PIIEntity[] = [ { type: 'PHONE', originalText: '555-1234', redactedToken: '[PHI-1]', startIndex: 13, endIndex: 21 }, ]; const result = entityMapper.applyRedaction('Call John at 555-1234', entities); expect(result).toBe('Call John at [PHI-1]'); }); it('reconstructOriginal restores original text from tokens', () => { const map: EntityMap = new Map([['[PHI-1]', { original: '555-1234', redacted: '[PHI-1]' }]]); const result = entityMapper.reconstructOriginal('Call John at [PHI-1]', map); expect(result).toBe('Call John at 555-1234'); }); it('overlapping entities sorted by length, longest applied first', () => { const entities: PIIEntity[] = [ { type: 'PERSON', originalText: 'John Doe', redactedToken: '[PHI-1]', startIndex: 3, endIndex: 11 }, { type: 'LAST_NAME', originalText: 'Doe', redactedToken: '[PHI-2]', startIndex: 8, endIndex: 11 }, ]; const result = entityMapper.applyRedaction('Hi John Doe!', entities); expect(result).toBe('Hi [PHI-1]!'); expect(result).not.toContain('[PHI-2]'); }); it('scanForUnredactedTerms finds leaked terms', () => { const map: EntityMap = new Map([['[PHI-1]', { original: '555-1234', redacted: '[PHI-1]' }]]); const leaked = entityMapper.scanForUnredactedTerms('Call John at 555-1234', map); expect(leaked).toEqual(['555-1234']); }); });

Google Gemini Security Guardrails for SMB Healthcare PII Redaction

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Step 2: Configure environment variables

Step 3: Write shared types with Zod

Step 4: Create the PII detector

Step 5: Build the guardrail chain for PHI redaction

Step 6: Implement the entity mapper

Step 7: Build the session manager

Step 8: Wire the Gemini LLM client

Step 9: Implement the tool-use firewall

Step 10: Set up cost tracking

Step 11: Wire the API route

Step 12: Write the test suite

Entity mapper tests (pure logic, no mocking needed)

Route handler integration test (all services mocked)

Step 13: Create barrel exports and run quality checks

Next steps