Perplexity Code Sandbox for SMB Data Reporting

Run AI‑generated analytics code in a safe, budget‑controlled sandbox with real‑time data hooks.

perplexity code-sandbox nextjs daytona budget-enforcement circuit-breaker smb-data-reporting

The problem

Small business analysts need custom reports but can’t write code. Asking an LLM to generate SQL or Python is one step — safely executing it against production data without destroying anything or blowing cloud budgets is the real challenge.

Built from

Intro

This tutorial builds a Perplexity Code Sandbox — a Next.js app that lets small business analysts ask natural-language questions and get back safe, sandboxed Python or SQL code execution. You’ll wire up Perplexity AI for code generation, the Daytona SDK for ephemeral sandboxes, the REAA circuit breaker and budget engine for safety controls, and an LLM cache with semantic similarity matching. By the end, you’ll have a working dashboard where you type “show me last month’s sales by region” and get back generated code, execution results, and a cost breakdown.

Prerequisites

Node.js >= 22
pnpm 10.x installed globally
A Perplexity API key for code generation
A Daytona API key for sandbox provisioning
An OpenAI API key for cache embeddings
A Langfuse account (optional — tracing is a graceful no-op if unset)
Familiarity with TypeScript and Next.js 16 App Router patterns

Step 1: Scaffold the Next.js project and install dependencies

Create the project with Next.js 16 (App Router) and TypeScript:

terminal

npx create-next-app@latest perplexity-code-sandbox --typescript --app --src-dir --no-tailwind --import-alias "@/*"
cd perplexity-code-sandbox

Install all the packages you’ll need. Pin every dependency to an exact version — no or :

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

175 kB·128 tests·94.8% coverage·vitest passing

SHA-256f90f3aa3800474cd2d5837770e5e931787381a736931391c9d747e958257f61a

Book a conversation All solutions

Comments

Loading comments…

import { Daytona, Sandbox, DaytonaTimeoutError, } from "@daytonaio/sdk"; import type { ExecutionResult } from "../types.js"; export class SandboxTimeoutError extends Error { constructor(message: string) { super(message); this.name = "SandboxTimeoutError"; } } export interface SandboxSession { id: string; status: string; } export class DaytonaClient { daytona: Daytona; sandboxMap: Map<string, Sandbox>; readonly maxExecutionDuration: number; constructor(maxExecutionDuration: number = 30) { const apiKey = process.env.DAYTONA_API_KEY; if (!apiKey) { throw new Error( "DAYTONA_API_KEY is not set. Provide an apiKey or set the DAYTONA_API_KEY environment variable.", ); } this.daytona = new Daytona({ apiKey }); this.sandboxMap = new Map<string, Sandbox>(); this.maxExecutionDuration = maxExecutionDuration; } async createSandbox(): Promise<SandboxSession> { const sandbox = await this.daytona.create(); this.sandboxMap.set(sandbox.id, sandbox); return { id: sandbox.id, status: sandbox.state ?? "unknown" }; } async executeCode( session: SandboxSession, code: string, language: "python" | "sql", ): Promise<ExecutionResult> { const sandbox = this.sandboxMap.get(session.id); if (!sandbox) { throw new Error(`Sandbox ${session.id} not found`); } const startTime = Date.now(); try { if (language === "python") { const result = await sandbox.codeInterpreter.runCode(code, { timeout: this.maxExecutionDuration, }); return { stdout: result.stdout, stderr: result.stderr, exitCode: result.error ? 1 : 0, durationMs: Date.now() - startTime, }; } const result = await sandbox.process.codeRun( code, undefined, this.maxExecutionDuration, ); return { stdout: result.artifacts?.stdout ?? result.result, stderr: "", exitCode: result.exitCode, durationMs: Date.now() - startTime, }; } catch (err) { if (err instanceof DaytonaTimeoutError) { throw new SandboxTimeoutError( `Code execution timed out after ${String(this.maxExecutionDuration)}s`, ); } throw err; } } async destroySandbox(session: SandboxSession): Promise<void> { const sandbox = this.sandboxMap.get(session.id); if (!sandbox) { return; } await sandbox.delete(); this.sandboxMap.delete(session.id); } } let defaultClient: DaytonaClient | undefined; export function createDaytonaClient(maxExecutionDuration?: number): DaytonaClient { if (maxExecutionDuration !== undefined) { return new DaytonaClient(maxExecutionDuration); } if (defaultClient === undefined) { defaultClient = new DaytonaClient(); } return defaultClient; }

import { BudgetController } from '@reaatech/agent-budget-engine'; import { SpendStore } from '@reaatech/agent-budget-spend-tracker'; import { BudgetScope, BudgetCheckResult, SpendEntry, BudgetState, } from '@reaatech/agent-budget-types'; export { BudgetScope }; export interface BudgetPolicyInput { softCap?: number; hardCap?: number; } export interface BudgetControllerInstance { defineScope(scopeKey: string, limit?: number, policy?: BudgetPolicyInput): void; checkBudget( scopeKey: string, estimatedCost: number, modelId: string, tools?: string[], ): BudgetCheckResult; recordSpend( scopeKey: string, cost: number, requestId: string, inputTokens: number, outputTokens: number, modelId: string, ): void; getState(scopeKey: string): BudgetState | undefined; listAll(): Array<{ definition: unknown; state: unknown }>; } export function createBudgetController(defaultLimit?: number): BudgetControllerInstance { const store = new SpendStore(); const controller = new BudgetController({ spendTracker: store }); controller.on('threshold-breach', (event) => { console.warn( `[Budget] threshold-breach: ${event.scopeType}:${event.scopeKey} at ${String(event.threshold * 100)}%`, ); }); controller.on('hard-stop', (event) => { console.error( `[Budget] hard-stop: ${event.scopeType}:${event.scopeKey} spent=${String(event.spent)} limit=${String(event.limit)}`, ); }); return { defineScope( scopeKey: string, budgetLimit?: number, policy?: BudgetPolicyInput, ): void { const limit = budgetLimit ?? defaultLimit ?? Number(process.env.BUDGET_DEFAULT_LIMIT ?? "5"); controller.defineBudget({ scopeType: BudgetScope.User, scopeKey, limit, policy: { softCap: policy?.softCap ?? 0.8, hardCap: policy?.hardCap ?? 1.0, autoDowngrade: [], disableTools: [], }, }); }, checkBudget( scopeKey: string, estimatedCost: number, modelId: string, tools?: string[], ): BudgetCheckResult { return controller.check({ scopeType: BudgetScope.User, scopeKey, estimatedCost, modelId, tools: tools ?? [], }); }, recordSpend( scopeKey: string, cost: number, requestId: string, inputTokens: number, outputTokens: number, modelId: string, ): void { const entry: SpendEntry = { requestId, scopeType: BudgetScope.User, scopeKey, cost, inputTokens, outputTokens, modelId, provider: 'perplexity', timestamp: new Date(), }; controller.record(entry); }, getState(scopeKey: string): BudgetState | undefined { return controller.getState(BudgetScope.User, scopeKey); }, listAll(): Array<{ definition: unknown; state: unknown }> { return controller.listAll(); }, }; }

import { CacheEngine, InMemoryAdapter, OpenAIEmbedder, buildPromptHash, } from "@reaatech/llm-cache"; export function createCacheEngine(): CacheEngine { const threshold = Number(process.env.SEMANTIC_CACHE_THRESHOLD ?? "0.8"); const ttl = Number(process.env.CACHE_DEFAULT_TTL ?? "3600"); return new CacheEngine({ storage: new InMemoryAdapter(), vectorStorage: new InMemoryAdapter(), embedder: new OpenAIEmbedder({ provider: "openai", model: "text-embedding-3-small", dimensions: 1536, apiKey: process.env.OPENAI_API_KEY ?? "", }), config: { storage: { adapter: "memory" }, vectorStorage: { adapter: "memory" }, embedding: { provider: "openai", model: "text-embedding-3-small", dimensions: 1536, batchSize: 100, maxRetries: 3, }, similarity: { threshold, metric: "cosine", maxResults: 10, }, ttl: { default: ttl, factual: 1800, creative: 7200, analytical: 3600, sensitive: 600, byUseCase: {}, }, segmentation: { enabled: true, defaultUseCase: "general" }, cost: { enabled: true, currency: "USD" }, observability: { metrics: true, tracing: false, logging: "info" }, }, }); } export const cacheEngine = createCacheEngine(); export { buildPromptHash }; export async function getCached( prompt: string, options?: { useCase?: string; model?: string; modelVersion?: string; }, ) { return cacheEngine.get(prompt, options); } export async function setCached( prompt: string, response: unknown, options?: { useCase?: string; model?: string; modelVersion?: string; }, ) { return cacheEngine.set(prompt, response, options); } export async function invalidateByModel(modelVersion: string) { return cacheEngine.invalidate({ modelVersion }); } export async function health() { return cacheEngine.healthCheck(); }

import { jsonrepair } from "jsonrepair"; export class RepairError extends Error { constructor(message: string) { super(message); this.name = "RepairError"; } } export interface SanitizedCode { code: string; safe: boolean; warnings: string[]; } export function repairCodeBlock(raw: string): string { let code = raw.trim(); const fenceMatch = code.match(/^```\w*\s*\n?([\s\S]*?)```\s*$/); if (fenceMatch) { code = fenceMatch[1].trim(); } return code; } export function repairJson(raw: string): unknown { try { const repaired = jsonrepair(raw); return JSON.parse(repaired); } catch { throw new RepairError("Unrecoverable JSON"); } } const PYTHON_DANGEROUS = [ "os.system", "subprocess.call", "subprocess.Popen", "subprocess.run", "subprocess.check_output", "subprocess.check_call", "eval", "exec", "__import__", "importlib", ]; const SQL_DDL_PATTERNS = [ /\bDROP\s+(TABLE|DATABASE|INDEX|VIEW|SCHEMA|PROCEDURE)\b/i, /\bCREATE\s+(TABLE|INDEX|VIEW|DATABASE|PROCEDURE)\b/i, /\bALTER\s+(TABLE|DATABASE|INDEX)\b/i, /\bTRUNCATE\s+(TABLE)\b/i, /\bINSERT\b/i, /\bUPDATE\b/i, /\bDELETE\b/i, ]; export function validateGeneratedCode( code: string, language: "python" | "sql", ): { valid: boolean; errors: string[]; warnings: string[] } { const errors: string[] = []; const warnings: string[] = []; const trimmed = code.trim(); if (!trimmed) { errors.push("Code is empty"); return { valid: false, errors, warnings }; } if (language === "python") { for (const dangerous of PYTHON_DANGEROUS) { if (code.includes(dangerous)) { errors.push(`Dangerous import: ${dangerous}`); } } if (code.includes("open(")) { warnings.push("File operation detected: open()"); } } if (language === "sql") { const statements = code.split(";").filter((s) => s.trim().length > 0); if (statements.length > 1) { errors.push("Multiple statements not allowed"); } for (const pattern of SQL_DDL_PATTERNS) { const match = code.match(pattern); if (match) { errors.push(`DDL not allowed: ${match[0]}`); } } } return { valid: errors.length === 0, errors, warnings }; }

import { describe, it, expect } from "vitest"; import { repairCodeBlock, repairJson, validateGeneratedCode, RepairError, } from "../../../../src/lib/structured-output/repair"; describe("repairCodeBlock", () => { it("strips python fences", () => { const raw = "```python\nprint('hello')\n```"; expect(repairCodeBlock(raw)).toBe("print('hello')"); }); it("strips sql fences", () => { const raw = "```sql\nSELECT * FROM users;\n```"; expect(repairCodeBlock(raw)).toBe("SELECT * FROM users;"); }); it("strips generic fences", () => { const raw = "```\nplain code\n```"; expect(repairCodeBlock(raw)).toBe("plain code"); }); it("trims leading and trailing whitespace", () => { const raw = " \n const x = 1; \n "; expect(repairCodeBlock(raw)).toBe("const x = 1;"); }); it("returns empty string for empty input", () => { expect(repairCodeBlock("")).toBe(""); }); }); describe("repairJson", () => { it("fixes trailing comma", () => { const raw = '{"key": "value",}'; const result = repairJson(raw); expect(result).toEqual({ key: "value" }); }); it("repairs single-quoted keys", () => { const raw = "{'name': 'Alice'}"; const result = repairJson(raw); expect(result).toEqual({ name: "Alice" }); }); it("throws RepairError on unrecoverable input", () => { expect(() => repairJson("{key unquoted,}")).toThrow(RepairError); }); }); describe("validateGeneratedCode", () => { describe("python", () => { it("rejects os.system", () => { const result = validateGeneratedCode('import os\nos.system("ls")', "python"); expect(result.valid).toBe(false); expect(result.errors).toContain("Dangerous import: os.system"); }); it("passes clean python code", () => { const code = 'print("hello")'; const result = validateGeneratedCode(code, "python"); expect(result.valid).toBe(true); expect(result.errors).toHaveLength(0); }); }); describe("sql", () => { it("rejects DROP TABLE", () => { const code = "DROP TABLE users;"; const result = validateGeneratedCode(code, "sql"); expect(result.valid).toBe(false); expect(result.errors).toContain("DDL not allowed: DROP TABLE"); }); it("passes valid SELECT", () => { const code = "SELECT * FROM users WHERE id = 1;"; const result = validateGeneratedCode(code, "sql"); expect(result.valid).toBe(true); expect(result.errors).toHaveLength(0); }); }); });

Perplexity Code Sandbox for SMB Data Reporting

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the Next.js project and install dependencies

Step 2: Configure environment variables

Step 3: Define shared types with Zod

Step 4: Build the Perplexity model adapter

Step 5: Build the Daytona sandbox adapter

Step 6: Set up the circuit breaker

Step 7: Wire up the budget controller

Step 8: Implement the confidence router

Step 9: Set up the LLM cache engine

Step 10: Build the structured output repair module

Step 11: Add Langfuse observability

Step 12: Wire up the orchestration service

Step 13: Create the API routes

POST /api/generate

GET/POST /api/budgets

GET/DELETE /api/cache

Step 14: Build the dashboard UI

Step 15: Run the tests

Next steps