xAI Grok Secure Code Sandbox for SMB Data Pipelines

Empower SMBs to safely run AI-generated code on their data with cost controls and execution quarantine, preventing runaway bills and destructive operations.

xai-grok code-execution sandbox e2b nextjs express typescript smb data-pipeline budget-guardrails

The problem

Small businesses want to use LLMs to automate Excel transformations, CSV analysis, or generate reports, but blindly executing generated code risks data corruption, infinite loops, and unpredictable cloud costs.

Built from

Intro

This tutorial walks you through building a secure code execution sandbox powered by xAI’s Grok API. You’ll create a Next.js application with an Express companion server that takes natural-language prompts from users, generates Python code via Grok, validates and repairs the structured output, checks it against a security policy (blocked patterns, allowed libraries), enforces per-user budget limits, and executes approved code inside an E2B sandbox. Every step along the way is traced to Langfuse for observability. By the end, you’ll have both an Express REST API and a Next.js App Router API serving the same pipeline, with a full test suite hitting over 90% coverage.

Prerequisites

Node.js >= 22 with pnpm (v10+) installed
An xAI API key — set as XAI_API_KEY (get one from the xAI console)
An E2B API key — set as E2B_API_KEY (sign up at e2b.dev)
(Optional) Langfuse credentials for telemetry — skip this and the pipeline still works
Basic familiarity with TypeScript, Next.js App Router, and Express

Step 1: Create the project scaffold

Create a new Next.js project and install all dependencies. This recipe uses Next.js 16 (App Router), six @reaatech/* vendored packages, the E2B sandbox SDK, the Vercel AI SDK, and xlsx for spreadsheet processing.

terminal

npx create-next-app@latest xai-grok-sandbox --typescript --app --src-dir --no-tailwind --eslint

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

175 kB·117 tests·95.3% coverage·vitest passing

SHA-2563f6a363ce4771a426398339205dec0bd35798bf5aaa04bb35234e7bd12e86574

Book a conversation All solutions

Comments

Loading comments…

Intro

Prerequisites

Node.js >= 22 with pnpm (v10+) installed
An xAI API key — set as XAI_API_KEY (get one from the xAI console)
An E2B API key — set as E2B_API_KEY (sign up at e2b.dev)
(Optional) Langfuse credentials for telemetry — skip this and the pipeline still works
Basic familiarity with TypeScript, Next.js App Router, and Express

Step 1: Create the project scaffold

terminal

npx create-next-app@latest xai-grok-sandbox --typescript --app --src-dir --no-tailwind --eslint

// src/lib/budget.ts import { BudgetController } from "@reaatech/agent-budget-engine"; import { SpendStore } from "@reaatech/agent-budget-spend-tracker"; import { PricingEngine } from "@reaatech/agent-budget-pricing"; import { BudgetScope } from "@reaatech/agent-budget-types"; import type { BudgetState } from "@reaatech/agent-budget-types"; import { TelemetryService } from "./telemetry"; export class BudgetService { private controller: BudgetController; private pricing: PricingEngine; private telemetry?: TelemetryService; constructor(telemetry?: TelemetryService) { const store = new SpendStore({ maxEntries: 100_000 }); this.pricing = new PricingEngine(); this.controller = new BudgetController({ spendTracker: store, pricing: this.pricing }); this.telemetry = telemetry; this.controller.on("threshold-breach", (event: unknown) => { this.telemetry?.traceEvent({ name: "budget_threshold_breach", metadata: { event } }); }); this.controller.on("hard-stop", (event: unknown) => { this.telemetry?.traceEvent({ name: "budget_hard_stop", metadata: { event } }); }); } defineUserBudget(userId: string, limit: number): Promise<void> { this.controller.defineBudget({ scopeType: BudgetScope.User, scopeKey: userId, limit, policy: { softCap: 0.8, hardCap: 1.0 }, }); return Promise.resolve(); } checkBudget( userId: string, modelId: string, estimatedTokens: number, ): Promise<{ allowed: boolean; action: string; suggestedModel?: string }> { const estimatedCost = this.pricing.estimateCost(modelId, estimatedTokens); return Promise.resolve(this.controller.check({ scopeType: BudgetScope.User, scopeKey: userId, estimatedCost, modelId, tools: [], })); } recordSpend( userId: string, cost: number, inputTokens: number, outputTokens: number, modelId: string, ): Promise<void> { this.controller.record({ requestId: crypto.randomUUID(), scopeType: BudgetScope.User, scopeKey: userId, cost, inputTokens, outputTokens, modelId, provider: "xai", timestamp: new Date(), }); return Promise.resolve(); } getUserState(userId: string): Promise<BudgetState> { const state = this.controller.getState(BudgetScope.User, userId); if (!state) { return Promise.reject(new Error(`No budget state found for user ${userId}`)); } return Promise.resolve(state); } resetBudget(userId: string): Promise<void> { this.controller.reset(BudgetScope.User, userId); return Promise.resolve(); } }

// app/api/code/route.ts import { NextRequest, NextResponse } from "next/server"; import { runCodePipeline } from "@/src/services/pipeline"; import { SandboxService } from "@/src/lib/sandbox"; import { BudgetService } from "@/src/lib/budget"; import { TelemetryService } from "@/src/lib/telemetry"; import { ApprovalStore } from "@/src/lib/approval"; import { PolicyViolationError, BudgetExceededError } from "@reaatech/tool-use-firewall-core"; const sandboxService = new SandboxService(); const budgetService = new BudgetService(); const telemetryService = TelemetryService.getInstance(); const approvalStore = new ApprovalStore(); const deps = { sandboxService, budgetService, telemetryService, approvalStore }; export async function POST(req: NextRequest) { try { const body = await req.json() as Record<string, unknown>; const { prompt, userId, fileBuffer: rawFileBuffer, fileName } = body; if (typeof prompt !== "string" || typeof userId !== "string" || !prompt || !userId) { return NextResponse.json({ error: "Missing required fields" }, { status: 400 }); } const fileBuffer = Array.isArray(rawFileBuffer) ? Buffer.from(rawFileBuffer as number[]) : undefined; const result = await runCodePipeline( { prompt, userId, fileBuffer, fileName: typeof fileName === "string" ? fileName : undefined }, deps, ); if (result.approvalId) { return NextResponse.json({ status: "approval_required", approvalId: result.approvalId }, { status: 202 }); } return NextResponse.json({ status: "success", ...result }); } catch (err) { if (err instanceof PolicyViolationError) { return NextResponse.json({ error: "policy_violation", reason: err.message }, { status: 403 }); } if (err instanceof BudgetExceededError) { return NextResponse.json({ error: "budget_exceeded" }, { status: 402 }); } return NextResponse.json({ error: "internal_error" }, { status: 500 }); } } export function GET(req: NextRequest) { const userId = req.nextUrl.searchParams.get("userId"); return NextResponse.json({ history: [], userId }); }

xAI Grok Secure Code Sandbox for SMB Data Pipelines

The problem

Built from

Intro

Prerequisites

Step 1: Create the project scaffold

Example artifact

Comments

Intro

Prerequisites

Step 1: Create the project scaffold

Step 2: Set up environment variables

Step 3: Create the typed configuration module

Step 4: Wire up xAI Grok via the AI SDK

Step 5: Build the structured-output repair layer

Step 6: Set up the sandbox service with a firewall

Step 7: Define the sandbox security policy

Step 8: Implement budget enforcement

Step 9: Add approval workflow, telemetry, and spreadsheet utilities

Step 10: Build the pipeline orchestrator

Step 11: Create the Express server and Next.js App Router routes

Step 12: Write tests and verify

Next steps