Anthropic AI Runbook Automation for SMB Field Service Dispatching

Automatically detect agent failures in field service workflows and execute pre-defined runbooks to restore operations without human intervention.

anthropic nextjs field-service runbook-automation reliability-ops trigger-dev slack langfuse

The problem

Small field service businesses lose revenue when their AI dispatch agents fail during after-hours or peak times. Manual recovery is slow and requires operations staff that small teams can’t afford.

Built from

Intro

This recipe builds an AI dispatch failure detection and remediation pipeline. You’ll wire up six REAA agent-runbook packages, create a Next.js API route that receives webhook failure events, classify incidents, generate runbooks, and alert Slack only when automatic recovery fails. By the end, you’ll have a runbook engine that autonomously triages and remediates field service failures.

Prerequisites

Node.js 22+ and pnpm 10 (npm install -g pnpm@10)
An Anthropic API key (free tier at console.anthropic.com)
A Slack bot token and channel ID (create an app at api.slack.com/apps, add chat:write scope, install to workspace)
A Langfuse account for observability (free tier at langfuse.com — get public/secret keys)
Familiarity with Next.js App Router route handlers, TypeScript, and vitest

Step 1: Scaffold the project and install dependencies

Create the project directory and scaffold a Next.js project:

terminal

mkdir anthropic-field-service-runbook
cd anthropic-field-service-runbook
pnpm create next-app@latest . --typescript --app --src-dir

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

Download example (zip)Browse files

169 kB·56 tests·100.0% coverage·vitest passing

SHA-256cea9e99ab4b920136d22bffe04383f54e22337213d2e54283c92ae60aeef4eac

Book a conversation All solutions

Comments

Loading comments…

import { identifyHealthChecks, generateHealthChecks, generateKubernetesProbeYaml } from "@reaatech/agent-runbook-health-checks"; import { defaultContext } from "./default-context.js"; export type HealthCheckResult = { endpoint: string; alive: boolean; latencyMs: number; lastChecked: Date; }; export type HealthCheckConfig = { platform: "prometheus" | "datadog" | "kubernetes" | "load-balancer"; serviceName: string; port?: number; path?: string; }; interface HealthCheckItem { name: string; type: string; endpoint?: string; expectedStatus?: number; timeout?: number; } interface HealthCheck { id?: string; name: string; type: "liveness" | "readiness" | "startup" | "deep"; endpoint: string; interval: string; timeout: string; successCriteria: string; checks?: HealthCheckItem[]; } export async function probeAgentEndpoints(endpoints: string[]): Promise<HealthCheckResult[]> { const results: HealthCheckResult[] = []; for (const endpoint of endpoints) { const start = performance.now(); try { const controller = new AbortController(); const timeout = setTimeout(() => { controller.abort(); }, 5000); const response = await fetch(endpoint, { signal: controller.signal }); clearTimeout(timeout); const latencyMs = Math.round(performance.now() - start); results.push({ endpoint, alive: response.ok, latencyMs, lastChecked: new Date() }); } catch { const latencyMs = Math.round(performance.now() - start); results.push({ endpoint, alive: false, latencyMs, lastChecked: new Date() }); } } return results; } export function generateHealthProbes(repoPath: string, config: HealthCheckConfig): string { const checks = generateHealthChecks(repoPath, defaultContext, config); return generateKubernetesProbeYaml(checks, config.serviceName, config.port); } export function getExistingHealthChecks(repoPath: string): HealthCheck[] { return identifyHealthChecks(repoPath, defaultContext); } export function getProbeEndpoint(): string { return "/api/health"; }

import { type NextRequest, NextResponse } from "next/server"; import { createRunbookEngine } from "../../../src/lib/runbook-engine.js"; import { createSlackNotifier } from "../../../src/lib/notify.js"; import { info, error } from "../../../src/lib/observability.js"; type FailureBody = { agentId: string; failureType: string; failureDetails: Record<string, unknown>; timestamp: string; }; function isFailureBody(body: unknown): body is FailureBody { if (!body || typeof body !== "object") return false; const b = body as Record<string, unknown>; return typeof b.agentId === "string" && typeof b.failureType === "string" && typeof b.failureDetails === "object" && b.failureDetails !== null && typeof b.timestamp === "string"; } export async function POST(req: NextRequest) { try { const body: unknown = await req.json(); if (!isFailureBody(body)) { return NextResponse.json( { error: "Missing required fields: agentId, failureType, failureDetails, timestamp" }, { status: 400 }, ); } const context = { agentId: body.agentId, failureType: body.failureType, failureDetails: body.failureDetails, timestamp: new Date(body.timestamp) }; const anthropicApiKey = process.env.ANTHROPIC_API_KEY; const slackToken = process.env.SLACK_TOKEN; const slackChannelId = process.env.SLACK_CHANNEL_ID; if (!anthropicApiKey) { return NextResponse.json({ error: "ANTHROPIC_API_KEY is not configured" }, { status: 500 }); } const runbookEngine = createRunbookEngine({ anthropicApiKey, repoPath: process.cwd(), }); const result = await runbookEngine.handleFailure(context); if (result.requiresHuman && slackToken && slackChannelId) { const slackNotifier = createSlackNotifier(slackToken, slackChannelId); await slackNotifier.sendAlert({ title: context.failureType, description: `Runbook failed for agent ${context.agentId}`, severity: "critical", runbookId: result.runbookId, markdown: result.markdown, }); } if (result.success) { info("Runbook executed successfully", { runbookId: result.runbookId }); } return NextResponse.json({ runbookId: result.runbookId, success: result.success, summary: result.markdown.substring(0, 200), requiresHuman: result.requiresHuman, completenessScore: result.completenessScore, }); } catch (e) { const message = e instanceof Error ? e.message : String(e); error("Trigger handler failed", { message }); return NextResponse.json({ error: message }, { status: 500 }); } } export function GET() { return NextResponse.json({ status: "ok", service: "anthropic-field-service-runbook" }); }

File	Tests
`tests/observability.test.ts`	Init order, span lifecycle (5 tests), tracking methods, re-exports
`tests/health-check.test.ts`	Probe success/failure/empty/mixed, health check generation, Kubernetes YAML
`tests/runbook-engine.test.ts`	Engine config boundary, full pipeline, empty modes, error path, completeness scoring
`tests/notify.test.ts`	Message formatting, runbook ID trailer, PlatformError handling, generic error, empty channel
`tests/trigger.test.ts`	Valid POST, missing fields, empty body, handler throw, requiresHuman alert, GET health
`tests/index.test.ts`	Every export is a function
`tests/integration.test.ts`	End-to-end MSW-mocked Anthropic flow, 500 from API, logger verification
`tests/instrumentation.test.ts`	register() calls initializeObservability

Anthropic AI Runbook Automation for SMB Field Service Dispatching

The problem

Built from

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Example artifact

Comments

Intro

Prerequisites

Step 1: Scaffold the project and install dependencies

Step 2: Configure your environment

Step 3: Set up the observability layer

Step 4: Build the health check service

Step 5: Create the runbook engine

Step 6: Wire up Slack notifications

Step 7: Create the webhook trigger route

Step 8: Set up instrumentation for startup init

Step 9: Export the public API and create a landing page

Step 10: Write the test suite

Step 11: Type-check, lint, and preflight

Step 12: Try the recipe

Next steps