Skip to content
reaatech

prompt-injection-bench · packages

Every package shipped from reaatech/prompt-injection-bench, published or pending.

9 packages

@reaatech/pi-bench-adapters

v1.0.1
A collection of pluggable defense adapters for prompt injection detection, providing a standard `DefenseAdapter` interface with `detect()` and `sanitize()` methods across 8 built-in implementations (Rebuff, Lakera Guard, LLM Guard, Garak, OpenAI/Azure/Anthropic/Cohere Moderation, and Custom HTTP). Each adapter extends `BaseAdapter` which handles input validation, SSRF protection, rate limiting, and injection pattern detection.
status
published
published
26 days ago

@reaatech/pi-bench-core

v1.0.1
Canonical TypeScript types, Zod schemas, and attack taxonomy for the prompt-injection-bench benchmarking suite. Exports 18 domain types, 16 Zod schemas for runtime validation, and an 8-category attack taxonomy with weights and severity levels, with zero runtime dependencies beyond Zod.
status
published
published
26 days ago

@reaatech/pi-bench-corpus

v1.0.1
A corpus builder, validator, and variant generation engine for prompt-injection-bench that produces balanced injection samples from YAML templates using synonym replacement, Unicode homoglyphs, and other obfuscation strategies. Exports factory functions (`createCorpusBuilder`, `createCorpusValidator`) and a `generateDefaultCorpus` function.
status
published
published
26 days ago

@reaatech/pi-bench-leaderboard

v1.0.1
A leaderboard manager for prompt-injection-bench that ranks defenses by composite score into S/A/B/C/D tiers, with JSON file persistence and pairwise comparison. Exports a `createLeaderboardManager` factory function and standalone `loadLeaderboardEntries`/`saveLeaderboardEntries` utilities.
status
published
published
26 days ago

@reaatech/pi-bench-mcp-server

v1.0.1
An MCP server that exposes four tools (`run_benchmark`, `compare_defenses`, `generate_report`, `submit_results`) for running prompt injection benchmarks against defenses, comparing results, and generating reports. It provides a `createMCPServer` factory function and a `normalizeReportData` utility for standardizing benchmark results across different formats.
status
published
published
26 days ago

@reaatech/pi-bench-observability

v1.0.1
A structured logging, OpenTelemetry-compatible metrics, and span-based tracing toolkit for benchmark operations, built on Pino v10 and the OpenTelemetry SDK. It provides `createLogger()`, `createMetricsCollector()`, and `createTracingManager()` factory functions, plus global singleton accessors (`getGlobalLogger()`, `getGlobalTracer()`, `getGlobalMetrics()`) for zero-config usage.
status
published
published
26 days ago

@reaatech/pi-bench-runner

v1.0.1
A benchmark execution engine that runs prompt injection attacks against defense adapters in parallel with configurable timeouts, progress reporting, and PII-safe result collection. Exports factory functions (`createBenchmarkEngine`, `createAttackExecutor`, `createDefenseEvaluator`) and requires a `DefenseAdapter` and attack corpus (e.g., from `@reaatech/pi-bench-adapters` and `@reaatech/pi-bench-corpus`).
status
published
published
26 days ago

@reaatech/pi-bench-scoring

v1.0.1
Computes weighted defense scores, confidence intervals, effect sizes, and statistical comparisons (z-test, chi-square, ANOVA) for prompt-injection benchmark results, exporting functions like `calculateDefenseScore`, `compareMetrics`, and `createStatisticalTests`.
status
published
published
26 days ago

prompt-injection-bench

v1.0.1
A CLI and library for benchmarking LLM prompt-injection defenses against standardized attack corpora. Exports a `createBenchmarkEngine` function and CLI with subcommands (`benchmark`, `attack`, `compare`, `corpus`, `leaderboard`, `report`) that work with defense adapters for services like Rebuff, Lakera, and OpenAI Moderation.
status
published
published
26 days ago