reaatech/classifier-evals
These packages provide a comprehensive evaluation harness for testing and monitoring intent classification systems. They allow you to automate dataset validation, calculate classification metrics, run LLM-as-judge assessments, and enforce regression quality gates within CI/CD pipelines. The suite is built around a shared set of Zod schemas and TypeScript types, ensuring consistent data structures across the entire evaluation lifecycle from CLI execution to observability exports.
Packages
8 packages
@reaatech/classifier-evals
- status
- published
- published
- 7 days ago
@reaatech/classifier-evals-cli
- status
- published
- published
- 7 days ago
@reaatech/classifier-evals-dataset
- status
- published
- published
- 7 days ago
@reaatech/classifier-evals-exporters
- status
- published
- published
- 7 days ago
@reaatech/classifier-evals-gates
- status
- published
- published
- 7 days ago
@reaatech/classifier-evals-judge
- status
- published
- published
- 7 days ago
@reaatech/classifier-evals-mcp-server
- status
- published
- published
- 7 days ago
@reaatech/classifier-evals-metrics
- status
- published
- published
- 7 days ago
Comments
Sign in with GitHub to comment and vote.
