Skip to content
reaatechREAATECH

vLLM Agent Eval Harness for Fine-Tuned Model Quality

Automated CI/CD-quality evaluations for locally-hosted fine-tuned LLMs using vLLM with LLM-as-judge and cost tracking.

The problem

SMBs that fine-tune open models locally lack a structured way to verify model quality before production, exposing them to regressions and failed customer interactions.

Example artifact

A complete, working implementation of this recipe — downloadable as a zip or browsable file by file. Generated by our build pipeline; tested with full coverage before publishing.

188 kB·126 tests·100.0% coverage·vitest passing

SHA-256bb63d9654cb6308b6e5cafd3c3cf6e92eca4ee2f1dbc5c608b73eeaa692eda23

Comments

Sign in with GitHub to comment and vote.

Loading comments…