Run core evaluation locally
Upload CSV data, map your columns, and inspect accuracy, precision, recall, MCC, ROC AUC, PR AUC, calibration, and more without shipping your dataset to a backend.
Upload a CSV with actual labels and either probabilities or predicted labels. Inspect 12+ metrics, ROC/PR, calibration, threshold tuning, and paid HTML report export without building a full SaaS backend.
EvalBench is a lightweight evaluator for binary classifiers. It stays honest about scope and keeps the core analysis in the browser.
Upload CSV data, map your columns, and inspect accuracy, precision, recall, MCC, ROC AUC, PR AUC, calibration, and more without shipping your dataset to a backend.
Use the threshold slider for live feedback, then unlock the Pro threshold sweep table when you need a stronger operational recommendation.
Pro exports a standalone HTML report and cohort audit so the output is more than a screenshot. It becomes something you can send, archive, or review.
No hidden SaaS backend. The site is static-plus-serverless: browser-side evaluation, Cloudflare Pages Functions for entitlement checks, and Lemon Squeezy for checkout and license keys.
Upload a CSV, paste rows, or start with the bundled demo dataset.
Review metrics, charts, calibration, and the confusion matrix in the browser.
Use hosted checkout, then return through the unlock flow to enable report export, threshold sweep, and cohort audit on this browser.
Keep the launch simple. One free tier, one paid tier, no subscription theater.
| Capability | Free | Pro |
|---|---|---|
| CSV upload, paste, and sample dataset | Included | Included |
| Core metrics + confusion matrix | Included | Included |
| ROC / PR / calibration / score distribution | Included | Included |
| Downloadable metrics CSV | Included | Included |
| Standalone HTML report export | — | Included |
| Threshold sweep table | — | Included |
| Cohort audit by segment | — | Included |
| Paid session recovery on the same browser | — | Included |
The honest answers matter more than the shiny ones.
The evaluator processes uploaded CSV data in the browser. Payment, licensing, and basic web hosting still rely on external services.
Pro unlocks the HTML report export, threshold sweep tables, and cohort-audit breakdowns. The free tier still includes the core evaluator.
This version is intentionally scoped to binary classification CSVs. Multiclass and regression are not included in this starter build.
The support page covers receipt recovery, My Orders, re-unlock, and switching browsers.