CLI
Command-line interface for Benchwise.
Installation
How to install the Benchwise CLI.
The CLI is included when you install Benchwise:
pip install benchwise
Available Commands
A list of commands available through the Benchwise CLI.
List Models
benchwise list models
List Metrics
benchwise list metrics
Run Evaluation
benchwise eval gpt-4 claude-3-opus --dataset data.json --metrics accuracy rouge_l
Validate Dataset
benchwise validate dataset.json
Compare Results
benchwise compare results1.json results2.json --metric accuracy
Examples
Practical examples of using Benchwise CLI commands.
# Run QA evaluation
benchwise eval gpt-4 --dataset qa.json --metrics accuracy
# Compare multiple result files
benchwise compare run1.json run2.json run3.json
# Validate dataset format
benchwise validate my_dataset.json