User Guide Overview

This guide provides detailed information about using the SWE-bench CLI. Each command is documented with examples and common use cases.

Available Commands

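SWE-bench has different subsets and splits available. The examples below use the swe-bench-m and swe-bench_lite subsets with the dev split; in each command the subset comes first, followed by the split.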

Basic Evaluation:

sb-cli submit swe-bench-m dev --predictions_path preds.json --run_id my_run
sb-cli get-report swe-bench-m dev my_run
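
The two commands above can be chained so that the report is only requested when the submission succeeds. The following is a minimal sketch, not part of sb-cli itself: the evaluate function and the SB_CLI variable are hypothetical names introduced here for illustration, and the sketch assumes sb-cli exits with a non-zero status when a submission fails.

```shell
#!/bin/sh
# Hypothetical wrapper around the Basic Evaluation workflow shown above.
# SB_CLI is an illustrative override (not an sb-cli feature): set it to
# `echo` to print the commands instead of running them.
SB_CLI="${SB_CLI:-sb-cli}"

evaluate() {
    subset="$1"   # e.g. swe-bench-m
    split="$2"    # e.g. dev
    preds="$3"    # path to the predictions JSON file
    run_id="$4"   # identifier chosen for this run

    # Submit first; skip the report if the submission command fails
    # (assumes a non-zero exit status signals failure).
    $SB_CLI submit "$subset" "$split" --predictions_path "$preds" --run_id "$run_id" || return 1

    # Fetch the report for the run that was just submitted.
    $SB_CLI get-report "$subset" "$split" "$run_id"
}
```

After sourcing the sketch, calling evaluate swe-bench-m dev preds.json my_run runs both steps in order. Setting SB_CLI=echo beforehand gives a dry run that only prints the two commands.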

Development Testing:

sb-cli submit swe-bench_lite dev --predictions_path test.json --run_id test_run

Managing Runs:

sb-cli list-runs swe-bench-m dev
sb-cli delete-run swe-bench-m dev old_run_id