Submit Command
The submit
command uploads your model's predictions for evaluation.
Usage
Arguments
subset
: Dataset subset (swe-bench-m
orswe-bench_lite
)split
: Dataset split (dev
ortest
)
Options
--predictions_path
: Path to your predictions file (required)--run_id
: Unique identifier for this submission. You can use the values PARENT or STEM to use the parent directory name or the stem of the predictions file name. (default: PARENT)--instance_ids
: Comma-separated list of specific instances to submit--output_dir
: Directory to save report files (default: sb-cli-reports)--overwrite
: Overwrite existing report (0/1, default: 0)--gen_report
: Generate report after completion (0/1, default: 1)--verify_submission
: Verify submission before waiting (0/1, default: 1)--wait_for_evaluation
: Wait for evaluation to complete (0/1, default: 1)
Predictions File Format
Your predictions file should be a JSON file in one of these formats:
Dictionary Format
{
"instance_id_1": {
"model_patch": "...",
"model_name_or_path": "..."
},
"instance_id_2": {
"model_patch": "...",
"model_name_or_path": "..."
}
}
List Format
[
{
"instance_id": "instance_id_1",
"model_patch": "...",
"model_name_or_path": "..."
},
{
"instance_id": "instance_id_2",
"model_patch": "...",
"model_name_or_path": "..."
}
]
Examples
-
Basic submission:
-
Custom run ID and output directory:
-
Submit specific instances: