Files
CGA-bench/analysis/paper_runs/paper_fsm/task_summary.csv
2026-05-22 10:02:42 +08:00

754 B

1modelconditiontask_idn_runscoverage_meancoverage_stdcoverage_bestsemantic_coverage_meansemantic_coverage_stdsemantic_coverage_besteval2_meaneval2_stdeval2_besttime_mean_sectime_std_sectoken_cost_meantoken_cost_stdfirst_improvement_iter_mean
2qwen-maxbaseline2012_q2fsm191.666666666666660.091.6666666666666661.170.061.170.70.00.7348.970.00.626820.0
3qwen-maxbaseline2013_q2afsm192.30769230769230.092.307692307692373.510.073.510.0519.090.00.78504000000000010.0
4qwen-maxcga2012_q2fsm191.666666666666660.091.6666666666666661.170.061.170.30.00.3489.720.00.72135999999999990.0
5qwen-maxcga2013_q2afsm192.30769230769230.092.307692307692373.510.073.510.01803.020.02.661420.0