sci-ml/evalplus
Rigorous evaluation of LLM-synthesised code (HumanEval+, MBPP+)
USE Flags
perf
* This flag is undocumented *
python_targets_python3_12
* This flag is undocumented *
python_targets_python3_13
* This flag is undocumented *
python_targets_python3_14
* This flag is undocumented *


View
Download
Browse