Failure-Inducing Test Cases
A dataset of failure-inducing test cases found by Differential Prompting and baselines.
Download 1200 failure-inducing test cases found from buggy program (RQ1)
Download 1200 failure-inducing test cases found from correct program (RQ1)
Download 210 failure-inducing test cases found from buggy program (RQ4)
Download 210 failure-inducing test cases found from correct program (RQ4)
Intentions
A dataset of 470 intentions inferred by Differential Prompting.
Download 400 intentions used in RQ2
Download 70 intentions used in RQ4
Reference Versions
A dataset of 940 reference versions inferred by Differential Prompting and baselines.
Download 800 reference versions used in RQ3
Download 140 reference versions used in RQ4
Codeforces Programs
A dataset of 14 Codeforces programs studied in RQ4.