Using Large Language Models for Student-Code Guided Test Case Generation in Computer Science Education

This repository contains the code for the paper "Using Large Language Models for Student-Code Guided Test Case Generation in Computer Science Education" by Nischal Ashok Kumar and Andrew Lan, published as part of the AI4ED workshop held with AAAI-2024.

We ground our test case generation approach on the CSEDM Challenge Dataset.

If you find our code or paper useful, please consider citing:

TODO: Add citation

Installation

To run the code in the repository you will need to install Java.

To install the Python libraries with conda, run the following command:

conda env create -f environment.yml
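Since the pipeline depends on Java being installed, it can help to confirm that the Java tools are reachable before running anything. The snippet below is a minimal sketch, not part of the repository: `missing_tools` is a hypothetical helper, and checking for both `java` and `javac` is an assumption about what the scripts need.

```python
import shutil


def missing_tools(tools):
    """Return the subset of `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


# java runs compiled classes and javac compiles sources. Whether the
# pipeline needs both is an assumption; trim the list if it does not.
missing = missing_tools(["java", "javac"])
if missing:
    print("Not found on PATH:", ", ".join(missing))
else:
    print("Java toolchain found.")
```

Run it inside the conda environment created above, so the check sees the same PATH the scripts will use.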

Test Case Generation

Navigate to the gt_test_case_generation directory and run the scripts:

cd gt_test_case_generation
python choose_data.py
python test_case_generation_loop.py

These scripts produce two output directories:

llm_output - Directory containing the LLM output (a bug explanation and the corresponding test cases) for the selected student code, for every refinement iteration of every problem.
compiler_code - Directory containing the automatically constructed Java code that runs the generated test cases against the original buggy student code, for every problem.
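To give a rough idea of what "automatically constructed Java code" for running test cases can look like, here is an illustrative sketch. It is not the repository's implementation: `build_harness`, the `Main` class name, and the example method `sortaSum` are all hypothetical, and the sketch assumes the student submission is a single non-static Java method.

```python
from typing import Sequence


def build_harness(student_method: str, calls: Sequence[str], class_name: str = "Main") -> str:
    """Wrap a student's Java method in a class whose main() prints one result per call."""
    # Indent the student's method so it sits inside the class body.
    method = "\n".join("    " + line for line in student_method.strip().splitlines())
    # One println per test call, invoked on an instance because the
    # student's method is assumed to be non-static.
    prints = "\n".join(f"        System.out.println(obj.{call});" for call in calls)
    return (
        f"public class {class_name} {{\n"
        f"{method}\n\n"
        f"    public static void main(String[] args) {{\n"
        f"        {class_name} obj = new {class_name}();\n"
        f"{prints}\n"
        f"    }}\n"
        f"}}\n"
    )


# Example usage with a made-up student method:
# student = "public int sortaSum(int a, int b) {\n    return a + b;\n}"
# print(build_harness(student, ["sortaSum(3, 4)", "sortaSum(9, 4)"]))
```

Compiling and running the generated class then yields one output line per test input, which can be compared against the expected outputs proposed by the LLM.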

Evaluation

Navigate to the gt_test_case_generation directory and run the scripts:

cd gt_test_case_generation
python evaluate_and_select.py 
python adjust_group_scores.py
python average_grp_wise_scores.py
python plot_micro_stats.py

These scripts produce two outputs:

full_evaluation_5 - Contains the error for every problem of every assignment, across all students.
plots - Contains the plots used in our paper.
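The evaluation commands above can also be chained from a single Python entry point. This is a minimal sketch under one assumption: that the scripts are meant to run in the listed order, each building on the outputs of the previous one. `run_in_order` is a hypothetical helper, not something the repository provides.

```python
import subprocess
import sys

# Script names as listed in this README, in the order shown.
EVALUATION_SCRIPTS = [
    "evaluate_and_select.py",
    "adjust_group_scores.py",
    "average_grp_wise_scores.py",
    "plot_micro_stats.py",
]


def run_in_order(scripts, workdir):
    """Run each script with the current interpreter; stop at the first failure."""
    completed = []
    for script in scripts:
        # check=True raises CalledProcessError on a non-zero exit code,
        # so later scripts never run on incomplete outputs.
        subprocess.run([sys.executable, script], cwd=workdir, check=True)
        completed.append(script)
    return completed


# Example usage from the repository root:
# run_in_order(EVALUATION_SCRIPTS, "gt_test_case_generation")
```

Using `sys.executable` keeps every script on the interpreter of the active conda environment.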
