Data used in "Recovering Concept Prerequisite Relations from University Course Dependencies" (Liang et al. 2017)

harrylclc/eaai17-cpr-recover

CPR-Recover data

Release 1, 2017-01-10

This is the data used in "Recovering Concept Prerequisite Relations from University Course Dependencies" (Liang et al., 2017).

Data Description

  • cs_courses.csv: CS-related course information collected from 11 U.S. universities (Carnegie Mellon University, Stanford University, the Massachusetts Institute of Technology, Princeton University, the California Institute of Technology, Purdue University, University of Maryland, Michigan State University, Pennsylvania State University, University of Illinois, and University of Iowa). Each line is formatted as "<Course_id>,<Course_description>". Note that the course title is located at the beginning of each description.

  • cs_edges.csv: Course prerequisite information. Each line "<course_1>,<course_2>" indicates that <course_2> is a prerequisite for <course_1>.

  • cs_annotations.tsv: Annotation results for candidate concept pairs generated from the CS courses above. Please refer to the "Data Labeling" section for more details. Each line is formatted as "<Concept_A>,<Concept_B>,<Annotator_1>...<Annotator_13>". Each pair is labeled by three different annotators. Valid labels are: 1 = B is a prerequisite of A; 2 = A is a prerequisite of B; 3 = there is no prerequisite relation between A and B.

  • cs_preqs.csv: Concept prerequisite pairs derived from the annotations above by majority vote. Each line "<Concept_A>,<Concept_B>" indicates that B is a prerequisite of A.
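
The majority vote that turns annotation rows into prerequisite pairs can be sketched roughly as follows. This is an illustration only, not the script used to produce cs_preqs.csv. It assumes tab-separated rows (suggested by the .tsv extension), empty cells for annotators who did not label a pair, and that a pair without a strict majority is dropped. The function names and the sample rows are hypothetical.

```python
import csv
import io
from collections import Counter

VALID_LABELS = {"1", "2", "3"}


def majority_label(cells):
    """Return the label held by a strict majority of non-empty cells, or None."""
    votes = [c.strip() for c in cells if c.strip() in VALID_LABELS]
    if not votes:
        return None
    label, count = Counter(votes).most_common(1)[0]
    # Assumption: a label must win more than half of the votes to count.
    return label if count * 2 > len(votes) else None


def prerequisite_pairs(lines, delimiter="\t"):
    """Yield (concept, prerequisite) tuples, matching the cs_preqs.csv column order."""
    for row in csv.reader(lines, delimiter=delimiter):
        if len(row) < 3:
            continue  # skip blank or malformed rows
        concept_a, concept_b, cells = row[0], row[1], row[2:]
        label = majority_label(cells)
        if label == "1":    # B is a prerequisite of A
            yield concept_a, concept_b
        elif label == "2":  # A is a prerequisite of B
            yield concept_b, concept_a
        # label "3" or no majority: no prerequisite pair is emitted


# Hypothetical rows for illustration; these are not taken from the data set.
sample = "\n".join([
    "Quicksort\tRecursion\t1\t\t1\t3",
    "Array\tHash table\t2\t2\t\t2",
    "Compiler\tRouting\t3\t3\t1\t",
    "Graph\tTree\t1\t2\t3\t",
])
pairs = list(prerequisite_pairs(io.StringIO(sample)))
print(pairs)
```

To run it on the real file, pass an open file handle instead of the in-memory sample, and change `delimiter` if the file turns out to be comma-separated.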

Note: As described in the paper, all Wikipedia concepts in this data were extracted with the help of Wikipedia-miner. You can also try other Wikification or entity linking methods to extract Wikipedia concepts from the course descriptions. In that case, our labeled prerequisite pairs may not cover all of your candidate pairs, but we believe the annotations will still cover most of them and can save you considerable time when collecting prerequisite labels.
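
If you extract concepts with a different entity linker, one way to see how many of your candidate pairs are already annotated is a simple set comparison. The sketch below is a hypothetical helper, not part of this release. It treats pairs as unordered, since labels 1 and 2 together cover both directions, and it assumes concept names match exactly between your candidates and the annotation file. The sample pairs are made up.

```python
def split_by_coverage(candidates, annotated):
    """Split candidate pairs into already-annotated and still-unlabeled lists."""
    # frozenset makes (A, B) and (B, A) compare equal.
    annotated_keys = {frozenset(pair) for pair in annotated}
    covered = [p for p in candidates if frozenset(p) in annotated_keys]
    missing = [p for p in candidates if frozenset(p) not in annotated_keys]
    return covered, missing


# Hypothetical pairs for illustration only.
annotated = [("Quicksort", "Recursion"), ("Array", "Hash table")]
candidates = [
    ("Recursion", "Quicksort"),
    ("Hash table", "Array"),
    ("Compiler", "Parsing"),
]

covered, missing = split_by_coverage(candidates, annotated)
print(f"{len(covered)} of {len(candidates)} candidate pairs are already annotated")
print("Still need labels:", missing)
```

Only the pairs in `missing` would then need new prerequisite labels.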

Citation

Please cite the following paper if you use this data.

@inproceedings{liang2017recovering,
  title={Recovering Concept Prerequisite Relations from University Course Dependencies.},
  author={Liang, Chen and Ye, Jianbo and Wu, Zhaohui and Pursel, Bart and Giles, C Lee},
  booktitle={AAAI},
  pages={4786--4791},
  year={2017}
}

If you have any problems, please contact Chen Liang at cul226@ist.psu.edu.

License

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
