Skip to content

Relicensing Combined Datasets (ReCoDa)

License

Notifications You must be signed in to change notification settings

dice-group/ReCoDa

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Relicensing Combined Datasets (ReCoDa)

Repository

This repository contains code to generate a list of compatible licenses based on multiple input licenses to be checked. The following main steps are executed:

  • Creation of a KnowledgeBase which contains the licenses and attributes.
  • License Attribute values are mapped according to their type.
  • The Operator computes composite attributes of all input licenses.
  • Finally, the BackMapping creates a list of compatible licenses.

Knowledge Bases (Knowledge Graphs):

  • Knowledge Bases are available in the resource directory knowledge-bases as turtle/ccREL files.
  • Knowledge Base files can be imported using the KnowledgeBases class.
  • To add additional Knowledge Bases, the AttributeFactory can be utilized and afterwards the Execution methods help to run an experiment.

Download

Downloads are available at the releases page.

Evaluation and experiments

For the evaluation of the approach, experiments based on two license datasets are provided: Creative Commons and the European Data Portal (EDP) License Compatibility Matrix.

Creative Commons experiments

To run the evaluations based on Creative Commons, you first have to download the underlying dataset. Therefore, download or clone the cc.licenserdf repository. For the ReCoDa article, a cc.licenserdf copy was used. Afterwards, the directory of the repository can be set by the system property cc.licenserdf. Example commands to run the experiments are listed below.

Creative Commons License Compatibility Chart

java -Dcc.licenserdf=cc.licenserdf-master/cc/licenserdf/licenses/ -jar ReCoDa.jar cc1

This will run the CcExperiment.

Creative Commons cc.licenserdf with two input licenses

java -Dcc.licenserdf=cc.licenserdf-master/cc/licenserdf/licenses/ -jar ReCoDa.jar cc2

This will run the CcExperimentTuples.

Creative Commons cc.licenserdf with three input licenses

java -Dcc.licenserdf=cc.licenserdf-master/cc/licenserdf/licenses/ -jar ReCoDa.jar cc3

This will run the CcExperimentTriples.

European Data Portal experiments

To run the evaluation of the European Data Portal (EDP) License Compatibility Matrix, run the following maven command:

mvn clean test -Dtest=EdpLcmEvaluationTest -Drun.edp.lcm.tests=true

This will run the additional EdpLcmEvaluationTest.

Credits

Data Science Group (DICE) at Paderborn University

This work has been supported by the German Federal Ministry of Transport and Digital Infrastructure (BMVI) within the project OPAL under the grant no. 19F2028A and by the EU H2020 Marie Skłodowska-Curie project KnowGraphs under the grant agreement no 860801.