Skip to content

data-lessons/dh-openrefine

Repository files navigation

Library Carpentry

The Library Carpentry module 'OpenRefine Lessons for Digital Humanities' is maintained by Juliane Schneider.

Background

Library Carpentry is a software skills training programme aimed at library and information professions. It builds on the work of Software Carpentry and Data Carpentry.

These materials are based on the Library Carpentry materials initially developed and taught by James Baker, Owen Stephens and Daniel van Strien in 2015. http://librarycarpentry.github.io/outline/. The original Library Carpentry contained four modules:

  • Introduction, including jargon busting, data structures and regular expressions
  • The Unix shell, including use of the command line and commands such asgrep and sed to find data within files
  • Git and version control
  • Using OpenRefine for data clean up.

On 2-3 June 2016 during the Mozilla Science Lab Global Sprint these existing four repositories were forked (links below) and re-developed within individual repositories, with the addition of a fifth module that introduces SQL, a topic which many librarians have expressed interest in learning.

The Library Carpentry sprint was co-ordinated by Belinda Weaver and attracted global participation.

Contribution

There are many ways of contributing to Library Carpentry:

Code of Conduct

All participants should agree to abide by the Software Carpentry Code of Conduct.

Authors

Library Carpentry is authored and maintained by the community.

Citation

Please cite as:

Library Carpentry. OpenRefine Lessons for Digital Humanities. July 2016, https://data-lessons.github.io/dh-openrefine/.

License

All the lessons are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Exceptions: embeds to and from external sources, and direct quotations from speakers.