Skip to content

TuomoNieminen/Helsinki-Open-Data-Science

Repository files navigation

Helsinki Open Data Science

Welcome to the Helsinki Open Data Science repository! This repository includes the codes for both the DataCamp and the presentation slides related to the University of Helsinki course "Introduction to Open Data Science" (IODS), thought by prof. Kimmo Vehkalahti. You can click on the 'Course on DataCamp' -link above to go to the course page.

Both the DataCamp course and the presentation slides are created by Tuomo Nieminen and Emma Kämäräinen.

Course slides

The presentation slides for the IODS course are published in a GitHub web page enabled by this repository. The slides have been created by Tuomo Nieminen and Emma Kämäräinen, using Rpresentation. They can be found in the following link

IODS slides.

The Rpresentation codes are included in the 'docs' folder. The index.html file in the 'docs' folder has been used to enable the GitHub web page. See 'instructions.Rmd' in the 'docs' folder for more information.

DataCamp course creation

Changes made to this GitHub repository are automatically reflected in the linked DataCamp course. This means that you can enjoy all the advantages of version control, collaboration, issue handling ... of GitHub.

Workflow

  1. Edit the markdown and yml files in this repository. You can use GitHub's online editor or use git locally and push your changes.
  2. Check out your build attempts on the Teach Dashboard.
  3. Check out your automatically updated course on DataCamp

Getting Started

A DataCamp course consists of two types of files:

  • course.yml, a YAML-formatted file that's prepopulated with some general course information.
  • chapterX.md, a markdown file with:
    • a YAML header containing chapter information.
    • markdown chunks representing DataCamp Exercises.

To learn more about the structure of a DataCamp course, check out the documentation.

Every DataCamp exercise consists of different parts, read up about them here. A very important part about DataCamp exercises is to provide automated personalized feedback to students. In R, these so-called Submission Correctness Tests (SCTs) are written with the testwhat package. Check out the GitHub repositories' wiki pages for more information and examples.

For more information check out the documentation on teaching at DataCamp.

Datasets

The data found in the 'datasets' folder of this repository are used in the DataCamp exercises. The files with data related filename extensions in the 'datasets' folder are automatically uploaded to amazon S3 servers.

The links to the currently used data files can be seen from the chapterx.Rmd files. The links to new files can be seen from the course build log under datacamp.com/teach. There is also information about uploading assets in the DataCamp teach documentation

adding a line adding a line