UtrechtUniversity/workshop-r-for-humanities

workshop-r-for-humanities

Workshop Description

R is a powerful scripting language for data handling, data visualization, and statistics. In this workshop, we aim to give you the tools to start exploring R and all it has to offer by yourself. We focus on learning the basics of R and applying your newfound knowledge and skills to texts. The workshop specifically targets researchers working within the Humanities and focuses on the analysis of textual data (as found in poems and novels, for example). However, it is open to and useful for all UU employees who want to perform computational textual analyses.

The course will take you from the very basics of R syntax to data handling and visualisation using a set of tools known as the ‘tidyverse’. You will learn how to perform text mining using the ‘tidytext’ package, enabling you to handle your own data in a clean, robust, and reproducible manner. We will work in RStudio and introduce both R and R Markdown. R Markdown is a great way to combine code and its output with text, allowing you to code in a narrative and intuitive way. It also produces a human-readable document with which you can easily share and showcase your work.
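To give a flavour of what handling text with ‘tidytext’ can look like, here is a minimal sketch. It assumes the `dplyr` and `tidytext` packages are installed; the two lines of verse and the names `poems` and `tidy_poems` are made up for illustration and are not part of the workshop materials.

```r
library(dplyr)
library(tidytext)

# Two made-up lines standing in for a real text (illustrative only)
poems <- tibble(
  line = 1:2,
  text = c("The rose is red, the violet blue",
           "The honey is sweet, and so are you")
)

# Split the text into one lowercased word per row
tidy_poems <- poems %>%
  unnest_tokens(word, text)

# Count how often each word occurs, most frequent first
word_counts <- tidy_poems %>%
  count(word, sort = TRUE)

word_counts
```

Each row of `tidy_poems` holds a single word together with the line it came from, so the usual tidyverse verbs (`count()`, `filter()`, `group_by()`, and so on) can be applied to it directly.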

At the end of the course you will be able to:

  • read and write lines of R code (even if you do not understand every function, you will know how to look it up);
  • understand what ‘tidy’ text data is, how to generate it, and work with it;
  • perform basic text mining and sentiment analysis, identify important terms in a text or set of texts, and examine relationships between words;
  • use RStudio to write an R script and an R Markdown document.
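The text-mining goals listed above can be sketched in a few lines of R. This is only an illustration under stated assumptions, not the workshop's own exercise code: the two tiny documents and all object names (`docs`, `important_terms`, `sentiment`, `bigrams`) are invented here, tf-idf is used as one common way to score how important a term is, and word pairs (bigrams) stand in for "relationships between words".

```r
library(dplyr)
library(tidytext)

# Two tiny made-up documents (illustrative only)
docs <- tibble(
  doc  = c("a", "b"),
  text = c("happy happy day", "sad rainy day")
)

# Word counts per document
word_counts <- docs %>%
  unnest_tokens(word, text) %>%
  count(doc, word)

# tf-idf: high for words that are frequent in one document
# but rare across the collection
important_terms <- word_counts %>%
  bind_tf_idf(word, doc, n) %>%
  arrange(desc(tf_idf))

# Basic sentiment analysis: keep the words that appear in the
# 'bing' lexicon shipped with tidytext, with their label
sentiment <- word_counts %>%
  inner_join(get_sentiments("bing"), by = "word")

# Relationships between words: pairs of adjacent words (bigrams)
bigrams <- docs %>%
  unnest_tokens(bigram, text, token = "ngrams", n = 2)
```

In this toy example, ‘day’ occurs in both documents and therefore gets a tf-idf of zero, whereas ‘happy’ occurs only in document `a` and ranks highest.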

Schedule

Time Activity
9:00 Walk-in, tech support
9:30 Introductions
10:00 Short intro to the course (on text mining & the tidyverse)
10:15 Base R: Exercises 1-3
11:15 Recap & Questions
11:30 Coffee break
11:45 Base R: Exercises 4-6
12:45 Recap & Questions
13:00 Lunch break
13:45 The Tidy Text Format & Sentiment Analysis: Exercises 7-9
14:45 Recap & Questions
15:00 Coffee break
15:15 Analyzing word and document frequency & Relationships between words: Exercises 10-12
16:15 Recap & Questions
16:30 If needed, extra time for Recap & Questions + individual assistance
17:00 Goodbye