GitHub - lknelson/DH-Institute-2017: Repository for the text analysis workshop for the DH Summer Institute at UC Berkeley

DHBSI 2017: Computational Text Analysis

14-18 August 2017

Instructors: Laura Nelson and Teddy Roland

Overview

Scholars across multiple disciplines are finding themselves face-to-face with massive amounts of digitized data. In the humanities and many social science disciplines, this data is often in the form of unstructured text. This course will introduce students to cutting edge ways of structuring and analyzing digitized text-as-data, and will do so by exploring questions fundamental to the humanities. The ultimate goal is to encourage students to think about novel ways they can apply these techniques to their own data and research questions, and to provide the skills necessary to apply the methods in their own research. We will use the open source (and free!) programming language Python. We will also provide demonstration corpora.

Topics Covered

Principles of Natural Language Processing
Introduction to Python for NLP
Discriminating Words
Dictionary Methods
Textual Classification
Word Embedding

Requirements

This workshop will be taught in the open source programming language Python. Participants should install Anaconda for Python 3.6 on their laptops prior to the first class.

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
00-Introduction		00-Introduction
01-Intro to NLP		01-Intro to NLP
02-Intro to Python		02-Intro to Python
03-Operationalizing		03-Operationalizing
04-Discriminating-Words		04-Discriminating-Words
05-Dictionary-Method		05-Dictionary-Method
06-Literary Distinction (Probably)		06-Literary Distinction (Probably)
07-Word2Vec		07-Word2Vec
08-Workshop-Overview		08-Workshop-Overview
09-Sandbox		09-Sandbox
A-Syllabus.md		A-Syllabus.md
B-Annotated Bibliography.md		B-Annotated Bibliography.md
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

License

lknelson/DH-Institute-2017

Folders and files

Latest commit

History

Repository files navigation

DHBSI 2017: Computational Text Analysis

14-18 August 2017

Instructors: Laura Nelson and Teddy Roland

Overview

Topics Covered

Requirements

Suggested Reading

About

Resources

License

Stars

Watchers

Forks

Languages