- First steps with Python
- Installing and getting started with the toolset
- First steps with text analysis
- Simple explorations of word use
- Sentiment analysis
- Basic tagging
- Counting words
- Representations of co-occurrence
- Collocations and n-grams
- Classification methods
- Decision trees
- Naive Bayes
- Support vector machines
- Neural networks
- Testing frameworks
- A look at some tools with graphical user interfaces
- Assembling raw data
- From the web: web pages, Twitter, etc.
- From PDFs
- Cleaning
- Normalizing
- Tokenizing
- Stemming
- Lemmatizing
- Regular expressions
- About bag-of-words approaches
- Converting text to a vector
- Various representations of vectors
- Use of vector representations to measure text similarity
- Distributed vector representations (word embeddings)
- Text clustering
- Topic modeling
- A week of readings
- Networks