Please put in pull requests to add resources and links
A syntax for search queries that match string patterns. Essentially a sophisticated find method for a document or text set.
- The Programming Historian
- Learn and Test RegEx
- Learn RegEx the Hard Way
- The Bastards Book of Regular Expression
- 30 Minute Regular Expressions Tutorial
A high-level, multipurpose programming language useful for working with plain text and data. Used not only by DH practitioners, but by Google, NASA, and the scientific community.
- Learn Python the Hard Way
- A Beginner's Python Tutorial
- Dive into Python
- CodeAcademy: Python
- Python for you and me
- free-programming-books: Python
A Python library for working with natural language. NLTK can tokenize strings (create a list of words from a set of characters), idenfity parts of sppech, and perform operations based on a word's context.
MALLET
Unix/Linux
R
XLST
TEI
Git
Gephi
D3
TextWrangler
BBEdit
Excel
OpenRefine
Voyant Tools
R Studio
DH Box
Resources
The Programming Historian http://programminghistorian.org/
Workshops
http://gcdi.commons.gc.cuny.edu/events/