Skip to content

Latest commit

 

History

History
58 lines (41 loc) · 1.99 KB

tools-resources.md

File metadata and controls

58 lines (41 loc) · 1.99 KB

Text Transformations Tools

Please put in pull requests to add resources and links

RegEx

A syntax for search queries that match string patterns. Essentially a sophisticated find method for a document or text set.

Python

A high-level, multipurpose programming language useful for working with plain text and data. Used not only by DH practitioners, but by Google, NASA, and the scientific community.

Natural Language Toolkit (NLTK)

A Python library for working with natural language. NLTK can tokenize strings (create a list of words from a set of characters), idenfity parts of sppech, and perform operations based on a word's context.



MALLET
Unix/Linux
R
XLST
TEI
Git
Gephi
D3
TextWrangler
BBEdit
Excel
OpenRefine
Voyant Tools
R Studio
DH Box

Resources
The Programming Historian http://programminghistorian.org/

Workshops
http://gcdi.commons.gc.cuny.edu/events/