Skip to content

Latest commit

 

History

History
22 lines (15 loc) · 1.26 KB

README.md

File metadata and controls

22 lines (15 loc) · 1.26 KB

Practical 4

Text Processing Pipelines

Overview

In this practical you will be introduced to text data and the development of text processing pipelines. You will build a text classifer.

What is in this Practical Session

  1. Text Features
  2. Text Classification
  3. Exercises

It is suggested to read the notebooks in the above order. You can also try the Exercises while you read through the notebooks

Set up your notebook

Binder

Open up this repository in binder to get started.

Run Locally

If you want to run this locally, download files from this repository and extract them. Follow instructions from here to create an Anaconda virtual environment. After activating this environment make sure you install 'numpy', 'pandas', 'scikit-learn', and 'matplotlib' (as required by environment.yml) as well as jupyter notebook. Remember, you need to install these within the environment so make sure you have run 'conda activate environment_name'. You should then be able to open the notebooks on your computer.

If you have any questions, my email is daniel.organisciak@northumbria.ac.uk