Practical 4

Text Processing Pipelines

Overview

In this practical you will be introduced to text data and the development of text processing pipelines. You will build a text classifer.

What is in this Practical Session

Text Features
Text Classification
Exercises

It is suggested to read the notebooks in the above order. You can also try the Exercises while you read through the notebooks

Set up your notebook

Binder

Open up this repository in binder to get started.

Run Locally

If you want to run this locally, download files from this repository and extract them. Follow instructions from here to create an Anaconda virtual environment. After activating this environment make sure you install 'numpy', 'pandas', 'scikit-learn', and 'matplotlib' (as required by environment.yml) as well as jupyter notebook. Remember, you need to install these within the environment so make sure you have run 'conda activate environment_name'. You should then be able to open the notebooks on your computer.

If you have any questions, my email is daniel.organisciak@northumbria.ac.uk

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Practical 4

Overview

What is in this Practical Session

Set up your notebook

Binder

Run Locally

Files

README.md

Latest commit

History

README.md

File metadata and controls

Practical 4

Overview

What is in this Practical Session

Set up your notebook

Binder

Run Locally