Text Processing Pipelines
In this practical you will be introduced to text data and the development of text processing pipelines. You will build a text classifer.
- Text Features
- Text Classification
- Exercises
It is suggested to read the notebooks in the above order. You can also try the Exercises while you read through the notebooks
Open up this repository in binder to get started.
If you want to run this locally, download files from this repository and extract them. Follow instructions from here to create an Anaconda virtual environment. After activating this environment make sure you install 'numpy', 'pandas', 'scikit-learn', and 'matplotlib' (as required by environment.yml) as well as jupyter notebook. Remember, you need to install these within the environment so make sure you have run 'conda activate environment_name'. You should then be able to open the notebooks on your computer.
If you have any questions, my email is daniel.organisciak@northumbria.ac.uk