Skip to content

A data science exercise supporting qualitative coding using ML/NLP.

Notifications You must be signed in to change notification settings

juanroesel/avalanche-ds

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

avalanche-ds

This repository contains the code and files related to the Data Science exercise developed as part of the job interview process for Avalanche Insights.

Requirements

  • Interpreter: Python 3.8
  • Have Conda installed locally (see here for installation instructions)
  • Have Jupyter Lab installed locally (see here for installation instructions)

Folder structure

├── README.md
├── artifacts
│   ├── X.joblib
│   ├── lda_model.joblib
│   └── vectorizer.joblib
├── data
│   └── coded_response_dataframe.pkl
├── images
│   └── architecture.png
├── requirements.txt
├── src
│   ├── 1-exploratory_data_analysis.ipynb
│   ├── 2-topic_modeling.ipynb
│   ├── 3-sentiment_analysis.ipynb
│   ├── ml_utils.py
│   ├── nlp_pipeline.py
│   └── utils.py
└── workplan
    └── workplan.md

Instructions

To properly run this notebook, please follow these steps:

  1. Clone this repository and navigate to the root folder. Once there, set up a conda environment named ds-interview
$  conda create -n ds-interview python=3.8
  1. Activate the virtual environment
$ conda activate ds-interview
  1. Install all packages in requirements.txt (make sure you are at the root folder)
$ python3 -m pip install -r requirements.txt
  1. Install the ipykernel module, which provides the IPython kernel for Jupyter
$ python3 -m pip install ipykernel
  1. Add the enviornment ds-interview into Jupyer Notebooks
$ python3 -m ipykernel install --user --name=ds-interview
  1. Start a jupyter lab server and connect to the jupyter notebook instance on your browser
$ jupyter lab
  1. Select the ds-interview kernel at the top right hand corner in the Jupyter Lab interface.

  2. Open the notebook you want to explore by navigating to the src directory.

About

A data science exercise supporting qualitative coding using ML/NLP.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published