UPC-NLP-Neural-NERC

The goal of this project is to build and analyze a language classifying pipeline in order to gain insight and familiarity to typical Natural Language Processing (NLP) tools and strategies.

HOW TO RUN

In CMD move to this_dir/source
For the best performing model run: python langdetect.py -i "..\data\dataset.csv" -v 1000 -a "word"

Note:

The -v (vocabulary size) is a modifiable parameter and -a can be set to 'word' or 'char' granularity
You need Python3 with a selection of packages like ntlk, Sklearn and pandas. Look at the error code given by the cmd to know what to install using pip.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
source		source
README.md		README.md
report.pdf		report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

source

source

README.md

README.md

report.pdf

report.pdf

Repository files navigation

UPC-NLP-Neural-NERC

HOW TO RUN

About

Releases

Packages

Languages

LouisVanLangendonck/UPC-MUD-LanguageDetection

Folders and files

Latest commit

History

Repository files navigation

UPC-NLP-Neural-NERC

HOW TO RUN

About

Topics

Resources

Stars

Watchers

Forks

Languages