Language_Predictor_ML_NLP

huggingface

Check out the hosted website here👉.

rander server

Check out the hosted website here👉

#To run code: streamlit run app.py

TfidfVectorizer

The TfidfVectorizer method helps us to achieve this by generating a numerical representation of each text document based on the frequency of each term and how often it appears in each document compared to its frequency in the entire corpus.

The ngram_range parameter in TfidfVectorizer specifies the range of n-grams to be considered. An n-gram is a contiguous sequence of n items from a given sample of text or speech. By default, TfidfVectorizer uses a unigram approach, but specifying ngram_range=(1,2) means that both unigrams and bigrams will be considered.

The analyzer parameter in TfidfVectorizer specifies the type of analysis to be performed. By setting analyzer='char', the vectorizer will generate character-level n-grams instead of word-level n-grams.

Using TfidfVectorizer from the feature_extraction.text module in the scikit-learn library, we can generate numerical representations of text data based on term frequency and inverse document frequency. By specifying ngram_range=(1,2) and analyzer='char', we can consider both unigrams and bigrams at the character level.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Language Detection.csv		Language Detection.csv
NLP.png		NLP.png
README.md		README.md
Untitled.ipynb		Untitled.ipynb
app.py		app.py
model.pckl		model.pckl
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Language Detection.csv

Language Detection.csv

NLP.png

NLP.png

README.md

README.md

Untitled.ipynb

Untitled.ipynb

app.py

app.py

model.pckl

model.pckl

requirements.txt

requirements.txt

Repository files navigation

Language_Predictor_ML_NLP

huggingface

rander server

TfidfVectorizer

About

Releases

Packages

Languages

vishal815/Language_Predictor_ML_NLP

Folders and files

Latest commit

History

Repository files navigation

Language_Predictor_ML_NLP

huggingface

rander server

TfidfVectorizer

About

Topics

Resources

Stars

Watchers

Forks

Languages