Skip to content

This repository consist of projects related to Natural Language Processing using machine learning and deep learning concepts

License

Notifications You must be signed in to change notification settings

BALAJIHARIDASAN/Natural-Language-Processing

Repository files navigation

Natural-Language-Processing ( Conversational AI)

Natural language processing (NLP) refers to the branch of computer science—and more specifically, the branch of artificial intelligence or AI—concerned with giving computers the ability to understand text and spoken words in much the same way human beings can.

image

Natural Language Projects :

  1. Part of speech Tagging (POS)

  2. Spam Filtering

  3. Text Summarization

  4. Speech TO text

  5. sentiment analysis

  6. Information Retrivel

  7. Text / Document Classification

  8. Question Answering

  9. Machine Translation

Text Preprocessing Steps :

  1. Lower casing

  2. Removal of Punctuations

  3. Removal of Stopwords

  4. Removal of Frequent words

  5. Removal of Rare words

  6. Stemming

  7. Lemmatization

  8. Removal of emojis

  9. Removal of emoticons

  10. Conversion of emoticons to words

  11. Conversion of emojis to words

  12. Removal of URLs

  13. Removal of HTML tags

Word-Vectorization :

  1. One hot encoding

  2. BOW(Bag of words)

  3. Tfidf(Term frequency and inverse document frequency)

  4. WordEmbedding

    • Word2Vec :

      • CBOW

      • Skip - Gram

Models:

MACHINE LEARING MODELS:

  1. Naive Bayes
  2. Random Forest
  3. XGBOOST

DEEP LEARNING MODELS :

  1. RNN
  2. LSTM
  3. GRU
  4. Bi- Directional LSTM
  5. Stacked LSTM
  6. Transformers (ATTENTION ALL YOU NEED)
  7. BERT
  8. Al-BERT
  9. distill BERT
  10. GPT
  11. ELMO