Skip to content

Comprises a set of techniques to work with documents written in a natural language to achieve many different objectives.

Notifications You must be signed in to change notification settings

flightstar/Natural-Language-Processing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Natural Language Processing

Technique Where to look
Grouping similar words Stemming, Splitting Words, Parsing Documents
Finding words with the same meaning Latent Semantic Analysis
Generating realistic names Splitting Words
Understanding how much time it takes to read a text Reading Time
Understanding how difficult to read is a text Readability of a Text
Identifying the language of a text Identifying a Language
Generating a summary of a text SumBasic (word-based), Graph-based Methods: TextRank (relationship-based), Latent Semantic Analysis (semantic-based)
Finding similar documents Latent Semantic Analysis
Identifying entities (e.g., cities, people) in a text Parsing Documents
Understanding the attitude expressed in a text Parsing Documents
Translating a text Parsing Documents
Chinese, Cantonese (tiếng Quảng), Taiwanese (tiếng Quan Thoại, phương ngữ Trung Hoa), Japanese, Korean, Arabic (tiếng Ả Rập) Parsing the languages, translate the languages

Classifying Words

Phân loại các pattern, compression của các từ theo một nhóm hay phân loại nào đó được dùng rất nhiều trong ngành khoa học công nghệ như template engine, framework, recommendation system, speech engine, parser in compiler, search engine, social website feeds, spam filter, translation,...

Grouping Similar Words

Stemming

Splitting Words

Classifying Documents

Text Metrics

Reading Time

Calculating the Readability of a Text

Identifying a Language

Understanding Documents

Generation of Summaries

SumBasic

Graph-based Methods: TextRank

Latent Semantic Analysis

Other Methods and Libraries

Other Uses

Parsing Documents

You Need Data

The Things You Can Do

The Libraries You Can Use

References

About

Comprises a set of techniques to work with documents written in a natural language to achieve many different objectives.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages