Skip to content

LazyLanger is a tool to extract most common words from a subtitle, and sort them out by number of appearance in a show / movie subtitle. Useful when learning a new language, to get the most used words from a language.

License

Notifications You must be signed in to change notification settings

petar-prog91/LazyLanger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LazyLanger

LazyLanger is a tool to extract most common words from a subtitle, and sort them out by number of appearance in a show / movie subtitle. Useful when learning a new language, to get the most used words from a language.

ToDo

  • Read a text file (subtitle) or multiple text files and merge into one file
  • Remove lines that start with numbers
  • Convert comma, dot, ? and ! into spaces
  • Split all words by space
  • Sort words by number of occurance
  • Return JSON containing list of words and number of occurance
  • Reject single character words <-- I'm too lazy at the moment for this one
  • Create UI, a form with upload field, nothing special
  • Display result in a table after sorting
  • Create CSV file of that table on demand (button to create a file)
  • Make each word start with a capital letter

About

LazyLanger is a tool to extract most common words from a subtitle, and sort them out by number of appearance in a show / movie subtitle. Useful when learning a new language, to get the most used words from a language.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published