Skip to content
#

language-identifier

Here are 11 public repositories matching this topic...

An NLP project leveraging character trigrams and smoothing techniques (Lidstone, Linear Discounting, Absolute Discounting) for language identification. Trained on for Spanish, Italian, English, French, Dutch, and German, achieving 99.8932% accuracy. Includes datasets, model parameters, and comprehensive documentation.

  • Updated Mar 4, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the language-identifier topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the language-identifier topic, visit your repo's landing page and select "manage topics."

Learn more