Exploring the history of word usage in English texts with a weighted popularity history plot.
-
Updated
May 22, 2024 - Java
Exploring the history of word usage in English texts with a weighted popularity history plot.
Using k-nearest neighbors, and infinite-lookback ngrams with LLMs
Simple tool for training n-gram language model
Haiku-Search is a high-performance fuzzy search library designed for web applications. It is built using Rust and compiled to WebAssembly
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Character-level n-gram models from scratch
Recovery of ActiveWatch statistical text analysis from 20th Century Java code saved on a CD-ROM disk. This probably should be rewritten, but can now demonstrate AW mapping of dynamic text content and detection of unusual activity.
Model Generator for Firestore
Finding insights on what could be improved at a restaurant based on reviews. Project contains the implementation, dataset and a written report. Methods utilized include LDA, NER, keyword extraction, length analysis, association rules mining, N-gram analysis and more.
To evaluate machine translation, they use several methods, some of which we fully implemented
Error detection tool for finding unlikely word sequences in Estonian texts based on the words' part-of-speech.
Tool for extracting part of speech n-grams (POS-grams) together with their context (preceding/following POS or sentence onset/ending).)
cloze complition using N-grams in python
Uses letter frequency and catboost classifier model in synchronous for guessing letters in hangman game instance. The model performance is evaluated on both seen words in the dictionary and words out of the dictionary.
Implementation & analysis of various NLP techniques in Python: 4 projects on tokenization, text classification, sequence labeling, and more
Uncensor for russian masked or separated obscene words based on frequent letters, bi- and tri-grams analysis
Detect language from a text string in Swift!
Sentiment Analysis of Twitter Data (saotd)
We create n-gram language models that quantify the likelihood of various sound sequences occurring in the English language.
This repository contains Natural Language Processing programs in the Python programming language.
Add a description, image, and links to the n-grams topic page so that developers can more easily learn about it.
To associate your repository with the n-grams topic, visit your repo's landing page and select "manage topics."