# n-grams

Here are 197 public repositories matching this topic...

s-bose7 / ngrams-viewer

Exploring the history of word usage in English texts with a weighted popularity history plot.

n-grams text-corpus popularity-analysis

Updated May 22, 2024
Java

evolvingstuff / kNNGen

Using k-nearest neighbors and infinite-lookback n-grams with LLMs

python deep-learning n-grams k-nearest-neighbours large-language-models genai

Updated May 19, 2024
Python

georgiyozhegov / n_gram

Simple tool for training an n-gram language model

rust n-grams language-model n-gram-language-models

Updated May 16, 2024
Rust

beowolx / haiku-search

Haiku-Search is a high-performance fuzzy search library designed for web applications. It is built in Rust and compiled to WebAssembly.

javascript rust fuzzy-search wasm n-grams bitma

Updated May 14, 2024
Rust

andreekeberg / ml-classify-text-js

Machine learning based text classification in JavaScript using n-grams and cosine similarity

training classifier machine-learning natural-language-processing library sentiment-analysis text-classification n-grams labels similarity artificial-intelligence classification cosine-similarity predictions n-gram text-classifier

Updated Apr 21, 2024
JavaScript

vdyma / ngrams

Character-level n-gram models from scratch

n-grams from-scratch

Updated Apr 4, 2024
Jupyter Notebook

prohippo / ActiveWatch

Recovery of ActiveWatch statistical text analysis from 20th Century Java code saved on a CD-ROM disk. This probably should be rewritten, but can now demonstrate AW mapping of dynamic text content and detection of unusual activity.

nlp text-mining news-aggregator n-grams statistical-analysis lexical-analysis cluster-analysis vector-model signal-to-noise dynamic-datasource content-mapping github-config finite-indexing zipf-s-law indexing-entropy indications-and-warning

Updated Mar 19, 2024
Java

go-generalize / volcago

Model Generator for Firestore

go golang firebase generator n-grams code-generation ngrams firestore firestore-database

Updated May 8, 2024
Go

ariandra34 / Restaurant-Reviews-Mining

Finding insights on what could be improved at a restaurant based on reviews. Project contains the implementation, dataset and a written report. Methods utilized include LDA, NER, keyword extraction, length analysis, association rules mining, N-gram analysis and more.

text-mining n-grams named-entity-recognition latent-dirichlet-allocation wordcloud-visualization term-frequency-inverse-document-frequency

Updated Feb 12, 2024
Jupyter Notebook

parvvaresh / Evaluation-of-machine-translation-by-NLP

Full implementations of several of the methods used to evaluate machine translation.

python jupyter-notebook n-grams edit-distance bleu-score edit-distance-algorithm gleu-score wer-score nist-score chrf-score meteor-score

Updated Feb 5, 2024
Python

tlu-dt-nlp / POSgram-errors

Error detection tool for finding unlikely word sequences in Estonian texts based on the words' part-of-speech.

n-grams language-model estonian-language error-detection part-of-speech-tagging pos-ngram grammatical-error-detection

Updated Feb 5, 2024
Jupyter Notebook

tlu-dt-nlp / POSgram-contexts

Tool for extracting part-of-speech n-grams (POS-grams) together with their context (preceding/following POS or sentence onset/ending).

language-modeling n-grams estonian-language grammar-rules statistical-modeling

Updated Feb 5, 2024
Jupyter Notebook

nirpr / cloze_completion

Cloze completion using n-grams in Python

python nlp data-science n-grams statistical-models

Updated Jan 25, 2024
Python

yoraghav / Automated_Hangman

Uses letter frequency together with a CatBoost classifier model to guess letters in a hangman game instance. Model performance is evaluated both on words seen in the dictionary and on words outside it.

machine-learning hangman-game n-grams words automated-testing hangman-game-api hangman-in-python letters-game catboost-classifier hangman-challenge

Updated Jan 17, 2024
Jupyter Notebook

nataliakoliou / NLP-Various-Implementations

Implementation & analysis of various NLP techniques in Python: 4 projects on tokenization, text classification, sequence labeling, and more

natural-language-processing neural-network text-classification word2vec word-embeddings n-grams spacy lstm nltk rnn dependency-parser language-models sequence-labeling tokenization tf-idf-vectorizer bert-model roberta-model

Updated Jan 10, 2024
Jupyter Notebook

AlexKly / russian_uncensor

Uncensor tool for masked or separated Russian obscene words, based on analysis of frequent letters, bigrams, and trigrams

n-grams uncensor swear-words obscene-words

Updated Jan 9, 2024
Python

Al00X / LanguageDetector

Detect language from a text string in Swift!

nlp swift language natural-language-processing language-detection n-grams

Updated Jan 9, 2024
Swift

evan-l-munson / saotd

Sentiment Analysis of Twitter Data (saotd)

r tweets sentiment-analysis plot tidy-data n-grams latent-dirichlet-allocation twitter-data topicanalysis bing-lexicon

Updated Dec 27, 2023
R

Leen-Alzebdeh / NLP-LMs

We create n-gram language models that quantify the likelihood of various sound sequences occurring in the English language.

nlp n-grams nltk language-model kenlm

Updated Dec 27, 2023
Python

madhurimarawat / Natural-Language-Processing-in-Python

This repository contains Natural Language Processing programs in the Python programming language.

Updated Dec 26, 2023
Jupyter Notebook
