Document preprocessing scripts for the Nature of EU Rules project (Python; updated Mar 14, 2024)
Implementations of Natural Language Processing algorithms; the current implementation features sentence completion and knowledge building
Corpus processing library
A homemade sentence tokenizer designed for Project Gutenberg books
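A homemade rule-based sentence tokenizer of this kind can be sketched briefly. The example below is illustrative only, not the repository's actual code: it splits on terminal punctuation followed by a capital letter and skips a placeholder list of abbreviations, a common pattern for Gutenberg-style prose.

```python
import re

# Hypothetical abbreviation list; a real tokenizer would use a fuller one.
ABBREVIATIONS = {"Mr.", "Mrs.", "Dr.", "St."}

def tokenize_sentences(text):
    """Split on ., !, ? followed by whitespace and a capital letter
    (or an opening quote), merging back known abbreviations."""
    candidates = re.split(r"(?<=[.!?])\s+(?=[A-Z\"'])", text)
    sentences, buffer = [], ""
    for chunk in candidates:
        buffer = f"{buffer} {chunk}".strip() if buffer else chunk
        # Keep accumulating if the chunk ends in a known abbreviation.
        if buffer.split()[-1] in ABBREVIATIONS:
            continue
        sentences.append(buffer)
        buffer = ""
    if buffer:
        sentences.append(buffer)
    return sentences

print(tokenize_sentences('Mr. Darcy bowed. "Indeed," she said. He left.'))
# → ['Mr. Darcy bowed.', '"Indeed," she said.', 'He left.']
```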
An application that prepares dirty scraped data for model training without requiring separate preprocessing steps.
My legal background gave me a deep appreciation for the importance of language: it is not just words, but meaning woven into every case. That connection led me to programming, where I built a text-processing pipeline with Stanford CoreNLP.
Language processing for better query answering
This repository contains a Python script for calculating the Longest Common Subsequence (LCS) between tokenized Urdu sentences.
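The LCS computation between two token lists can be sketched with the classic dynamic-programming recurrence. The function name and plain-list inputs below are illustrative assumptions, not taken from the repository; real inputs would come from an Urdu tokenizer.

```python
def lcs_length(tokens_a, tokens_b):
    """Classic O(len_a * len_b) dynamic-programming LCS over token lists."""
    rows, cols = len(tokens_a), len(tokens_b)
    # dp[i][j] = LCS length of tokens_a[:i] and tokens_b[:j]
    dp = [[0] * (cols + 1) for _ in range(rows + 1)]
    for i in range(1, rows + 1):
        for j in range(1, cols + 1):
            if tokens_a[i - 1] == tokens_b[j - 1]:
                # Tokens match: extend the diagonal subsequence.
                dp[i][j] = dp[i - 1][j - 1] + 1
            else:
                # Otherwise carry forward the best of dropping one token.
                dp[i][j] = max(dp[i - 1][j], dp[i][j - 1])
    return dp[rows][cols]

print(lcs_length("a b c d".split(), "a c d e".split()))  # → 3
```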
Vietnamese Natural Language Processing
Corpus Processing Library
Corpus processing library
Crawler, Parser, Sentence Tokenizer for online privacy policies. Intended to support ML efforts on policy language and verification.
Some of my Python Projects
Consists of a neural-network-based sentence tokenizer
Kingchop ⚔️ is a JavaScript library for tokenizing (chopping) English text. It uses an extensive rule set for tokenization, and the rules are easy to adjust.
Corpus Processing Library
Practical machine-learning experiments in Python: processing sentences and finding relevant ones, approximating functions with polynomials, and function optimization
Corpus processing library
A tool to perform sentence segmentation on Japanese text
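A minimal sketch of rule-based Japanese sentence segmentation, assuming sentences end in 。, ！, or ？; the actual tool's rules are not shown here and are likely more elaborate (handling quotes, brackets, and line breaks).

```python
import re

def split_sentences(text):
    """Split on Japanese terminal punctuation, keeping the punctuation
    attached to each sentence via a lookbehind split."""
    parts = re.split(r"(?<=[。！？])", text)
    # re.split leaves a trailing empty string; drop empty chunks.
    return [p for p in parts if p]

print(split_sentences("今日は晴れです。明日は雨でしょう。"))
# → ['今日は晴れです。', '明日は雨でしょう。']
```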