A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
-
Updated
May 11, 2024 - Python
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
A sentence segmenter that actually works!
Punctuation restoration and spell correction experiments.
A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.
Text normalization library for Python
Text and Punctuation correction with Deep Learning
Pre-process arabic text (remove diacritics, punctuations and repeating characters)
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules …
Apache OpenNLP wrapper for Nodejs
A small seq2seq punctuator tool based on DistilBERT
Нейронная сеть для восстановления пунктуации на русском языке.
Sequence to sequence model for Arabic punctuation prediction.
#Sentimental Analytics
Regular Expressions for finding wrong punctuation before publishing.
Regular expression for matching punctuation characters.
A blazingly fast tool for converting to English punctuations
Created a Python library specifically for Traditional Chinese stopwords and punctuations removal
Add a description, image, and links to the punctuation topic page so that developers can more easily learn about it.
To associate your repository with the punctuation topic, visit your repo's landing page and select "manage topics."