Resources and tools for Indian language Natural Language Processing
Updated Apr 18, 2024 - Python
A collaborative catalog of NLP resources for Indic languages
Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For the latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
Xlit-Crowd: Hindi-English Transliteration Corpus
Resources to go with the Indic NLP Library
indicTranslate v1 - Machine Translation for 11 Indic languages. For the latest v2, check: https://github.com/AI4Bharat/IndicTrans2
Software and Resources for Mitigating Online Gender Based Violence in India
Codebase for Indic-Transliteration using a Seq2Seq RNN. For the latest repo with Transformer-based models, check: https://github.com/AI4Bharat/IndicXlit
A configurable engine for analysing multi-lingual and multi-modal content.
Python library for converting numbers to words for all Indian Languages.
Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023
A Python NLP Toolkit for Gujarati (in progress)
An LSTM-CRF classifier for NER in Telugu, an Indian language.
Tooling to play around with multilingual machine translation for Indian Languages.
This repository hosts my experiments for the project I did with OffNote Labs.
Curated list of publicly available parallel corpora for Indian Languages
Small demo showing how MuRIL (Multilingual Representations for Indian Languages: a BERT model pre-trained on 17 Indian languages) understands Indian languages better
This repository demonstrates the use of Amazon Bedrock Claude 3 models for Indian languages. Use cases include, but are not limited to: 1. information extraction, 2. question answering, 3. summarisation, 4. translation, and 5. transliteration from content in Indian languages such as Hindi, Telugu, Tamil, Malayalam, Marathi, and Kannada, to mention a few.
A transliterator between ITRANS and any Indic Script.