[EMNLP 2022] Discovering Language-neutral Sub-networks in Multilingual Language Models.
Align parallel sentences across 104 languages with the help of mBERT and LaBSE
Bengali Misogyny Identification with Deep Learning and LIME.
An observatory of anglicism usage in the Spanish press
Fine-tuned BERT, mBERT and XLM-RoBERTa for Abusive Comments Detection in Telugu, Code-Mixed Telugu and Telugu-English.
A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
Collection of scripts used to create SRL datasets for Galician and Spanish using a verbal indexing method, as well as fine-tuned BERT and XLM-R models for SRL on each language
GPT-3.5 Fine-Tuning
Multilingual hate speech detection (machine-learning classifier) for German, Italian and Spanish social media posts
This is a project proposal to implement Yan et al.'s (2020) mBERT-Unaligned for cross-lingual RDs with Japanese, German and Italian untranslatable terms
Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
Slovenian Definition Extraction
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accepted in Findings of the Annual Conference of the North American Chap…
mBERT and XLM-R for encoding Scandinavian languages
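Sentence-encoding projects like the one above typically collapse mBERT's or XLM-R's token-level outputs into a single sentence vector by mean-pooling over real (non-padding) tokens. A minimal NumPy sketch of just that pooling step, assuming the contextual token embeddings have already been computed elsewhere (`mean_pool` is an illustrative name, not part of any library):

```python
import numpy as np

def mean_pool(token_embeddings, attention_mask):
    """Average token vectors, ignoring padding positions.

    token_embeddings: (seq_len, hidden) array of contextual vectors
    attention_mask:   (seq_len,) sequence with 1 for real tokens, 0 for padding
    """
    mask = np.asarray(attention_mask, dtype=float)[:, None]   # (seq_len, 1)
    summed = (np.asarray(token_embeddings) * mask).sum(axis=0)
    return summed / mask.sum()                                # divide by token count

# Toy example: 3 positions with hidden size 2; the last position is padding.
emb = np.array([[1.0, 2.0],
                [3.0, 4.0],
                [9.0, 9.0]])   # padding row is excluded by the mask
mask = [1, 1, 0]
print(mean_pool(emb, mask))    # → [2. 3.]
```

Dividing by the mask sum rather than the sequence length is the important detail: it keeps padded batches from diluting the average.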
Zero-shot and Translation Experiments on XQuAD, MLQA and TyDiQA
Using hypotheses from historical linguistics, we found a way to improve the performance of multilingual transformers with a limited amount of data
HASOC2021, Subtask 2a (Code-Mix Challenge): contains baselines and a hierarchical approach that extracts the relevant context for classifying hostile tweets in English-Hindi code-mixed data collected from Twitter.
ICEBERT: Interlingual-Clusters Enhanced BERT. A BERT-like model trained on clusters of monolingual subwords.