Attention is all you need: Discovering the Transformer model
HydraViT is a PyTorch implementation of the HydraViT model, an adaptive multi-branch transformer for multi-label disease classification from chest X-ray images. The repository provides the necessary code to train and evaluate the HydraViT model on the NIH Chest X-ray dataset.
Code for the runner-up entry in the English subtask of the Shared Task on Fighting the COVID-19 Infodemic, NLP4IF workshop, NAACL'21.
This repository contains code implementing the Vision Transformer (ViT) model for image classification.
The Transformer model implemented from scratch in PyTorch. The model shares weights between the embedding layers and the pre-softmax linear layer, and training on the Multi30k machine translation task is demonstrated.
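For reference, that weight-sharing trick amounts to reusing the embedding matrix as the output projection. A minimal PyTorch sketch (the class and parameter names are illustrative, not taken from the repository):

```python
import torch.nn as nn

class TiedEmbeddingHead(nn.Module):
    """Shares one weight matrix between the token embedding and the
    pre-softmax linear layer, as in the original Transformer paper."""

    def __init__(self, vocab_size: int, d_model: int):
        super().__init__()
        # self.embed would also be used at the model input in a full model.
        self.embed = nn.Embedding(vocab_size, d_model)
        # nn.Linear stores its weight as (out_features, in_features) =
        # (vocab_size, d_model), the same shape as the embedding table,
        # so the two layers can share a single parameter.
        self.proj = nn.Linear(d_model, vocab_size, bias=False)
        self.proj.weight = self.embed.weight  # weight tying

    def forward(self, hidden):
        # hidden: (batch, seq, d_model) -> logits: (batch, seq, vocab_size)
        return self.proj(hidden)
```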
Collection of different types of transformers for learning purposes
Complete implementation of the original Transformer model.
Simple character-level Transformer.
Machine learning development toolkit built on Transformer encoder architectures, tailored to high-energy physics and particle-collision event analysis.
Transformer translator website with multithreaded web server in Rust
A Transformer classifier implemented from scratch.
This project implements the scaled dot-product attention layer and the multi-head attention layer with several positional encoding methods.
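For orientation, scaled dot-product attention from "Attention Is All You Need" is Attention(Q, K, V) = softmax(QKᵀ / √d_k) V, which reduces to a few tensor operations. A minimal PyTorch sketch (the function name and mask convention are assumptions, not this project's API):

```python
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    """softmax(Q K^T / sqrt(d_k)) V, with an optional mask where
    0 marks positions to be ignored."""
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)  # (..., L_q, L_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = torch.softmax(scores, dim=-1)            # attention weights
    return weights @ v                                 # (..., L_q, d_v)
```

Multi-head attention applies this same function in parallel to h learned projections of Q, K, and V, then concatenates the per-head outputs.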
Text matching using several deep learning models.
Image captioning with an EfficientNet encoder and a Transformer decoder, combined with an attention mechanism.
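As a rough illustration of that encoder-decoder pairing, a CNN backbone's flattened feature map can serve as the memory for a standard Transformer decoder. A hedged sketch built from stock torchvision/PyTorch components (all names and hyperparameters are assumptions; positional encodings are omitted for brevity):

```python
import torch.nn as nn
from torchvision.models import efficientnet_b0

class CaptioningSketch(nn.Module):
    def __init__(self, vocab_size: int, d_model: int = 256, nhead: int = 8):
        super().__init__()
        # EfficientNet-B0 feature extractor; its last stage emits 1280 channels.
        self.encoder = efficientnet_b0(weights=None).features
        self.feat_proj = nn.Linear(1280, d_model)  # CNN channels -> d_model
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerDecoderLayer(d_model, nhead, batch_first=True)
        self.decoder = nn.TransformerDecoder(layer, num_layers=3)
        self.out = nn.Linear(d_model, vocab_size)

    def forward(self, images, tokens):
        feats = self.encoder(images)                  # (B, 1280, H', W')
        # Flatten spatial positions into a sequence of "visual tokens".
        memory = self.feat_proj(feats.flatten(2).transpose(1, 2))
        causal = nn.Transformer.generate_square_subsequent_mask(tokens.size(1))
        hidden = self.decoder(self.embed(tokens), memory, tgt_mask=causal)
        return self.out(hidden)                       # (B, T, vocab_size)
```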
A basic multi-layer neural network with attention-masking features.
PyTorch implementation of Transformers.
Code and datasets for the paper "A deep learning framework for high-throughput mechanism-driven phenotype compound screening and its application to COVID-19 drug repurposing", published in Nature Machine Intelligence in 2021.
TensorFlow implementation of AlexNet with a multi-head attention mechanism.
EMNLP 2018: Multi-Head Attention with Disagreement Regularization; NAACL 2019: Information Aggregation for Multi-Head Attention with Routing-by-Agreement