# Multi-Head Mixture of Experts (MHMoE)

MH-MoE collectively attends to information from various representation spaces within different experts, deepening context understanding while significantly enhancing expert activation.
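
Conceptually, MH-MoE projects each token into several sub-tokens (one per head), routes every sub-token to an expert independently, and then merges the expert outputs back into a single token. The class below is a minimal, unoptimized sketch of that idea and is not the package's implementation; the class name `NaiveMHMoE`, the expert MLP width, and the top-1 routing are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class NaiveMHMoE(nn.Module):
    """Illustrative sketch of the MH-MoE idea: split each token into per-head
    sub-tokens, route every sub-token to an expert, then merge them back."""

    def __init__(self, dim, heads, num_experts):
        super().__init__()
        assert dim % heads == 0
        self.heads = heads
        self.head_dim = dim // heads
        self.in_proj = nn.Linear(dim, dim)    # multi-head split projection
        self.out_proj = nn.Linear(dim, dim)   # merge projection
        self.gate = nn.Linear(self.head_dim, num_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(
                nn.Linear(self.head_dim, 4 * self.head_dim),
                nn.GELU(),
                nn.Linear(4 * self.head_dim, self.head_dim),
            )
            for _ in range(num_experts)
        )

    def forward(self, x):
        b, s, d = x.shape
        # Split every token into `heads` sub-tokens of size head_dim
        sub = self.in_proj(x).reshape(b * s * self.heads, self.head_dim)
        # Top-1 routing: each sub-token independently picks one expert
        scores = F.softmax(self.gate(sub), dim=-1)
        weight, idx = scores.max(dim=-1)
        out = torch.zeros_like(sub)
        for e, expert in enumerate(self.experts):
            sel = idx == e
            if sel.any():
                out[sel] = weight[sel].unsqueeze(-1) * expert(sub[sel])
        # Merge the sub-tokens back into full tokens
        return self.out_proj(out.reshape(b, s, d))
```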

## Install

```bash
pip3 install mh-moe
```

## Usage

```python
import torch
from mh_moe.main import MHMoE

# Define model parameters
dim = 512
heads = 8
num_experts = 4
num_layers = 3

# Create MHMoE model instance
model = MHMoE(dim, heads, num_experts, num_layers)

# Generate dummy input
batch_size = 10
seq_length = 20
dummy_input = torch.rand(batch_size, seq_length, dim)
dummy_mask = torch.ones(batch_size, seq_length)  # Example mask

# Forward pass through the model
output = model(dummy_input, dummy_mask)

# Print output and its shape
print(output)
print(output.shape)
```
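
If the forward pass preserves the input shape `(batch_size, seq_length, dim)`, as is typical for MoE feed-forward layers, the module can stand in for the feed-forward sublayer of a transformer block. The sketch below is illustrative only and assumes the constructor and forward signature shown above; `SimpleBlock` and its attention settings are not part of the package.

```python
import torch
import torch.nn as nn

from mh_moe.main import MHMoE


class SimpleBlock(nn.Module):
    """Illustrative transformer-style block: self-attention followed by an MH-MoE feed-forward."""

    def __init__(self, dim=512, heads=8, num_experts=4, num_layers=1):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.moe = MHMoE(dim, heads, num_experts, num_layers)

    def forward(self, x, mask=None):
        # Pre-norm self-attention with a residual connection
        h = self.norm1(x)
        attn_out, _ = self.attn(h, h, h, need_weights=False)
        x = x + attn_out
        # MH-MoE feed-forward with a residual connection
        return x + self.moe(self.norm2(x), mask)


block = SimpleBlock()
x = torch.rand(2, 16, 512)
mask = torch.ones(2, 16)
print(block(x, mask).shape)  # torch.Size([2, 16, 512]) if MHMoE preserves the input shape
```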
