minMamba

A simple PyTorch re-implementation of Mamba in a single file. minMamba tries to be small, clean, interpretable and educational.
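For orientation, the heart of a Mamba block is a selective state-space scan: the state transition, input and output matrices are modulated per timestep by the input, and the hidden state is updated recurrently along the sequence. The sketch below is not code from this repository; it is a simplified reference implementation in plain PyTorch (a sequential loop with an Euler-style discretization of B, and hypothetical tensor names) intended only to illustrate the recurrence described in the paper:

import torch

def selective_scan(x, delta, A, B, C, D):
    # x:     (batch, length, d_inner)   input sequence
    # delta: (batch, length, d_inner)   input-dependent step sizes
    # A:     (d_inner, d_state)         state matrix (negative real parts in practice)
    # B, C:  (batch, length, d_state)   input-dependent input / output matrices
    # D:     (d_inner,)                 skip connection
    b, l, d_inner = x.shape
    n = A.shape[1]
    # Discretize: A_bar = exp(delta * A); B_bar * x approximated by delta * B * x
    deltaA = torch.exp(delta.unsqueeze(-1) * A)                        # (b, l, d_inner, n)
    deltaBx = delta.unsqueeze(-1) * B.unsqueeze(2) * x.unsqueeze(-1)   # (b, l, d_inner, n)
    h = torch.zeros(b, d_inner, n, dtype=x.dtype, device=x.device)
    ys = []
    for t in range(l):                                  # sequential scan over the sequence
        h = deltaA[:, t] * h + deltaBx[:, t]            # recurrent state update
        ys.append((h * C[:, t].unsqueeze(1)).sum(-1))   # project state to output, (b, d_inner)
    return torch.stack(ys, dim=1) + x * D               # add the skip term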

Library Installation

If you want to import minmamba into your project:

git clone https://github.com/lckr/minMamba.git
cd minMamba
pip install -e .

Usage

Here's how you'd load a pretrained Mamba model from the Hugging Face Hub:

import minmamba.model
pretrained_model = minmamba.model.MambaLMModel.from_pretrained("state-spaces/mamba-130m")

And here's how you'd run inference with it:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b") # tokenizer used by "state-spaces/mamba-130m"

input_seq = tokenizer("A fish is a ", return_tensors="pt")["input_ids"]
gen_seq = pretrained_model.generate(input_seq, 100)
print(tokenizer.decode(gen_seq[0]))
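Putting the two snippets together, a small helper can wrap tokenization, generation and decoding. The complete function and the device handling below are additions of this sketch, not part of the package; it assumes the model behaves as a regular torch.nn.Module and that the second argument of generate is the number of tokens to generate:

import torch
import minmamba.model
from transformers import AutoTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"
model = minmamba.model.MambaLMModel.from_pretrained("state-spaces/mamba-130m").to(device).eval()
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")

@torch.no_grad()
def complete(prompt: str, n_tokens: int = 100) -> str:
    # tokenize the prompt, generate a continuation, decode back to text
    input_ids = tokenizer(prompt, return_tensors="pt")["input_ids"].to(device)
    return tokenizer.decode(model.generate(input_ids, n_tokens)[0])

print(complete("A fish is a "))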

References

Code:

Official implementation by the paper authors: https://github.com/state-spaces/mamba

Paper:

Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Albert Gu*, Tri Dao*
Paper: https://arxiv.org/abs/2312.00752

License

MIT
