Remaining pieces for upstreaming #153

Open
sunggg opened this issue Jan 9, 2024 · 0 comments

sunggg commented Jan 9, 2024

These are prerequisites for making mlc-serve an independent package.

  • Mixtral support @vinx13
  • vLLM v2 kernel @vinx13
  • Misc changes in core.py for mlc-serve-specific artifact dump @sunggg
  • Batched model support for split + rotary fusion (mlc_llm/transform/fuse_split_rotary_embedding.py). This one depends on a hack to TVM.
@masahi changed the title from "[Feature Request, Low Priority] FT quantization + Multi-gpus" to "Remaining pieces for upstreaming" on Jan 9, 2024