Issues: microsoft/onnxruntime-genai

Issues list

GeneratorParams should not throw an exception [enhancement]
#453 opened May 14, 2024 by skottmckay
Wasm/WebGPU backends? [enhancement]
#452 opened May 14, 2024 by josephrocca
Regex decoding support [enhancement]
#450 opened May 14, 2024 by sheepymeh
How to release GPU memory after each inference? [enhancement]
#446 opened May 13, 2024 by nguyenthekhoig7
Extensions for LLM
#442 opened May 11, 2024 by hannespreishuber
How to ignore EOS token when using onnxruntime-genai [enhancement]
#436 opened May 10, 2024 by Tabrizian
cannot build phi2 model on system with 32 GB ram [enhancement]
#319 opened Apr 24, 2024 by liqunfu
Phi-3 on mobile
#316 opened Apr 24, 2024 by cvb941