-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Issues: mlc-ai/mlc-llm
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Question] Quantization Problems
question
Question about the usage
#2523
opened Jun 6, 2024 by
ponytaill
[Bug] chatglm4 mlc_llm gen_config shows error 'ChatGLM4Tokenizer' object has no attribute 'backend_tokenizer' and then Segmentation fault
bug
Confirmed bugs
#2517
opened Jun 6, 2024 by
lihaofd
[Question] Running mlc_llm into a multi-phase container build
question
Question about the usage
#2512
opened Jun 5, 2024 by
oglok
mlc_llm package is ERROR: returned non-zero exit status[Bug]
bug
Confirmed bugs
#2511
opened Jun 5, 2024 by
panghongtao
[Bug] FlashInfer decode BeginForward error an illegal instruction was encountered
bug
Confirmed bugs
#2509
opened Jun 4, 2024 by
zifeitong
[Feature Request] please allow f32q5_k and f16q5_k quantizations
feature request
New feature or request
#2506
opened Jun 4, 2024 by
0wwafa
[Bug] SEVERE downstream task performance degradation compared to uncompiled model
bug
Confirmed bugs
#2499
opened Jun 4, 2024 by
0xLienid
[Bug] Confirmed bugs
mlc_llm serve
throws CUDA: invalid device ordinal
bug
#2498
opened Jun 3, 2024 by
josephrocca
[Question] Cannot compile custom model to work on web browser
question
Question about the usage
#2485
opened Jun 2, 2024 by
lawofcycles
[Bug] iOS | mlc_llm package not working
bug
Confirmed bugs
#2477
opened May 30, 2024 by
iOSDevCodiste
[Doc] benchmark on different hardware
documentation
Improvements or additions to documentation
#2475
opened May 30, 2024 by
louis030195
[Doc] Request for suggested build-from-source options + explanation of added functionality
documentation
Improvements or additions to documentation
#2473
opened May 30, 2024 by
BuildBackBuehler
Compiling WebAssembly library with debug symbols/source map to aid in debugging
question
Question about the usage
#2472
opened May 30, 2024 by
slash-under
mlc_llm serve fails on concurrent users - Llama3 70B parameter hosting
bug
Confirmed bugs
#2462
opened May 29, 2024 by
swamysrivathsan
'ChatGLMTokenizer' object has no attribute 'backend_tokenizer'
bug
Confirmed bugs
#2460
opened May 29, 2024 by
lihaofd
qwen1.5-0.5B-chat : lm_head.weight
question
Question about the usage
#2458
opened May 29, 2024 by
viaowp
[Doc] Python API KV/memory reset details absent
documentation
Improvements or additions to documentation
#2426
opened May 26, 2024 by
federicoparra
[Feature Request] phi-3 small realeased -> performs two times ebtter then Phi-3 mini
feature request
New feature or request
#2420
opened May 26, 2024 by
sebastienbo
Phi-2 q4f16_1 runs faster when compiled without Confirmed bugs
tvm.relax.transform.FuseOps()
and tvm.relax.transform.FuseTIR()
transformations
bug
#2405
opened May 24, 2024 by
MMuzzammil1
Fail to build tvm-unity from source on orin[Bug]
bug
Confirmed bugs
#2389
opened May 23, 2024 by
Louym
[Bug] java.lang.NullPointerException: Attempt to invoke virtual method 'org.apache.tvm.TVMValue org.apache.tvm.Function.invoke()' on a null object reference
bug
Confirmed bugs
#2366
opened May 21, 2024 by
View999888
Previous Next
ProTip!
Follow long discussions with comments:>50.