Questions about local-llm awq support #474

Open
cha-noong opened this issue Apr 11, 2024 · 2 comments

@cha-noong

The README for local_llm says that MLC and AWQ are supported as backends.

However, when I run it, AWQ is commented out (./models/__init__), and the related installation and dependencies appear to be missing from the Dockerfile.

I wonder whether AWQ support is still in development.

@dusty-nv
Owner

@cha-noong Yes, sorry, I need to update the other HF-based APIs (it's on my TODO list). That said, from benchmarking them I know none are as fast as MLC, which is why I use that, and it's more likely I'll add a TensorRT-LLM backend when that becomes available for Jetson.
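
For reference, a minimal sketch of running the chat demo with the MLC backend, assuming the jetson-containers run.sh/autotag helpers and the --api/--model flags shown in the local_llm README (exact flags may differ between versions):

```sh
# Launch the local_llm container on Jetson and select the MLC backend.
# Assumes the run.sh/autotag helpers from jetson-containers and the
# --api/--model arguments from the local_llm README; AWQ would in principle
# be selected with --api=awq if that backend were enabled in the container.
./run.sh $(./autotag local_llm) \
  python3 -m local_llm.chat --api=mlc \
    --model meta-llama/Llama-2-7b-chat-hf
```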

@cha-noong
Author

Thank you for the quick response.

As far as I know, TensorRT-LLM is not yet available for Jetson. Can you say roughly when this will be possible?
