You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hey @czb2133 , absolutely! We recommend using the gguf files with abetlen/llama-cpp-python to load and run inference.
Check out the colab notebook here.
Also, if you want to host your own server, we included a Triton Server deployment using Docker. Check out these instructions.
It is really surprising that the ckpt is accessible in hugging face. Can you provide a python file for inference?
The text was updated successfully, but these errors were encountered: