
Add LLamaSharp.Backend.Vulkan #3 #517

Open
wants to merge 4 commits into master
Conversation

moozoo64

This PR adds the llama.cpp Vulkan backend.

@martindevans
Collaborator

martindevans commented Feb 18, 2024

Does this supersede #514?

Just in case you're not aware, you can always amend a PR by pushing to the branch; you don't need to re-open new ones :)

Edit: Just saw your comment on the other PR, never mind!

@martindevans
Collaborator

Note to self: the failing tests are expected because the binaries are missing from this PR. Base the next binary-update PR on this branch.

@moozoo64
Author

Just some general comments:
I have only tested on Windows with an AMD Radeon VII, so I'd ask that others test this PR in order to validate it; I only know that it works for me and my simple use case.
I've not tested under Linux, but I believe it has a high chance of working.

On a system with CUDA, the CUDA library should precede the Vulkan library in the load order and hence be used.
In theory, Vulkan could be made to work on macOS using Vulkan-to-Metal translation (e.g. MoltenVK), but I don't see the point.

The llama.cpp Vulkan backend fully loads the model into VRAM; I believe the CLBlast backend doesn't, and neither does Kompute.
In my testing, the Vulkan backend is also much faster than CLBlast.
Benchmarks on the llama.cpp site suggest the Vulkan backend is about 60% to 90% as fast as the hipBLAS backend; however, hipBLAS supports a much more limited set of graphics cards.

With
private bool _useVulkan = true;
the presence of Vulkan will be checked via GetVulkanVersion().
This uses the command "vulkaninfo --summary", which on Windows with an AMD graphics card is installed along with the driver.
Ditto with Nvidia cards; I don't know about Intel, but I assume so.
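
For illustration, here is a minimal sketch of what such a check might look like (hypothetical names; the PR's actual GetVulkanVersion() implementation may differ). It just runs "vulkaninfo --summary" and scans the output for a device entry:

```csharp
using System;
using System.Diagnostics;

// Hypothetical sketch, not the PR's actual code: detect a Vulkan device
// by running "vulkaninfo --summary" and scanning its output.
internal static class VulkanCheck
{
    public static bool HasVulkanDevice()
    {
        try
        {
            var psi = new ProcessStartInfo("vulkaninfo", "--summary")
            {
                RedirectStandardOutput = true,
                RedirectStandardError = true,
                UseShellExecute = false,
                CreateNoWindow = true,
            };

            using var proc = Process.Start(psi);
            if (proc is null)
                return false;

            string output = proc.StandardOutput.ReadToEnd();
            proc.WaitForExit();

            // The summary lists each physical device with a "deviceType" line
            // (e.g. "deviceType = PHYSICAL_DEVICE_TYPE_DISCRETE_GPU").
            // A substring match like this is exactly the fragility discussed
            // below: it breaks if the output format ever changes.
            return proc.ExitCode == 0 && output.Contains("deviceType");
        }
        catch (Exception)
        {
            // vulkaninfo missing or not runnable -> treat as "no Vulkan".
            return false;
        }
    }
}
```

A caller could then gate backend selection on this, falling back to the CPU or CLBlast backend when it returns false.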

On Linux you might need to run "sudo apt install vulkan-tools". I needed to do this under WSL on my Nvidia GPU system but not on my AMD GPU system.
That said, I've not tested the backend under WSL, and I don't believe Vulkan works properly there.

The safest option is _useVulkan = false, removing the check via GetVulkanVersion(); however, that means the Vulkan backend won't automatically be used on systems that have it.
I don't fully parse the "vulkaninfo --summary" output, only enough to tell that a Vulkan device is present.
Obviously this would break if the format of "vulkaninfo --summary" were ever changed, and it would be complicated to work out both that a Vulkan device is present and that it could actually run llama.cpp.
The only other options to using "vulkaninfo --summary" are:

  1. Have LLamaSharp use the Vulkan SDK directly, which I don't think is desirable.
  2. Create a small stub program, linked against the Vulkan SDK, that simply returns VulkanOK (a sketch along these lines follows below).
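
For what it's worth, a variant of option 2 could also be done in-process: instead of shipping a separate stub binary, P/Invoke the Vulkan loader directly. The sketch below is only an illustration and not part of this PR; it assumes the loader is named vulkan-1.dll on Windows (on Linux it is libvulkan.so.1, which would need a resolver hook), and note that a present loader does not by itself guarantee a usable GPU:

```csharp
using System;
using System.Runtime.InteropServices;

// Illustrative only: query the Vulkan loader's instance version via
// vkEnumerateInstanceVersion (available since Vulkan 1.1). A loader being
// present does not guarantee a usable device; a full check would also
// need vkCreateInstance + vkEnumeratePhysicalDevices.
internal static class VulkanLoaderProbe
{
    // "vulkan-1" resolves to vulkan-1.dll on Windows; on Linux the library
    // is libvulkan.so.1, so a NativeLibrary.SetDllImportResolver hook
    // (or a second DllImport) would be needed there.
    [DllImport("vulkan-1")]
    private static extern int vkEnumerateInstanceVersion(out uint apiVersion);

    public static bool TryGetVulkanVersion(out Version version)
    {
        version = new Version(0, 0);
        try
        {
            if (vkEnumerateInstanceVersion(out uint packed) != 0) // VK_SUCCESS == 0
                return false;

            // VK_API_VERSION packing: variant:3 | major:7 | minor:10 | patch:12
            version = new Version(
                (int)((packed >> 22) & 0x7F),
                (int)((packed >> 12) & 0x3FF),
                (int)(packed & 0xFFF));
            return true;
        }
        catch (DllNotFoundException)
        {
            return false; // no Vulkan loader installed
        }
        catch (EntryPointNotFoundException)
        {
            return false; // loader predates Vulkan 1.1
        }
    }
}
```

Unlike parsing vulkaninfo output, this fails cleanly (DllNotFoundException) when no loader is installed, but a full device check would still need to enumerate physical devices.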

@martindevans
Collaborator

@moozoo64 Sorry for the delay with this PR. I'm just starting to look at the next round of binary updates now.

To make that easier, would you mind creating a new PR with just the changes to the GitHub build action? That way I can merge it in without breaking CI, and we can merge the rest of the changes in a separate PR later once the binaries are in place.

@AsakusaRinne
Collaborator

@moozoo64 Hi, would you like to continue the work and finish this PR? If you have any problems resolving the conflicts, please feel free to ask me for help. :)
