Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

convert-hf-to-gguf-update.py: Added Ukrainian tokens into string python python script changes
#7888 opened Jun 11, 2024 by AragonerUA Loading…
2 of 4 tasks
Implemented Spellcheck for Llama.cpp examples
#7884 opened Jun 11, 2024 by Ferruolo Loading…
2 tasks
json: better support for "type" unions (e.g. nullable arrays w/ typed items) examples python python script changes Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix server testing Everything test related
#7863 opened Jun 10, 2024 by ochafik Loading…
[SYCL] Revert "use the correct SYCL context for host USM allocations" SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#7858 opened Jun 10, 2024 by AidanBeltonS Loading…
tests : add non-cont unary tests ggml changes relating to the ggml tensor library for machine learning Kompute https://github.com/KomputeProject/kompute/ Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#7857 opened Jun 10, 2024 by ggerganov Loading…
ggml : improve ggml_is_contiguous logic ggml changes relating to the ggml tensor library for machine learning
#7856 opened Jun 10, 2024 by ggerganov Loading…
gguf-dump.py: add --markdown dump output python python script changes Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7853 opened Jun 10, 2024 by mofosyne Loading…
Type cast AVX512_BF16 data types based on compiler instead of the OS platform ggml changes relating to the ggml tensor library for machine learning
#7851 opened Jun 10, 2024 by Srihari-mcw Loading…
AVX IQ Quants ggml changes relating to the ggml tensor library for machine learning
#7845 opened Jun 10, 2024 by netrunnereve Draft
examples: refine tensor dump examples
#7844 opened Jun 10, 2024 by zhouwg Loading…
Fix conversion of unnormalized BF16->BF16 weights python python script changes
#7843 opened Jun 10, 2024 by CISC Loading…
[WIP] json: fix additionalProperties, allow space after enum/const examples python python script changes server testing Everything test related
#7840 opened Jun 9, 2024 by ochafik Draft
1 of 2 tasks
update: support Qwen2-57B-A14B python python script changes
#7835 opened Jun 8, 2024 by legraphista Loading…
Avoid division-by-zero on 0-weights ggml changes relating to the ggml tensor library for machine learning Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7825 opened Jun 7, 2024 by CISC Loading…
Add Qwen2MoE 57B-A14B
#7814 opened Jun 7, 2024 by CISC Loading…
build: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. devops improvements to build systems and github actions documentation Improvements or additions to documentation examples help wanted Extra attention is needed nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment python python script changes script Script related server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#7809 opened Jun 6, 2024 by ochafik Loading…
[WIP] json: support integer minimum, maximum, exclusiveMinimum, exclusiveMaximum enhancement New feature or request examples python python script changes Review Complexity : High Generally require indepth knowledge of LLMs or GPUs server testing Everything test related
#7797 opened Jun 6, 2024 by ochafik Draft
3 of 7 tasks
WIP: Use DirectStorage with CUDA interop to more efficient load tensors build Compilation issues ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#7796 opened Jun 6, 2024 by mtavenrath Draft
feat: add changes to handle jina v2 chinese code python python script changes
#7795 opened Jun 6, 2024 by JoanFM Loading…
Fix a typo + add Fedora packages for Vulkan
#7794 opened Jun 6, 2024 by metal3d Loading…
JSON Schema to GBNF integration tests testing Everything test related
#7790 opened Jun 6, 2024 by HanClinto Loading…
Fix missing libgomp.so.1 Error in Docker Container for llama.cpp devops improvements to build systems and github actions
#7775 opened Jun 5, 2024 by 0x4139 Loading…
Enable stream updating in the SwiftUI example examples Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7754 opened Jun 5, 2024 by shu223 Loading…
Fix no gcc pragma on Windows merge ready indicates that this may be ready to merge soon and is just holding out in case of objections Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7751 opened Jun 4, 2024 by jojorne Loading…
ProTip! no:milestone will show everything without a milestone.