Skip to content

Pull requests: NVIDIA/FasterTransformer

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Update README.md
#776 opened Oct 29, 2023 by eltociear Loading…
Include stdio.h
#770 opened Oct 19, 2023 by JihaoXin Loading…
Ft llama opt
#762 opened Oct 2, 2023 by dypshong Loading…
Support Seq length up to 8K
#756 opened Sep 4, 2023 by zhen-jia Loading…
[Bugfix] GptJ & GptNeoX batch inference error
#742 opened Aug 11, 2023 by YZP17121579 Loading…
Add fusion-for-decoder-only for llama
#733 opened Jul 28, 2023 by binxuan Loading…
Fix beam search output_log_prob index error
#732 opened Jul 25, 2023 by cpm0722 Loading…
Add cuDNN include path as a common include dir
#724 opened Jul 18, 2023 by jacobkahn Loading…
Remove parenthesis from asserts
#699 opened Jul 2, 2023 by miguelusque Loading…
[Doc] Fix typo in gpt_guide.md
#682 opened Jun 26, 2023 by myry96 Loading…
swin-transformer quantization readme files changes
#675 opened Jun 16, 2023 by Mhhhaster Loading…
fix: fix Qk_vec_acum_fp32_ has already been declared
#659 opened Jun 9, 2023 by lkm2835 Loading…
gptneox & gptj int8 quantization & share context
#653 opened Jun 7, 2023 by rahuan Loading…
Add missing headers
#648 opened Jun 1, 2023 by brian14708 Loading…
Fix TOC of gptneox_guide.md
#633 opened May 23, 2023 by xu-song Loading…
Fix for docker image build
#632 opened May 23, 2023 by fredr Loading…
Update gpt_guide.md: documentation link is invalid
#620 opened May 22, 2023 by treycheng Loading…
fix multi-gpu build
#616 opened May 17, 2023 by dskhudia Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.