Pull requests: NVIDIA/FasterTransformer
Fix shape mismatch on the masked_tokens param in decoder masked multi-head attention kernel.
#773 opened Oct 24, 2023 by FengDSP
[BugFix] GPT inference error when pipeline_para_size > 1 and int8_mode != 0
#750 opened Aug 23, 2023 by 00why00
[Doc] Add projects section in README which is developed based on FasterTransformer
#731 opened Jul 25, 2023 by lvhan028
Add triton fastertransformer backend support for deberta
#725 opened Jul 19, 2023 by sfc-gh-zhwang
fix: initialize tiled_prompt_lengths_buf_ to zero in gptneox
#716 opened Jul 13, 2023 by yandai
Huggingface gptj convert script supports sharded checkpoint
#695 opened Jun 29, 2023 by skyser2003