-
Notifications
You must be signed in to change notification settings - Fork 977
Pull requests: huggingface/text-generation-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TENSORRT-LLM] - Implement new looper thread based backend
#2357
opened Aug 2, 2024 by
mfuntowicz
•
Draft
fix: fix num_ln_in_parallel_attn attribute name typo in RWConfig
#2350
opened Aug 1, 2024 by
almersawi
Loading…
3 of 5 tasks
hotfix: fix xpu crash brought by code refine. torch.xpu rely on impor…
#2337
opened Jul 31, 2024 by
sywangyi
Loading…
fix: improve completions to send a final chunk with usage details
#2336
opened Jul 30, 2024 by
drbh
Loading…
Using HF_HOME instead of CACHE to get token read in addition to models.
#2288
opened Jul 23, 2024 by
Narsil
Loading…
5 tasks
Feature: During Generation add support for no_repeat_ngram_size
#2232
opened Jul 15, 2024 by
njbrake
Loading…
2 tasks
doc: Add metrics documentation and add a 'Reference' section
documentation
Improvements or additions to documentation
#2230
opened Jul 15, 2024 by
Hugoch
Loading…
2 of 5 tasks
added tie_weights support to mlp speculator
#2215
opened Jul 10, 2024 by
JRosenkranz
Loading…
5 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-07-06.