Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Misc] Add attention sinks
#3515 opened Mar 19, 2024 by felixzhu555 Draft
torch.compile() support needs-rebase unstale
#3014 opened Feb 23, 2024 by ani300 Loading…
Directly call cublas in awq_dequant
#2929 opened Feb 20, 2024 by zcnrex Draft
[docs] add load balancing examples
#1837 opened Nov 29, 2023 by imoneoi Loading…
add policy needs-rebase unstale
#2071 opened Dec 13, 2023 by xxw1995 Loading…
Prefix caching and deallocation mechanism documentation Improvements or additions to documentation frontend needs-rebase unstale
#2511 opened Jan 19, 2024 by jadielam Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.