Pull requests: vllm-project/vllm
Pull requests list
[Kernel][Core][WIP] Tree attention and parallel decoding
needs-rebase
unstale
#4325
opened Apr 24, 2024 by
yukavio
[Hardware][Nvidia][Core][Feature] new feature add: vmm(virtual memory manage) kv cache for nvidia gpu
ci/build
needs-rebase
unstale
#6102
opened Jul 3, 2024 by
izhuhaoran
[Core] Support sparse KV cache framework
needs-rebase
#5752
opened Jun 21, 2024 by
chizhang118
[Hardware][Ascend] Add Ascend NPU backend
ci/build
needs-rebase
#8054
opened Aug 31, 2024 by
wangshuai09
12 tasks done
[Bugfix]Fix evict v2 with long context length
needs-rebase
unstale
#5411
opened Jun 11, 2024 by
puf147
GPTQ & AWQ Fused MOE
needs-rebase
unstale
#2761
opened Feb 5, 2024 by
chu-tianxiang
3 tasks done
Add control panel allow manage multi vllm instances
frontend
needs-rebase
unstale
#4861
opened May 16, 2024 by
leiwen83
[core] Sampling controller interface
needs-rebase
unstale
#6273
opened Jul 9, 2024 by
mmoskal
[ DO NOT MERGE ] grpc openai server prototypes
#6839
opened Jul 26, 2024 by
robertgshaw2-neuralmagic
•
Draft
[Model] Add moondream vision language model
documentation
needs-rebase
unstale
#4228
opened Apr 20, 2024 by
vikhyat
[V1] Supports scheduling asynchronousization on V1 version
needs-rebase
#11133
opened Dec 12, 2024 by
lixiaolx
bug fixed: cuda out of memory lead to 'AsyncEngineDeadError: Background loop has errored already.
unstale
#5173
opened Jun 1, 2024 by
charent
[WIP] Qwen-style dynamic-NTK ROPE kernel for long sequence support
needs-rebase
unstale
#1860
opened Nov 30, 2023 by
ZiyueHuang
[FIX] Fix shape mismatch for swapped sequences when logprobs > 0
needs-rebase
unstale
#1971
opened Dec 7, 2023 by
derange-alembic
[Misc] Support register quantization method out-of-tree
#11969
opened Jan 12, 2025 by
ice-tong