-
-
Notifications
You must be signed in to change notification settings - Fork 4.9k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Model][LoRA]LoRA support added for MolmoForCausalLM
#11439
opened Dec 23, 2024 by
ayylemao
Loading…
[Misc]Suppress irrelevant exception stack trace information when CUDA…
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#11438
opened Dec 23, 2024 by
shiquan1988
Loading…
[Bugfix] Fix issues in CPU build Dockerfile. Fixes #9182
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#11435
opened Dec 23, 2024 by
terrytangyuan
Loading…
[Bugfix][Hardware][CPU] Fix CPU ONLY add when PR is ready to merge/full CI is needed
input_positions
creation for text-only inputs with mrope
ready
#11434
opened Dec 23, 2024 by
Isotr0py
Loading…
Bump helm/kind-action from 1.10.0 to 1.11.0
ci/build
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#11424
opened Dec 23, 2024 by
dependabot
bot
Loading…
[WIP][Doc]Add documentation for using EAGLE in vLLM
documentation
Improvements or additions to documentation
[V1] Optimize block table transfer from CPU to GPU
ci/build
#11401
opened Dec 22, 2024 by
WoosukKwon
•
Draft
[VLM] Support caching in merged multi-modal processor
documentation
Improvements or additions to documentation
#11396
opened Dec 21, 2024 by
DarkLight1337
Loading…
[V1] Use FlashInfer Sampling Kernel for Top-P & Top-K Sampling
#11394
opened Dec 21, 2024 by
WoosukKwon
•
Draft
[Bugfix] Use .clone() for sampling params and deepcopy XGrammarLogitsProcessor
#11380
opened Dec 20, 2024 by
tjohnson31415
•
Draft
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.