Pull requests: triton-inference-server/server
build: Adding b64 dependency to relevant targets (fix L0_build_variants) (#7855)
#7882
opened Dec 14, 2024 by
yinggeh
fix: Lock httpx version to fix L0_openai--trtllm test failures (#7870)
#7881
opened Dec 14, 2024 by
yinggeh
feat: Adding version switcher to Triton sphinx documentation
PR: feat
A new feature
#7872
opened Dec 11, 2024 by
nv-kmcgill53
12 of 20 tasks
perf: Upgrade vLLM version to 0.6.3.post1
PR: perf
A code change that improves performance
#7858
opened Dec 6, 2024 by
kthui
9 of 20 tasks
feat: ORCA Format KV Cache Utilization in Inference Response Header
#7839
opened Nov 27, 2024 by
BenjaminBraunDev
12 of 22 tasks
refactor: Refactor of L0_backend_python and the env subtest
PR: ci
Changes to our CI configuration files and scripts
PR: refactor
A code change that neither fixes a bug nor adds a feature
#7838
opened Nov 27, 2024 by
nv-kmcgill53
Draft
5 of 20 tasks
ci: Fix Windows CI Errors
PR: ci
Changes to our CI configuration files and scripts
#7837
opened Nov 27, 2024 by
fpetrini15
9 of 18 tasks
draft: Added gRPC timer for graceful shutdown of inflight requests
#7835
opened Nov 25, 2024 by
mattwittwer
20 tasks
ci: Enables testing for pull requests
#7828
opened Nov 23, 2024 by
pranavm-nvidia
3 of 20 tasks
test: Updates L0 Python API tests to run all test files
#7827
opened Nov 23, 2024 by
pranavm-nvidia
4 of 20 tasks
fix: Default max tokens to None for OpenAI frontend.
#7819
opened Nov 20, 2024 by
thealmightygrant
4 of 22 tasks
feat: Adding RestrictedFeatures Support to the Python Frontend Bindings
#7775
opened Nov 8, 2024 by
KrishnanPrash
docs: Add clarification for label_filename in classification docs
#7766
opened Nov 5, 2024 by
trevoryao
7 of 22 tasks
docs: Simplify PR templates
PR: docs
Documentation only changes
#7753
opened Oct 29, 2024 by
yinggeh
6 of 11 tasks
[Do not merge!] Build: Remove TRT model generation for V100
#7712
opened Oct 16, 2024 by
pvijayakrish
Draft
3 of 20 tasks
fix:Split L0_nomodel_perf into 2 test to ensure better debug-ability and resource util for PA
#7705
opened Oct 15, 2024 by
indrajit96
Draft
6 of 19 tasks
test: TC for Metric P0 nv_load_time per model
#7697
opened Oct 14, 2024 by
indrajit96
8 of 20 tasks
Build: Update TRT release branch referenced in model gen file
#7693
opened Oct 11, 2024 by
pvijayakrish
3 of 20 tasks
Build: Update README and versions for 24.10
#7686
opened Oct 8, 2024 by
pvijayakrish
3 of 20 tasks