forked from kserve/kserve
-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Code sync with upstream 2024-02-15 #206
Merged
openshift-merge-bot
merged 18 commits into
opendatahub-io:master
from
israel-hdez:kserve-20240215-code-sync
Feb 15, 2024
Merged
Code sync with upstream 2024-02-15 #206
openshift-merge-bot
merged 18 commits into
opendatahub-io:master
from
israel-hdez:kserve-20240215-code-sync
Feb 15, 2024
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
* Pass missing infer parameters Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix warnings Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add equal magic method for infer types Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Avoid inplace modification of parameters in conversion Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* feat: Support S3 transfer acceleration Signed-off-by: Yuan Tang <[email protected]> * Add missing field in ISVC yaml and storage.py Signed-off-by: Yuan Tang <[email protected]> * fix test Signed-off-by: Yuan Tang <[email protected]> * fix default Signed-off-by: Yuan Tang <[email protected]> --------- Signed-off-by: Yuan Tang <[email protected]>
* Initial huggingface server implementation Signed-off-by: Dan Sun <[email protected]> * Implement preprocess and predict for huggingface server Signed-off-by: Dan Sun <[email protected]> * Add postprocess Signed-off-by: Dan Sun <[email protected]> * Update lock file Signed-off-by: Dan Sun <[email protected]> * Update func signature Signed-off-by: Dan Sun <[email protected]> * Update encoding Signed-off-by: Dan Sun <[email protected]> * Add token classification test Signed-off-by: Dan Sun <[email protected]> * Add comments Signed-off-by: Dan Sun <[email protected]> * Fix comments Signed-off-by: Dan Sun <[email protected]> * Add generate endpoint Signed-off-by: Dan Sun <[email protected]> * Remove vllm async engine file Signed-off-by: Dan Sun <[email protected]> * Handle optional vllm Signed-off-by: Dan Sun <[email protected]> * Add async generator Signed-off-by: Dan Sun <[email protected]> * Fix lint Signed-off-by: Dan Sun <[email protected]> * Generate response streaming support Signed-off-by: Dan Sun <[email protected]> * rename async generate stream file Signed-off-by: Dan Sun <[email protected]> * Call generate endpoint directly Signed-off-by: Dan Sun <[email protected]> * Remove the tritonclient dependency Signed-off-by: Dan Sun <[email protected]> * Add huggingface dockerfile Signed-off-by: Dan Sun <[email protected]> * Test for triton bert case Signed-off-by: Dan Sun <[email protected]> * Add huggingface test Signed-off-by: Dan Sun <[email protected]> * Add Makefile Signed-off-by: Dan Sun <[email protected]> * Support gpu device Signed-off-by: Dan Sun <[email protected]> * Add gnereate parameters Signed-off-by: Dan Sun <[email protected]> * Fix generate tests Signed-off-by: Dan Sun <[email protected]> * Run huggingface test Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Dan Sun <[email protected]>
…elcars (kserve#3384) Modelcar: Add mount of shared volume to transformer container Adding the missing mount of the emptyDir volume that holds the symbolic link to the model data in the model sidecar to a transformer container, if it is configured. Signed-off-by: Roland Huß <[email protected]>
chore: The cloudevents/sdk-go was migrated to v2, however, the v1 is still present in the go.mod. It was being used bu the pkg/logger/worker.go file to set a field that was not being in use anymore, thus it was removed Signed-off-by: Spolti <[email protected]>
…dency to >=2.8.1, <3.0.0 (kserve#3389) * Bump google cloud storage to fix warnings Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix poetry deprecated field warning Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Restrict ray dependency to ^2.8.1 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
…ve#3104) * telepresence-setup.sh Signed-off-by: Roland Huß <[email protected]> * Add hint for downloading telepresence Signed-off-by: Roland Huß <[email protected]> --------- Signed-off-by: Roland Huß <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Yuan Tang <[email protected]>
* Change docker data directory to /mnt/docker Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Change huggingface test virtual env location Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Use symlink to move docker data dir Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * skip huggingface tests for python 3.8 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
…tom handler (kserve#3405) Add exception handler for model server and Add ability to specify custom exception handler Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
) Updated python-release script. Signed-off-by: Andrews Arokiam <[email protected]>
* feat: Add HuggingFace runtime out-of-the-box support Signed-off-by: Yuan Tang <[email protected]> * Update violation list Signed-off-by: Yuan Tang <[email protected]> * Add to charts Signed-off-by: Yuan Tang <[email protected]> * Regenerate everything Signed-off-by: Yuan Tang <[email protected]> * fix lint Signed-off-by: Yuan Tang <[email protected]> * fix lint Signed-off-by: Yuan Tang <[email protected]> * Empty-Commit Signed-off-by: Yuan Tang <[email protected]> * Empty-Commit Signed-off-by: Yuan Tang <[email protected]> --------- Signed-off-by: Yuan Tang <[email protected]>
Update link to Slack instructions Signed-off-by: Yuan (Terry) Tang <[email protected]>
…rve#3431) Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Fix parameters value type conversion Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix datatype check in infer type Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix datatype check in infer type Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
This is a small fix to the test_raw_logger.py/test_kserve_logger E2E test. This E2E test was creating a `message-dumper-raw` that was dependant on a Serverless setup, leading to the E2E to fail on an environment where Istio nor KNative are available. The fix is to add the right annotation to `message-dumper-raw` to deploy it in Raw mode, and adjust the logger URL of the `isvc-logger-raw` to point to the Kubernetes Service, rather than the KNative service. Signed-off-by: Edgar Hernández <[email protected]>
Signed-off-by: Edgar Hernández <[email protected]>
openshift-ci
bot
requested review from
terrytangyuan and
VedantMahabaleshwarkar
February 15, 2024 18:06
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: israel-hdez The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/lgtm |
openshift-merge-bot
bot
merged commit Feb 15, 2024
c6bf7da
into
opendatahub-io:master
28 checks passed
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What this PR does / why we need it:
Code sync up to upstream commit d3ec987