Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Code sync with upstream 2024-02-15 #206

Merged

Conversation

israel-hdez
Copy link

What this PR does / why we need it:
Code sync up to upstream commit d3ec987

sivanantha321 and others added 18 commits January 26, 2024 21:30
* Pass missing infer parameters

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Add tests

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Fix warnings

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Add equal magic method for infer types

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Avoid inplace modification of parameters in conversion

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

---------

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* feat: Support S3 transfer acceleration

Signed-off-by: Yuan Tang <[email protected]>

* Add missing field in ISVC yaml and storage.py

Signed-off-by: Yuan Tang <[email protected]>

* fix test

Signed-off-by: Yuan Tang <[email protected]>

* fix default

Signed-off-by: Yuan Tang <[email protected]>

---------

Signed-off-by: Yuan Tang <[email protected]>
* Initial huggingface server implementation

Signed-off-by: Dan Sun <[email protected]>

* Implement preprocess and predict for huggingface server

Signed-off-by: Dan Sun <[email protected]>

* Add postprocess

Signed-off-by: Dan Sun <[email protected]>

* Update lock file

Signed-off-by: Dan Sun <[email protected]>

* Update func signature

Signed-off-by: Dan Sun <[email protected]>

* Update encoding

Signed-off-by: Dan Sun <[email protected]>

* Add token classification test

Signed-off-by: Dan Sun <[email protected]>

* Add comments

Signed-off-by: Dan Sun <[email protected]>

* Fix comments

Signed-off-by: Dan Sun <[email protected]>

* Add generate endpoint

Signed-off-by: Dan Sun <[email protected]>

* Remove vllm async engine file

Signed-off-by: Dan Sun <[email protected]>

* Handle optional vllm

Signed-off-by: Dan Sun <[email protected]>

* Add async generator

Signed-off-by: Dan Sun <[email protected]>

* Fix lint

Signed-off-by: Dan Sun <[email protected]>

* Generate response streaming support

Signed-off-by: Dan Sun <[email protected]>

* rename async generate stream file

Signed-off-by: Dan Sun <[email protected]>

* Call generate endpoint directly

Signed-off-by: Dan Sun <[email protected]>

* Remove the tritonclient dependency

Signed-off-by: Dan Sun <[email protected]>

* Add huggingface dockerfile

Signed-off-by: Dan Sun <[email protected]>

* Test for triton bert case

Signed-off-by: Dan Sun <[email protected]>

* Add huggingface test

Signed-off-by: Dan Sun <[email protected]>

* Add Makefile

Signed-off-by: Dan Sun <[email protected]>

* Support gpu device

Signed-off-by: Dan Sun <[email protected]>

* Add gnereate parameters

Signed-off-by: Dan Sun <[email protected]>

* Fix generate tests

Signed-off-by: Dan Sun <[email protected]>

* Run huggingface test

Signed-off-by: Dan Sun <[email protected]>

---------

Signed-off-by: Dan Sun <[email protected]>
…elcars (kserve#3384)

Modelcar: Add mount of shared volume to transformer container

Adding the missing mount of the emptyDir volume that holds the symbolic link to the model data in the model sidecar to a transformer container, if it is configured.

Signed-off-by: Roland Huß <[email protected]>
chore:  The cloudevents/sdk-go was migrated to v2, however, the v1 is still present in the
        go.mod. It was being used bu the pkg/logger/worker.go file to set a field that was
        not being in use anymore, thus it was removed

Signed-off-by: Spolti <[email protected]>
…dency to >=2.8.1, <3.0.0 (kserve#3389)

* Bump google cloud storage to fix warnings

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Fix poetry deprecated field warning

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Rebase master

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Restrict ray dependency to ^2.8.1

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

---------

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
…ve#3104)

* telepresence-setup.sh

Signed-off-by: Roland Huß <[email protected]>

* Add hint for downloading telepresence

Signed-off-by: Roland Huß <[email protected]>

---------

Signed-off-by: Roland Huß <[email protected]>
* Change docker data directory to /mnt/docker

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Change huggingface test virtual env location

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Use symlink to move docker data dir

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* skip huggingface tests for python 3.8

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

---------

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
…tom handler (kserve#3405)

Add exception handler for model server and Add ability to specify custom exception handler

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
)

Updated python-release script.

Signed-off-by: Andrews Arokiam <[email protected]>
* feat: Add HuggingFace runtime out-of-the-box support

Signed-off-by: Yuan Tang <[email protected]>

* Update violation list

Signed-off-by: Yuan Tang <[email protected]>

* Add to charts

Signed-off-by: Yuan Tang <[email protected]>

* Regenerate everything

Signed-off-by: Yuan Tang <[email protected]>

* fix lint

Signed-off-by: Yuan Tang <[email protected]>

* fix lint

Signed-off-by: Yuan Tang <[email protected]>

* Empty-Commit

Signed-off-by: Yuan Tang <[email protected]>

* Empty-Commit

Signed-off-by: Yuan Tang <[email protected]>

---------

Signed-off-by: Yuan Tang <[email protected]>
Update link to Slack instructions

Signed-off-by: Yuan (Terry) Tang <[email protected]>
* Fix parameters value type conversion

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Add test

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Fix datatype check in infer type

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Fix datatype check in infer type

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

---------

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
This is a small fix to the test_raw_logger.py/test_kserve_logger E2E test.

This E2E test was creating a `message-dumper-raw` that was dependant on a Serverless setup, leading to the E2E to fail on an environment where Istio nor KNative are available.

The fix is to add the right annotation to `message-dumper-raw` to deploy it in Raw mode, and adjust the logger URL of the `isvc-logger-raw` to point to the Kubernetes Service, rather than the KNative service.

Signed-off-by: Edgar Hernández <[email protected]>
Copy link

openshift-ci bot commented Feb 15, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: israel-hdez

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@spolti
Copy link
Member

spolti commented Feb 15, 2024

/lgtm

@openshift-ci openshift-ci bot added the lgtm label Feb 15, 2024
@openshift-merge-bot openshift-merge-bot bot merged commit c6bf7da into opendatahub-io:master Feb 15, 2024
28 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

7 participants