-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[pull] master from kserve:master #361
Conversation
Signed-off-by: Vincent Hou <[email protected]>
Hi @pull[bot]. Thanks for your PR. I'm waiting for a opendatahub-io member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
fix conflict Signed-off-by: jooho <[email protected]>
* Add precaution again running v1 endpoints on openai models Signed-off-by: grandbora <[email protected]> * Remove the check from explain Signed-off-by: grandbora <[email protected]> * Add a warning log for explain Signed-off-by: grandbora <[email protected]> --------- Signed-off-by: grandbora <[email protected]>
* Add FP16 datatype support for OIP grpc Add grpc server tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add grpcio-testing as test dependency Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix model repository initialization default value Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove fp16 global map Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Resolve comments Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* added flag to return raw prediction results Signed-off-by: Andrews Arokiam <[email protected]> * black fix Signed-off-by: Andrews Arokiam <[email protected]> * unit test bug fix Signed-off-by: Andrews Arokiam <[email protected]> * unittest for token classification Signed-off-by: Andrews Arokiam <[email protected]> * verify codegen Signed-off-by: Andrews Arokiam <[email protected]> * bug fix Signed-off-by: Andrews Arokiam <[email protected]> --------- Signed-off-by: Andrews Arokiam <[email protected]>
Signed-off-by: Alex Peters <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>
Fix model_id and model_dir precendence Signed-off-by: Dan Sun <[email protected]>
Signed-off-by: Dan Sun <[email protected]>
* Fixup max_length for HF and model info for vLLM Signed-off-by: Dattu Sharma <[email protected]> * Use vLLM's implementation for max_length Also fixup error in calculating input sequence lenngth Signed-off-by: Dattu Sharma <[email protected]> * Fixup linter Signed-off-by: Dattu Sharma <[email protected]> * Add license to new file Signed-off-by: Dattu Sharma <[email protected]> * Add bloom test case for max_tokens Revert input length fix Signed-off-by: Dattu Sharma <[email protected]> * Set limit on opt chat competion e2e test Signed-off-by: Dattu Sharma <[email protected]> --------- Signed-off-by: Dattu Sharma <[email protected]>
#3712) * Fix input token count and add completion usage Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add max_length for test models Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
update version to 0.13.0 Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
/ok-to-test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: israel-hdez, pull[bot] The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
97c83cf
into
opendatahub-io:master
updating ksi file
See Commits and Changes for more details.
Created by pull[bot]
Can you help keep this open source service alive? 💖 Please sponsor : )