[pull] master from kserve:master #357

Merged
merged 50 commits into from
May 17, 2024
Commits
96ca17a
docs: Move Alibi explainer to docs (#3579)
terrytangyuan Apr 7, 2024
19314ab
build: Add flake8 and black to pre-commit hooks (#3578)
terrytangyuan Apr 9, 2024
d0cb667
fix: Set writable cache folder to avoid permission issue. Fixes #3562…
terrytangyuan Apr 9, 2024
9fd262f
golang.org/x/net Allocation of Resources Without Limits or Throttling…
spolti Apr 14, 2024
888afd5
Support OpenAIModel in ModelRepository (#3590)
grandbora Apr 16, 2024
d01d6c3
updated xgboost to support json and ubj models (#3551)
andyi2it Apr 16, 2024
385b8c2
Security fix - CVE 2024 24786 (#3585)
andyi2it Apr 16, 2024
9cc4eed
VLLM support for OpenAI Completions in HF server (#3589)
gavrissh Apr 19, 2024
09a9168
Add a user friendly error message for http exceptions (#3581)
grandbora Apr 20, 2024
362cd18
Fix model unload in server stop method (#3587)
sivanantha321 Apr 21, 2024
f1b4d4b
feat: Provide minimal distribution of CRDs (#3492)
terrytangyuan Apr 21, 2024
af37834
build: Allow pre-commit to keep changes in reformatted code (#3604)
terrytangyuan Apr 21, 2024
39a4866
Fix golint errors (#3552)
andyi2it Apr 21, 2024
9a897ed
Fix make deploy-dev-storage-initializer not working (#3617)
sivanantha321 Apr 23, 2024
ec59f43
Fix Pydantic 2 warnings (#3622)
cmaddalozzo Apr 23, 2024
34ea6f0
build: Fix CRD copying in generate-install.sh (#3620)
terrytangyuan Apr 23, 2024
6c23281
chore: v0.12.1 install files (#3619)
terrytangyuan Apr 23, 2024
cefdc2e
Fix CVE-2023-45288 for qpext (#3618)
sivanantha321 Apr 23, 2024
47265b2
Allow rerunning failed workflows by comment (#3550)
andyi2it Apr 25, 2024
a9d747e
Add OpenAI API support to Huggingfaceserver (#3582)
cmaddalozzo Apr 25, 2024
0549c98
add re-run info in the PR templates (#3633)
spolti Apr 27, 2024
e27aeec
chore: Bump ModelMesh version to v0.12.0-rc0 in Helm chart (#3642)
terrytangyuan Apr 28, 2024
5b51837
build: Remove misleading logs from minimal-crdgen.sh (#3641)
terrytangyuan Apr 28, 2024
622f32f
Add e2e tests for huggingface (#3600)
sivanantha321 Apr 29, 2024
1c5b0f9
set default SAFETENSORS_FAST_GPU and HF_HUB_DISABLE_TELEMETRY in HF S…
lizzzcai Apr 29, 2024
3f9b442
Remove generate endpoints (#3654)
cmaddalozzo Apr 30, 2024
0fe5d3f
Assign device to input tensors in huggingface server with huggingface…
saileshd1402 Apr 30, 2024
8cfb3e0
Test image builds for ARM64 arch in CI (#3629)
sivanantha321 Apr 30, 2024
f717e12
Fix Huggingface server stopping criteria (#3659)
cmaddalozzo Apr 30, 2024
36a3e6d
Enabled the multiple domains support on an inference service (#3615)
houshengbo Apr 30, 2024
9c6a6b8
Explicitly specify pad token id when generating tokens (#3565)
sivanantha321 May 2, 2024
5011c7d
Fix quick install does not cleans up Istio installer (#3660)
sivanantha321 May 3, 2024
2cddaa4
Add base model for proxying request to an OpenAI API enabled model se…
cmaddalozzo May 3, 2024
0fb5696
Add headers to predictor exception logging (#3658)
grandbora May 3, 2024
a3fda9c
workflow file for cherry-pick on comment (#3653)
andyi2it May 4, 2024
59d705b
Enhance controller setup based on available CRDs (#3472)
israel-hdez May 4, 2024
bfc2e21
Bump version to 0.13.0-rc0 (#3665)
cmaddalozzo May 6, 2024
f3c3220
upgrade vllm/transformers version (#3671)
johnugeorge May 8, 2024
d608056
Add openai models endpoint (#3666)
cmaddalozzo May 9, 2024
629e4ae
feat: Support customizable deployment strategy for RawDeployment mode…
terrytangyuan May 9, 2024
a30d402
Enable dtype support for huggingface server (#3613)
Datta0 May 9, 2024
ca50e18
Add method for checking model health/readiness (#3673)
cmaddalozzo May 10, 2024
ce9b0e8
fix for extract zip from gcs (#3510)
andyi2it May 10, 2024
a4cce1a
Update Dockerfile and Readme (#3676)
gavrissh May 11, 2024
9dbce8e
Update huggingface readme (#3678)
alexagriffith May 11, 2024
56a2940
fix: HPA equality check should include annotations (#3650)
terrytangyuan May 11, 2024
024f69b
Fix: huggingface runtime in helm chart (#3679)
yuzisun May 13, 2024
4c6ce45
Fix: model id and model dir check order (#3680)
yuzisun May 14, 2024
892e5dc
Fix:vLLM Model Supported check throwing circular dependency (#3688)
gavrissh May 15, 2024
8771c3d
Fix: Allow null in Finish reason streaming response in vLLM (#3684)
gavrissh May 15, 2024
18 changes: 18 additions & 0 deletions .flake8
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
[flake8]
max-line-length = 120
extend-ignore = E203,E501,E701, B017
exclude =
.venv,
venv,
python/kserve/kserve/exceptions.py,
python/kserve/kserve/configuration.py,
python/kserve/kserve/rest.py,
python/kserve/kserve/api_client.py,
python/kserve/test/test_v1*.py,
python/kserve/kserve/__init__.py,
python/kserve/test/__init__.py,
python/kserve/test/test_knative*.py,
python/kserve/kserve/protocol/grpc/grpc_predict_v2*.py
python/*_pb2.py
docs/**/*.py
python/kserve/kserve/protocol/rest/openai/types/openapi.py
5 changes: 5 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE.md
@@ -46,3 +46,8 @@ Please also list any relevant details for your test configuration.
```release-note

```

**Re-running failed tests**

- `/rerun-all` - rerun all failed workflows.
- `/rerun-workflow <workflow name>` - rerun a specific failed workflow. Only one workflow name can be specified. Multiple /rerun-workflow commands are allowed per comment.
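
As a rough illustration of the comment syntax the template describes, the sketch below parses a PR comment into rerun requests. It is not the repository's implementation (the workflow that handles these comments is not part of this excerpt); `parse_rerun_commands` and its return shape are hypothetical names chosen for the example. It assumes each command sits on its own line.

```python
import re

# Hypothetical helper: one workflow name per /rerun-workflow command,
# any number of commands per comment, as the template text states.
_RERUN_WORKFLOW = re.compile(r"^/rerun-workflow\s+(.+?)\s*$")


def parse_rerun_commands(comment_body: str) -> dict:
    """Collect rerun requests from a comment, one command per line."""
    rerun_all = False
    workflows = []
    for raw_line in comment_body.splitlines():
        line = raw_line.strip()
        if line == "/rerun-all":
            rerun_all = True
            continue
        match = _RERUN_WORKFLOW.match(line)
        if match:
            # Everything after the command is treated as a single workflow name.
            workflows.append(match.group(1))
    return {"rerun_all": rerun_all, "workflows": workflows}
```

Treating the whole remainder of the line as the name is a design choice of this sketch; it keeps workflow names that contain spaces intact and matches the "only one workflow name per command" rule.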
28 changes: 18 additions & 10 deletions .github/workflows/artexplainer-docker-publish.yml
@@ -27,17 +27,24 @@ jobs:
runs-on: ubuntu-latest

steps:
- uses: actions/checkout@v4
- name: Checkout source
uses: actions/checkout@v4

- name: Setup QEMU
uses: docker/setup-qemu-action@v3

- name: Setup Docker Buildx
uses: docker/setup-buildx-action@v3

- name: Run tests
run: |
if [ -f docker-compose.test.yml ]; then
docker-compose --file docker-compose.test.yml build
docker-compose --file docker-compose.test.yml run sut
else
cd python
docker buildx build . --file artexplainer.Dockerfile
fi
uses: docker/build-push-action@v5
with:
platforms: linux/amd64, linux/arm64/v8
context: python
file: python/artexplainer.Dockerfile
push: false
# https://github.com/docker/buildx/issues/1533
provenance: false

# Push image to GitHub Packages.
# See also https://docs.docker.com/docker-hub/builds/
@@ -49,7 +56,8 @@ jobs:
if: github.event_name == 'push'

steps:
- uses: actions/checkout@v4
- name: Checkout source
uses: actions/checkout@v4

- name: Setup QEMU
uses: docker/setup-qemu-action@v3
31 changes: 0 additions & 31 deletions .github/workflows/cherry-pick.yml

This file was deleted.

23 changes: 23 additions & 0 deletions .github/workflows/comment-cherry-pick.yml
@@ -0,0 +1,23 @@
name: Cherry Pick On Comment

on:
issue_comment:
types: [created]

jobs:
cherry-pick:

name: Cherry Pick
if: github.event.issue.pull_request != '' && contains(github.event.comment.body, '/cherry-pick') && github.event.comment.author_association == 'CONTRIBUTOR'
runs-on: ubuntu-latest

steps:
- name: Checkout the latest code
uses: actions/checkout@v2
with:
token: ${{ secrets.GITHUB_TOKEN }}
fetch-depth: 0 # otherwise, you will fail to push refs to dest repo
- name: Automatic Cherry Pick
uses: hianhianhian/gha-cherry-pick@master
env:
GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
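
The job-level `if:` above combines three checks. A minimal Python approximation of that gate, written against the `issue_comment` webhook payload, might look like the following. `should_cherry_pick` is a hypothetical name, and the lower-casing mirrors the fact that the `contains()` expression function in GitHub Actions is not case sensitive.

```python
def should_cherry_pick(event: dict) -> bool:
    """Approximate the workflow's `if:` condition for an issue_comment event."""
    issue = event.get("issue", {})
    comment = event.get("comment", {})

    # `issue.pull_request` is only present when the comment is on a pull request.
    is_pull_request = bool(issue.get("pull_request"))

    # contains() in workflow expressions ignores case, so compare lower-cased text.
    has_command = "/cherry-pick" in comment.get("body", "").lower()

    # The condition as written accepts exactly the CONTRIBUTOR association.
    is_contributor = comment.get("author_association") == "CONTRIBUTOR"

    return is_pull_request and has_command and is_contributor
```

Read this way, a comment whose `author_association` is `MEMBER` or `OWNER` would not satisfy the condition as it is written in the diff.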
52 changes: 32 additions & 20 deletions .github/workflows/custom-model-grpc-publish.yml
@@ -26,15 +26,18 @@ jobs:
- name: Checkout source
uses: actions/checkout@v4

- name: Setup Docker Buildx
uses: docker/setup-buildx-action@v3

- name: Run tests
run: |
if [ -f docker-compose.test.yml ]; then
docker-compose --file docker-compose.test.yml build
docker-compose --file docker-compose.test.yml run sut
else
cd python
docker buildx build . --file custom_model_grpc.Dockerfile
fi
uses: docker/build-push-action@v5
with:
platforms: linux/amd64
context: python
file: python/custom_model_grpc.Dockerfile
push: false
# https://github.com/docker/buildx/issues/1533
provenance: false

# Push image to GitHub Packages.
# See also https://docs.docker.com/docker-hub/builds/
@@ -49,15 +52,16 @@ jobs:
- name: Checkout source
uses: actions/checkout@v4

- name: Build image
run: |
cd python
docker buildx build . --file custom_model_grpc.Dockerfile --tag $IMAGE_NAME
- name: Setup Docker Buildx
uses: docker/setup-buildx-action@v3

- name: Log into registry
run: docker login -u ${{ secrets.DOCKER_USER }} -p ${{ secrets.DOCKER_PASSWORD }}
- name: Login to DockerHub
uses: docker/login-action@v3
with:
username: ${{ secrets.DOCKER_USER }}
password: ${{ secrets.DOCKER_PASSWORD }}

- name: Push image
- name: Export version variable
run: |
IMAGE_ID=kserve/$IMAGE_NAME

@@ -73,8 +77,16 @@ jobs:
# Use Docker `latest` tag convention
[ "$VERSION" == "master" ] && VERSION=latest

echo IMAGE_ID=$IMAGE_ID
echo VERSION=$VERSION

docker tag $IMAGE_NAME $IMAGE_ID:$VERSION
docker push $IMAGE_ID:$VERSION
echo VERSION=$VERSION >> $GITHUB_ENV
echo IMAGE_ID=$IMAGE_ID >> $GITHUB_ENV

- name: Build and push
uses: docker/build-push-action@v5
with:
platforms: linux/amd64
context: python
file: python/custom_model_grpc.Dockerfile
push: true
tags: ${{ env.IMAGE_ID }}:${{ env.VERSION }}
# https://github.com/docker/buildx/issues/1533
provenance: false
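
The reworked publish job no longer tags and pushes inline; it writes `VERSION` and `IMAGE_ID` to `$GITHUB_ENV` so the later `docker/build-push-action` step can read them as `env.IMAGE_ID` and `env.VERSION`. The sketch below models only the steps visible in this diff (the `kserve/` prefix, the `master` to `latest` rule, and the two appended lines). How `VERSION` is first derived is collapsed in the diff and is deliberately not reproduced. `export_version` and the sample image name are illustrative assumptions.

```python
import os
import tempfile


def export_version(image_name: str, version: str, env_path: str) -> dict:
    """Mimic the visible part of the 'Export version variable' step."""
    image_id = f"kserve/{image_name}"

    # Use Docker `latest` tag convention for the master branch.
    if version == "master":
        version = "latest"

    # Appending KEY=VALUE lines is how a step hands variables to later steps.
    with open(env_path, "a", encoding="utf-8") as env_file:
        env_file.write(f"VERSION={version}\n")
        env_file.write(f"IMAGE_ID={image_id}\n")

    return {"IMAGE_ID": image_id, "VERSION": version, "tag": f"{image_id}:{version}"}


# Usage: a throwaway file stands in for the file that $GITHUB_ENV points to.
with tempfile.TemporaryDirectory() as tmp:
    env_path = os.path.join(tmp, "github_env")
    result = export_version("custom-model-grpc", "master", env_path)
    with open(env_path, encoding="utf-8") as env_file:
        written = env_file.read()
```

The `tag` value corresponds to the `tags: ${{ env.IMAGE_ID }}:${{ env.VERSION }}` input of the build-and-push step.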
69 changes: 69 additions & 0 deletions .github/workflows/e2e-test.yml
@@ -33,6 +33,7 @@ env:
PMML_IMG: "pmmlserver"
PADDLE_IMG: "paddleserver"
CUSTOM_MODEL_GRPC_IMG: "custom-model-grpc"
HUGGINGFACE_IMG: "huggingfaceserver"
# Explainer images
ART_IMG: "art-explainer"
# Transformer images
@@ -199,6 +200,14 @@ jobs:
compression-level: 0
if-no-files-found: error

- name: Upload huggingface image
uses: actions/upload-artifact@v4
with:
name: ${{ env.HUGGINGFACE_IMG }}-${{ github.sha }}
path: ${{ env.DOCKER_IMAGES_PATH }}/${{ env.HUGGINGFACE_IMG }}-${{ github.sha }}
compression-level: 0
if-no-files-found: error

explainer-runtime-build:
runs-on: ubuntu-latest
needs: [poetry-check]
@@ -847,3 +856,63 @@ jobs:
if: always()
run: |
./test/scripts/gh-actions/status-check.sh "kourier"

test-llm:
runs-on: ubuntu-latest
needs:
[ kserve-image-build, predictor-runtime-build]
steps:
- name: Checkout source
uses: actions/checkout@v4

- name: Free-up disk space
uses: ./.github/actions/free-up-disk-space

- name: Setup Go
uses: actions/setup-go@v5
with:
go-version: "1.21"
- name: Setup Python
uses: actions/setup-python@v5
with:
python-version: "3.9"

- name: Setup Minikube
uses: ./.github/actions/minikube-setup

- name: KServe dependency setup
uses: ./.github/actions/kserve-dep-setup

- name: Download base images
uses: ./.github/actions/base-download

- name: Download huggingface server image
uses: actions/download-artifact@v4
with:
name: ${{ env.HUGGINGFACE_IMG }}-${{ github.sha }}
path: ./tmp

- name: Load docker images
uses: ./.github/actions/load-docker-images
with:
directory: ./tmp

- name: Install Poetry and version plugin
run: ./test/scripts/gh-actions/setup-poetry.sh

- name: Install KServe
run: |
./test/scripts/gh-actions/setup-kserve.sh

kubectl get pods -n kserve
kubectl describe pods -n kserve

- name: Run E2E tests
timeout-minutes: 30
run: |
./test/scripts/gh-actions/run-e2e-tests.sh "llm" "2"

- name: Check system status
if: always()
run: |
./test/scripts/gh-actions/status-check.sh
6 changes: 1 addition & 5 deletions .github/workflows/go.yml
@@ -29,10 +29,6 @@ jobs:
run: |
go get -v -t -d ./...

- name: Go lint
run: |
make fmt

- name: Test
id: test
run: |
@@ -57,4 +53,4 @@ jobs:
filename: coverage.json
label: coverage
message: ${{ steps.test.outputs.coverage }}
color: green
color: green
28 changes: 28 additions & 0 deletions .github/workflows/golangci-lint.yml
@@ -0,0 +1,28 @@
name: golangci-lint
on:
push:
branches:
- master
- main
pull_request:

permissions:
contents: read
pull-requests: read

jobs:
golangci:
name: lint
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- uses: actions/setup-go@v5
with:
go-version: '1.21'
cache: false
- name: golangci-lint
uses: golangci/golangci-lint-action@v4
with:
version: v1.56
args: --out-format=line-number
