[Bug]: Issues with vLLM tool call functionality leading to abnormal requests #11284
Labels: bug
Your current environment
The output of `python collect_env.py`
Model Input Dumps
No response
🐛 Describe the bug
I am encountering issues while using the tool call capability of vLLM. Some requests behave abnormally, and the log shows the following error:
Additionally, my startup script is as follows:
```bash
-d vllm/vllm-openai:v0.6.3.post1 \
  --host 0.0.0.0 --port 30000 \
  --model /llm/models/Qwen2.5-32B-Instruct \
  --served-model-name qwen2.5-32b-instruct \
  --dtype auto \
  --tensor-parallel-size 2 \
  --gpu-memory-utilization 0.90 \
  --enable-prefix-caching \
  --enable-auto-tool-choice \
  --tool-call-parser hermes
```
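For reference, the requests that trigger the problem are ordinary tool-call requests sent through the OpenAI-compatible API. Below is a minimal client-side sketch of such a request; the `get_weather` tool definition is a hypothetical placeholder rather than my actual payload, and it assumes the server above is reachable at `http://localhost:30000/v1`:

```python
from openai import OpenAI

# Point the OpenAI client at the vLLM server started above.
client = OpenAI(base_url="http://localhost:30000/v1", api_key="EMPTY")

# "get_weather" is a made-up example tool, not the real one from my application.
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

response = client.chat.completions.create(
    model="qwen2.5-32b-instruct",  # matches --served-model-name
    messages=[{"role": "user", "content": "What is the weather in Beijing?"}],
    tools=tools,
    tool_choice="auto",  # relies on --enable-auto-tool-choice + hermes parser
)
print(response.choices[0].message.tool_calls)
```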
Currently, I am using vLLM version: vllm/vllm-openai:v0.6.3.post1
Could you please provide guidance on how to resolve this issue? Any help would be greatly appreciated.
Thank you!