[BUG] Unable to Use quantization_setting for Customizing MoQ in DeepSpeed Inference #6853
Describe the bug
Unable to customize MoQ using quantization_setting with DeepSpeed inference.
To Reproduce
Follow the example from the DeepSpeed inference tutorial on datatypes and quantized models.
Below is the full script to reproduce the issue:
Expected behavior
The script should take the input in English and produce the French translation using the T5 model. However, an error is raised:
ds_report output
Screenshots
Full terminal output from running the provided script on my machine:
System info (please complete the following information):