[Doc]: nvidia ammo has been renamed #8010

Open
1 task done
stas00 opened this issue Aug 29, 2024 · 2 comments
Labels
documentation Improvements or additions to documentation stale

Comments

@stas00
Contributor

stas00 commented Aug 29, 2024

📚 The doc issue

https://docs.vllm.ai/en/latest/quantization/fp8_e4m3_kvcache.html is outdated with respect to NVIDIA AMMO.

From: NVIDIA/TensorRT-LLM#1368 (comment)

The AMMO toolkit has been renamed to "TensorRT model optimizer" and the documentation is available at https://nvidia.github.io/TensorRT-Model-Optimizer/ . Examples related to Model Optimizer are available at https://github.com/NVIDIA/TensorRT-Model-Optimizer?tab=readme-ov-file

I'm following the breadcrumbs - will update if I can figure out the up-to-date instructions.

Suggest a potential alternative/fix

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
@stas00 stas00 added the documentation Improvements or additions to documentation label Aug 29, 2024
@stas00
Contributor Author

stas00 commented Aug 29, 2024

So the new installation process is:

pip install "nvidia-modelopt[all]" --extra-index-url https://pypi.nvidia.com

Pre-compile/build the tool (could take a long time to finish):

python -c "import modelopt.torch.quantization.extensions as ext; print(ext.cuda_ext); print(ext.cuda_ext_fp8)"
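Before running the (potentially slow) pre-compile step, it may help to confirm that the renamed package actually landed in the current environment. The following is only a minimal sketch using the Python standard library: the helper name `modelopt_status` is made up for illustration, and it assumes the distribution is called `nvidia-modelopt` and the top-level module `modelopt`, as in the commands above. It does not touch any modelopt API.

```python
import importlib.util
from importlib import metadata


def modelopt_status(dist_name="nvidia-modelopt", module_name="modelopt"):
    """Return (importable, version) for a package without importing it.

    modelopt_status is a hypothetical helper; the default names are taken
    from the pip and python commands quoted in this comment.
    """
    # find_spec on a top-level name returns None when the module is absent.
    importable = importlib.util.find_spec(module_name) is not None
    try:
        version = metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        # The distribution is not installed in this environment.
        version = None
    return importable, version


if __name__ == "__main__":
    importable, version = modelopt_status()
    print(f"modelopt importable: {importable}, installed version: {version}")
```

If this reports that the module is not importable, the pip command above most likely did not install into the interpreter being used.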


This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

@github-actions github-actions bot added the stale label Nov 29, 2024