[Inference] Some effective methods to reduce the loading time of pir models #70219

aooxin · 2024-12-13T10:50:25Z

PR Category

Inference

PR Types

New features

Description

jit.save support option of separate_parameters
pir model load support multi threads
params_sync_among_devices_pass support multi threads and multi streams
使用时模型保存与加载方式需要改变：

模型需要重新导出，导出时paddle.jit.save api新增separate_parameters=True参数
config = paddle.inference.Config(model_path) 直接传入模型文件所在目录path即可

… dev_20241213

paddle-bot · 2024-12-13T10:50:30Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

aooxin added 4 commits December 13, 2024 18:45

params_sync_among_devices_pass support multi threads and multi streams

cf1f055

add separate_parameters for pir jit.save

1d8af89

pir model load support multi threads

695442a

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

0ef9683

… dev_20241213

paddle-bot bot added the contributor External developers label Dec 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Inference] Some effective methods to reduce the loading time of pir models #70219

[Inference] Some effective methods to reduce the loading time of pir models #70219

aooxin commented Dec 13, 2024

paddle-bot bot commented Dec 13, 2024

[Inference] Some effective methods to reduce the loading time of pir models #70219

Are you sure you want to change the base?

[Inference] Some effective methods to reduce the loading time of pir models #70219

Conversation

aooxin commented Dec 13, 2024

PR Category

PR Types

Description

paddle-bot bot commented Dec 13, 2024