About rolling_online_management again #572

yieldbook · 2021-08-19T14:54:10Z

❓ Questions and Help

If I set the rolling_step = 20, there will be more than 100 tasks to run and it takes more than 10 minutes to init the data in a single task(some processors are really time consuming). Any suggestion for me to speed up the data processing? Can these tasks run concurrently?
Thanks a lot!

We sincerely suggest you to carefully read the documentation of our library as well as the official paper. After that, if you still feel puzzled, please describe the question clearly under this issue.

you-n-g · 2021-08-22T07:33:11Z

For speeding up the data processing, you can refer to this issue

yieldbook · 2021-08-28T03:56:24Z

Thank you for the suggestion.
But from time to time, I need to change the processors or use different cols, and I have to train the models again. It will take a long time if there are too many models. I noticed there are some codes related to multiprocessing, like the "force_release" or worker() function. What should I do to make these models run concurrently?

you-n-g · 2021-08-29T05:55:34Z

@yieldbook
We have developed a task management module.
You can refer to the docs;
You can create a task pool and run multiple workers on different machines.

yieldbook · 2021-09-14T14:43:06Z

For speeding up the data processing, you can refer to this issue
I'm not good at coding, so I ask stupid questions.
Ideally, the processors should process the data for just once and use the same processed data in very rolling tasks, but with updated segments.
I used the to_pickle function to save the dataset in the first loop, but the problem is how I can update the segment of the dataset in the next loop?

you-n-g · 2021-09-17T00:59:31Z

@yieldbook We are drafting a demo to show a case to dump the processed data to the disk to avoid duplicated data processing
#606
Please check if the demo answers your question and help to review it.
Thanks :)

yieldbook · 2021-09-19T16:08:08Z

@you-n-g Thanks a lot for the demo. It's much much faster now, but dumping the handler to disk and loading it again is still too slow, especially when the handler is huge. It's better to keep the handler in memory and update the segment in every loop. That should be really efficient.

Wangwuyi123 · 2021-09-27T06:26:15Z

@yieldbook We updated that demo to show a case to dump the process data to the memory to reduce disk IO #606

Please check if the demo answers your question and help to review it.

yieldbook · 2021-10-08T14:42:13Z

@Wangwuyi123 Thanks a lot. It's very helpful. I have a following up question. In the old backtest function, I can pass the pred_score generated from the rolling tasks directly to the backtest, but in the new backtest function, pred_score is no longer accepted. How can I backtest the rolling tasks?

you-n-g · 2021-10-08T17:56:16Z

@yieldbook
We will add some more user-friendly functions to the new backtest function soon.

github-actions · 2022-01-06T18:01:52Z

This issue is stale because it has been open for three months with no activity. Remove the stale label or comment on the issue otherwise this will be closed in 5 days

you-n-g · 2022-01-07T02:08:06Z

@yieldbook
Here is a more user-friendly interface in the new version of backtesting.
https://qlib.readthedocs.io/en/latest/component/strategy.html#running-backtest

yieldbook added the question Further information is requested label Aug 19, 2021

you-n-g self-assigned this Sep 15, 2021

github-actions bot added the stale label Jan 6, 2022

you-n-g closed this as completed Jan 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About rolling_online_management again #572

About rolling_online_management again #572

yieldbook commented Aug 19, 2021

you-n-g commented Aug 22, 2021

yieldbook commented Aug 28, 2021

you-n-g commented Aug 29, 2021

yieldbook commented Sep 14, 2021

you-n-g commented Sep 17, 2021

yieldbook commented Sep 19, 2021

Wangwuyi123 commented Sep 27, 2021

yieldbook commented Oct 8, 2021

you-n-g commented Oct 8, 2021

github-actions bot commented Jan 6, 2022

you-n-g commented Jan 7, 2022

About rolling_online_management again #572

About rolling_online_management again #572

Comments

yieldbook commented Aug 19, 2021

❓ Questions and Help

you-n-g commented Aug 22, 2021

yieldbook commented Aug 28, 2021

you-n-g commented Aug 29, 2021

yieldbook commented Sep 14, 2021

you-n-g commented Sep 17, 2021

yieldbook commented Sep 19, 2021

Wangwuyi123 commented Sep 27, 2021

yieldbook commented Oct 8, 2021

you-n-g commented Oct 8, 2021

github-actions bot commented Jan 6, 2022

you-n-g commented Jan 7, 2022