Adds resuming and saving model to sb3 example #135

Ivan-267 · 2023-07-21T12:15:10Z

Adds the ability to save and load model for resuming training, run inference and save periodic checkpoints with CL arguments.

Limitations of this implementation:

While sb3 will save logs if starting multiple runs with the same experiment name by adding e.g. expname_1, expname_2 etc. to the folder name, my current checkpoint saving implementation requires a unique experiment dir or name argument to be set and prompts the user to set a different folder if the checkpoint folder exists, this is to prevent overwriting checkpoints from a previous run with the same experiment dir/name.

For example, if using the default arguments (no exp dir or name specified), the logs will be saved to:
.\logs\sb3\Experiment_1
and checkpoints will be saved to:
.\logs\sb3\Experiment_checkpoints

Resuming training does not keep track of previous timesteps in this implementation. So for example training with --timesteps=10_000 and saving a model, then resuming training with the same setting and saving the model again will train the model for 20_000 total timesteps, but 10_000 steps will be displayed in the console log after the final training run.

This is just a draft implementation, I welcome your feedback on whether it is sufficient as specified before starting changes to the documentation.

visuallization

Nice, looking good! I haven't had time to test it though.

examples/stable_baselines3_example.py

visuallization · 2023-07-23T11:39:51Z

examples/stable_baselines3_example.py

-
+parser.add_argument(
+    "--timesteps",
+    default=1_000_000,


examples/stable_baselines3_example.py

visuallization · 2023-07-23T11:43:30Z

examples/stable_baselines3_example.py

+                                             "remove the folder containing the checkpoints. ")
+
+if args.inference and args.resume_model_path is None:
+    raise parser.error("Using --inference requires --resume_model_path to be set.")


Nice even with proper error handling. ❤️

Co-authored-by: Florentin Luca Rieger <[email protected]>

Ivan-267 · 2023-07-23T12:45:33Z

Nice, looking good! I haven't had time to test it though.

Thanks for the review. I ran a quick test on it in which it worked, but I didn't test every use case in-depth.

edbeeching · 2023-07-23T12:49:38Z

examples/stable_baselines3_example.py

@@ -21,34 +24,114 @@
    "--experiment_dir",
    default="logs/sb3",
    type=str,
-    help="The name of the experiment directory, in which the tensorboard logs are getting stored",
+    help="The name of the experiment directory, in which the tensorboard logs and checkpoints (if enabled) are getting stored."


small nit, can the default experiment_name be lower case, "experiment"?

edbeeching

This is great! Maybe we could expose the show_window parameter, in other scripts I have this option as --viz. I think training can be faster if you don't render. (but obviously you can't see the agent's progress)

Otherwise LGTM feel free to merge.

Co-authored-by: Florentin Luca Rieger <[email protected]>

Ivan-267 · 2023-07-23T13:41:57Z

This is great! Maybe we could expose the show_window parameter, in other scripts I have this option as --viz. I think training can be faster if you don't render. (but obviously you can't see the agent's progress)

Otherwise LGTM feel free to merge.

Thank you for the review, I agree. Camera sensors may need --viz when used (I haven't tested them at all yet so I'm not sure about this, also whether they work better with a GPU, which I can't test yet), if they do we can consider adding a help note later to where the argument appears.

Will merge after all tests pass.

Ivan-267 added 3 commits July 21, 2023 13:55

Adds --resume_model_path and --save_model_path

25c9514

Adds timesteps and implements saving

10d8130

Adds auto-checkpoint saving and inference

7b86692

Ivan-267 requested review from visuallization and edbeeching July 21, 2023 13:43

Ivan-267 added 3 commits July 21, 2023 15:45

CL args help text update

602147b

Added error message when using inference without resume_model_path

7792893

Removes a left-over print from testing

e6e6214

visuallization approved these changes Jul 23, 2023

View reviewed changes

Adds infer to resume training description

537fdee

Co-authored-by: Florentin Luca Rieger <[email protected]>

Ivan-267 marked this pull request as ready for review July 23, 2023 12:43

edbeeching reviewed Jul 23, 2023

View reviewed changes

edbeeching approved these changes Jul 23, 2023

View reviewed changes

edbeeching mentioned this pull request Jul 23, 2023

Added ability to specify iterations for sb3 #48

Closed

Ivan-267 and others added 3 commits July 23, 2023 14:58

Default experiment name changed to lowercase

91c4e01

Adds --viz argument for changing rendering mode

52651e0

Add default=False to inference

82bd742

Co-authored-by: Florentin Luca Rieger <[email protected]>

Ivan-267 merged commit 2c348c8 into main Jul 23, 2023

Ivan-267 deleted the sb3_example_add_save_and_resume branch July 23, 2023 13:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds resuming and saving model to sb3 example #135

Adds resuming and saving model to sb3 example #135

Ivan-267 commented Jul 21, 2023 •

edited

Loading

visuallization left a comment

visuallization Jul 23, 2023

visuallization Jul 23, 2023

Ivan-267 commented Jul 23, 2023

edbeeching Jul 23, 2023 •

edited

Loading

edbeeching left a comment

Ivan-267 commented Jul 23, 2023

Adds resuming and saving model to sb3 example #135

Adds resuming and saving model to sb3 example #135

Conversation

Ivan-267 commented Jul 21, 2023 • edited Loading

visuallization left a comment

Choose a reason for hiding this comment

visuallization Jul 23, 2023

Choose a reason for hiding this comment

visuallization Jul 23, 2023

Choose a reason for hiding this comment

Ivan-267 commented Jul 23, 2023

edbeeching Jul 23, 2023 • edited Loading

Choose a reason for hiding this comment

edbeeching left a comment

Choose a reason for hiding this comment

Ivan-267 commented Jul 23, 2023

Ivan-267 commented Jul 21, 2023 •

edited

Loading

edbeeching Jul 23, 2023 •

edited

Loading