
Cast learning_rate to float lambda for pickle safety when doing model.load #1901

Merged (5 commits, Apr 22, 2024)

Conversation

Contributor

@markscsmith markscsmith commented Apr 19, 2024

Description

closes #1900

Motivation and Context

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (update in the documentation)

Checklist

  • I've read the CONTRIBUTION guide (required)
  • I have updated the changelog accordingly (required).
  • My change requires a change to the documentation.
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have updated the documentation accordingly.
  • I have opened an associated PR on the SB3-Contrib repository (if necessary)
  • I have opened an associated PR on the RL-Zoo3 repository (if necessary)
  • I have reformatted the code using make format (required)
  • I have checked the codestyle using make check-codestyle and make lint (required)
  • I have ensured make pytest and make type both pass. (required)
  • I have checked that the documentation builds using make doc (required)

Note: You can run most of the checks using make commit-checks.

Note: we are using a maximum length of 127 characters per line

@@ -92,7 +92,7 @@ def get_schedule_fn(value_schedule: Union[Schedule, float]) -> Schedule:
         value_schedule = constant_fn(float(value_schedule))
     else:
         assert callable(value_schedule)
-    return value_schedule
+    return lambda _: float(value_schedule(_))
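To see what this one-line change does in context, here is a minimal, self-contained sketch of `get_schedule_fn` with the fix applied (simplified from SB3; it assumes SB3's `Schedule = Callable[[float], float]` convention). The wrapper guarantees the schedule returns a builtin `float` even if the user's function returns another numeric type (e.g. a numpy scalar), which is what made `torch.load(..., weights_only=True)` choke:

```python
from typing import Callable, Union

# SB3 convention: a schedule maps progress_remaining (1.0 -> 0.0) to a value.
Schedule = Callable[[float], float]

def constant_fn(val: float) -> Schedule:
    """Return a schedule that always yields the same value."""
    return lambda _: val

def get_schedule_fn(value_schedule: Union[Schedule, float]) -> Schedule:
    if isinstance(value_schedule, (float, int)):
        value_schedule = constant_fn(float(value_schedule))
    else:
        assert callable(value_schedule)
    # Cast the schedule's output to a builtin float so that non-float return
    # types (e.g. numpy scalars) cannot break unpickling under
    # torch.load(weights_only=True).
    return lambda progress_remaining: float(value_schedule(progress_remaining))
```

For example, `get_schedule_fn(lambda p: 3)(1.0)` yields `3.0` as a builtin `float`, and a plain constant like `get_schedule_fn(0.001)` still behaves as before.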
Member
@araffin araffin Apr 19, 2024

maybe a better solution is to do a call to value_schedule(1.0) and check that the return type is a float (and output a useful error message if not).
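That first suggestion could be sketched roughly like this (an illustrative sketch only, not SB3 code; `check_schedule_output` is a hypothetical helper name). Probing the schedule once with `1.0` fails fast with a useful message instead of deferring the error to load time:

```python
from typing import Callable

def check_schedule_output(value_schedule: Callable[[float], float]):
    """Probe the schedule once and raise a clear error if it does not
    return a builtin float (hypothetical helper, not part of SB3)."""
    initial_value = value_schedule(1.0)
    if not isinstance(initial_value, float):
        raise ValueError(
            f"Schedule must return a float, got {type(initial_value)}; "
            "non-float types can make torch.load(weights_only=True) fail."
        )
    return value_schedule
```

Note that `isinstance(np.float32(1.0), float)` is `False`, so this check would also catch numpy scalars, the case that triggered issue #1900.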

Member
@araffin araffin

or, what you do is fine but I would explicitly name the parameter progress_remaining and add a comment of why

Contributor Author
@markscsmith markscsmith

Hm... I see the value in both. Let me noodle a bit and I'll see if I can sort it out during my lunch. Thanks araffin!

Member
@araffin araffin left a comment

LGTM, thanks =)

@araffin araffin merged commit 9a74938 into DLR-RM:master Apr 22, 2024
4 checks passed
@markscsmith
Contributor Author

Awesome! Thanks again araffin! The docs on SB3 you and the crew wrote and your guidance made this a breeze :)

friedeggs pushed a commit to friedeggs/stable-baselines3 that referenced this pull request Jul 22, 2024
Cast learning_rate to float lambda for pickle safety when doing model.load (DLR-RM#1901)

* create failing test for unpickle error

* Fix learning_rate argument causing failure in weights_only=True if passed a function with non-float types

* Updated with feedback from araffin on PR#1901

* Update test and version

* Update changelog and SBX doc

---------

Co-authored-by: Antonin Raffin <antonin.raffin@ensta.org>
Successfully merging this pull request may close these issues.

[Bug]: if learning_rate function uses special types, they can cause torch.load to fail when weights_only=True