Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle RVs assigned to steps #4701

Merged
merged 4 commits into from
Sep 23, 2021

Conversation

ricardoV94
Copy link
Member

@ricardoV94 ricardoV94 commented May 15, 2021

Following some work in #4698 I realized some steppers (BinaryMetropolis and BinaryGibbsMetropolis) would silently ignore if a model rv_var was assigned manually to them, instead of the correct value_var. I added a check in assign_step_methods which is called in sample that raises an error when any of the manually assigned variables is not found in model.value_vars.

It would probably be better if the steppers themselves raised an informative error (or gracefully converted the rv_var to value_var, as some already do more or less incidentally), but the new check still seems reasonable in the long-term.


Here is a non-comprehensive list of problematic ways the steppers currently deal with receiving rv_vars instead of value_vars:

with pm.Model() as m:
    x = pm.HalfNormal("x", 1)
    l = [m["x"]]
    print(l)  # [x]
    step = pm.NUTS(l)
    print(l)  # [x_log__] CHANGED!
    print(step.vars)  # [x_log__]
  • Slice returns an empty list (and so wouldn't trigger the new check!)
with pm.Model() as m:
    x = pm.HalfNormal("x", 1)
    l = [m["x"]]
    print(l)  # [x]
    step = pm.Slice(l)
    print(l)  # [x]
    print(step.vars)  # []
  • Metropolis raises a cryptic error:
with pm.Model() as m:
    x = pm.HalfNormal("x", 1)
    l = [m["x"]]
    print(l)  # [x]
    step = pm.Metropolis(l)  # raises ValueError: need at least one array to concatenate
    ...   
  • BinaryMetropolis and BinaryGibbsMetropolis will happily incorporate the rv_vars (which triggers the new check)
with pm.Model() as m:
    x = pm.Bernoulli("x", 1)
    l = [m["x"]]
    print(l)  # [x]
    step = pm.BinaryGibbsMetropolis(l)
    print(l)  # [x]
    print(step.vars)  # [x]  # It's still the `rv_var`

@ricardoV94 ricardoV94 changed the title Raise error unused steps Raise error on unused variables assigned to steps May 15, 2021
Copy link
Contributor

@brandonwillard brandonwillard left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could we perform this check more directly in BlockedStep.__new__ instead? For example, could the exception be raised when pm.BinaryMetropolis([model["x"]]) is called?

@ricardoV94
Copy link
Member Author

I'll check. Sounds more reasonable in principle.

Do we want to be permissive (ie call the respective value_var ourselves if the input is a rv_var) or raise an error?

@brandonwillard
Copy link
Contributor

Do we want to be permissive (ie call the respective value_var ourselves if the input is a rv_var) or raise an error?

We need to be very straightforward about the argument types in our step methods, because we're going to convert those step methods to Aesara—or at least Numba/Cython—next, and those kinds of niceties don't translate well.

@michaelosthege
Copy link
Member

Do we want to be permissive (ie call the respective value_var ourselves if the input is a rv_var) or raise an error?

We need to be very straightforward about the argument types in our step methods, because we're going to convert those step methods to Aesara—or at least Numba/Cython—next, and those kinds of niceties don't translate well.

We also need to remember that PyMC3 is primarily a library that makes probabilistic modeling accessible to a lot of people who don't know what the difference between rv_var and a value_var is. One big contributor to this accessibility is the fact that PyMC3 automatically transforms variables and users in most cases don't even need to know about this.
Taking rv_vars for assignment into step methods is therefore quite important for a consistent and user friendly API.

@ricardoV94
Copy link
Member Author

Should we try to separate a bit more the initialization logic from the actual stepping algorithms then? I imagine that as long as we make sure the inputs to the actual step methods are valid (and are what users would expect) it doesn't matter how we "got them".

We are currently very lenient in how we accept starting point dictionaries as well (eg auto transforming the points)

@michaelosthege
Copy link
Member

Should we try to separate a bit more the initialization logic from the actual stepping algorithms then? I imagine that as long as we make sure the inputs to the actual step methods are valid (and are what users would expect) it doesn't matter how we "got them".

It think that'd be a good call. Could be a very thin function layer too, and maybe we'll even need it because of sampler stats? But let's take one step at a time. First we need to get v4 into master.

We are currently very lenient in how we accept starting point dictionaries as well (eg auto transforming the points)

Issue #4689 comes to mind.

@brandonwillard
Copy link
Contributor

We also need to remember that PyMC3 is primarily a library that makes probabilistic modeling accessible to a lot of people who don't know what the difference between rv_var and a value_var is. One big contributor to this accessibility is the fact that PyMC3 automatically transforms variables and users in most cases don't even need to know about this.
Taking rv_vars for assignment into step methods is therefore quite important for a consistent and user friendly API.

My statement was neither prescriptive nor did it preclude the "accessibility" changes implied here; it was a general guideline for making changes to this area of the code.

Right now, we only need to make sure that the early v4 codebase does not preclude the addition of "accessibility"-based features, but, if we pretend like the current state of the v4 codebase is somehow final and user-facing, then start building naive features around it in the name of "accessibility", we'll end up making development significantly more difficult.

Should we try to separate a bit more the initialization logic from the actual stepping algorithms then? I imagine that as long as we make sure the inputs to the actual step methods are valid (and are what users would expect) it doesn't matter how we "got them".

If we end up keeping the same BlockedStep interface at the user-level, then a conversion to value variables can take place in BlockedStep.__new__.

First, we need to take a look over everything and find out whether or not we can simply take random variables as vars in all cases. My guess is that we can.

No matter what, we want the codebase to be as uniform as possible (i.e. always pass one type or the other). There's less room for confusion that way.

This definitely needs to be fixed.

@ricardoV94
Copy link
Member Author

Thanks for the input. I'll take some time and come back with a worked suggestion.

@michaelosthege
Copy link
Member

This PR looks stale, but is included in the v4.0.0 milestone.
What should we do about it?

@ricardoV94
Copy link
Member Author

ricardoV94 commented Jul 17, 2021

I'll come back to it. But we can move it to 4.0.1 if there is an intent to release anytime soon

@michaelosthege michaelosthege modified the milestones: vNext (4.0.0), v4.0.1 Jul 17, 2021
@ricardoV94 ricardoV94 modified the milestones: v4.0.1, vNext (4.0.0) Sep 22, 2021
@ricardoV94 ricardoV94 marked this pull request as ready for review September 22, 2021 00:07
@codecov
Copy link

codecov bot commented Sep 22, 2021

Codecov Report

Merging #4701 (2f902a5) into main (bcc40ce) will increase coverage by 0.90%.
The diff coverage is 88.88%.

Impacted file tree graph

@@            Coverage Diff             @@
##             main    #4701      +/-   ##
==========================================
+ Coverage   76.34%   77.24%   +0.90%     
==========================================
  Files          86       85       -1     
  Lines       13931    14004      +73     
==========================================
+ Hits        10636    10818     +182     
+ Misses       3295     3186     -109     
Impacted Files Coverage Δ
pymc3/step_methods/__init__.py 100.00% <ø> (ø)
pymc3/step_methods/arraystep.py 94.24% <ø> (-0.72%) ⬇️
pymc3/step_methods/hmc/hmc.py 92.15% <ø> (ø)
pymc3/step_methods/hmc/nuts.py 97.50% <ø> (ø)
pymc3/step_methods/pgbart.py 95.80% <ø> (ø)
pymc3/step_methods/sgmcmc.py 0.00% <0.00%> (ø)
pymc3/step_methods/mlda.py 86.34% <66.66%> (-0.16%) ⬇️
pymc3/model.py 83.81% <100.00%> (+0.09%) ⬆️
pymc3/sampling.py 86.89% <100.00%> (+0.88%) ⬆️
pymc3/step_methods/compound.py 95.34% <100.00%> (+0.22%) ⬆️
... and 15 more

@ricardoV94 ricardoV94 changed the title Raise error on unused variables assigned to steps Handle RVs assigned to steps Sep 22, 2021
@michaelosthege michaelosthege merged commit b9f225b into pymc-devs:main Sep 23, 2021
@ricardoV94 ricardoV94 deleted the raise_error_unused_steps branch January 31, 2022 09:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants