Skewed Student-T distribution #7252

fonnesbeck · 2024-04-13T18:09:10Z

Description

Added skewed Student T distribution (Jones and Faddy implementation).

Checklist

Checked that the pre-commit linting/style checks pass
Included tests that prove the fix is effective or that the new feature works
Added necessary documentation (docstrings and/or example notebooks)
If you are a pro: each commit corresponds to a relevant logical change

Type of change

📚 Documentation preview 📚: https://pymc--7252.org.readthedocs.build/en/7252/

ricardoV94 · 2024-04-13T18:31:02Z

Add in pymc-experimental first?

jessegrabowski · 2024-04-14T11:17:07Z

If something like this goes in experimental, there should be a clear roadmap to go from there back to the main repo. This is a straightforward distribution that (I guess?) can't be easily implemented as a generative graph using only existing distributions via CustomDist. If it's in pmx, what are the criteria to move it over? It's not something like statespace or marginalmodel that proposes a huge new functionality, or something like r2d2m2 that implements a distribution that doesn't fit neatly into a typical pymc model as we present them in books/tutorials.

ricardoV94 · 2024-04-14T11:31:03Z

I don't think this one has a straightforward generative graph

ricardoV94 · 2024-04-14T11:40:13Z

I assumed something "according to x and y" was a bit experimental, hence my suggestion

tests/distributions/test_continuous.py

fonnesbeck · 2024-04-14T21:15:57Z

The "Following Jones and Faddy 2003" was to distinguish it from the range of available skewed Student T implementations (there are 3 or 4). I chose this one because it is available in SciPy and relatively straightforward to implement.

I think it will be a useful distribution because it allows one to specify a likelihood that is both skewed and overdispersed and converges to normal when a and b are both large and equal.
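The limiting behaviour described above can be checked numerically. The following is a minimal sketch, not the code from this PR: it writes out the Jones and Faddy (2003) log-density in pure Python, assuming the usual location-scale form with parameters a, b, mu and sigma. The function names (`jf_skew_t_logpdf`, `student_t_logpdf`, `normal_logpdf`) are hypothetical and chosen only for illustration.

```python
import math


def betaln(a, b):
    """Log of the Beta function, via log-gamma."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def jf_skew_t_logpdf(x, a, b, mu=0.0, sigma=1.0):
    """Jones-Faddy skew-t log-density (illustrative, hypothetical helper)."""
    z = (x - mu) / sigma
    r = z / math.sqrt(a + b + z * z)  # mathematically inside (-1, 1)
    return (
        (a + 0.5) * math.log1p(r)
        + (b + 0.5) * math.log1p(-r)
        - (a + b - 1.0) * math.log(2.0)
        - betaln(a, b)
        - 0.5 * math.log(a + b)
        - math.log(sigma)
    )


def student_t_logpdf(x, nu):
    """Standard Student-t log-density with nu degrees of freedom."""
    return (
        math.lgamma((nu + 1) / 2)
        - math.lgamma(nu / 2)
        - 0.5 * math.log(nu * math.pi)
        - (nu + 1) / 2 * math.log1p(x * x / nu)
    )


def normal_logpdf(x):
    """Standard normal log-density."""
    return -0.5 * x * x - 0.5 * math.log(2 * math.pi)


xs = [-2.0, -0.5, 0.0, 1.0, 3.0]

# a == b: the density is symmetric and coincides with Student-t, nu = 2a.
gap_to_t = max(
    abs(jf_skew_t_logpdf(x, 3.0, 3.0) - student_t_logpdf(x, 6.0)) for x in xs
)

# a == b and both large: the density approaches the standard normal.
gap_to_normal = max(
    abs(jf_skew_t_logpdf(x, 5000.0, 5000.0) - normal_logpdf(x)) for x in xs
)

# a > b: more mass to the right of mu than to the left (positive skew).
skew_gap = jf_skew_t_logpdf(1.0, 4.0, 2.0) - jf_skew_t_logpdf(-1.0, 4.0, 2.0)

print(gap_to_t, gap_to_normal, skew_gap)
```

In this sketch, unequal a and b give the skew, and small a and b give the heavy tails.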

fonnesbeck · 2024-04-15T02:27:14Z

Jax failure does not seem to be related to this PR

ricardoV94 · 2024-04-15T08:30:35Z

> Jax failure does not seem to be related to this PR

It's not. It has been failing since the last SciPy release.

ricardoV94 · 2024-04-15T08:31:41Z

But this one is: https://github.com/pymc-devs/pymc/actions/runs/8682226852/job/23806440256?pr=7252#step:7:616

You should try running those logp/logcdf/icdf tests with n_samples=-1 or whatever it is, to test all combinations locally instead of only a random subset.

ricardoV94 · 2024-04-15T08:33:29Z

You're missing the tests for the RV itself. That's also where we cover alternative parametrizations. Something like these:

pymc/tests/distributions/test_continuous.py

Lines 2179 to 2200 in 34c2d31

    
    class TestBeta(BaseTestDistributionRandom):
        pymc_dist = pm.Beta
        pymc_dist_params = {"alpha": 2.0, "beta": 5.0}
        expected_rv_op_params = {"alpha": 2.0, "beta": 5.0}
        reference_dist_params = {"a": 2.0, "b": 5.0}
        size = 15
        reference_dist = lambda self: ft.partial(clipped_beta_rvs, random_state=self.get_random_state())  # noqa E731
        checks_to_run = [
            "check_pymc_params_match_rv_op",
            "check_pymc_draws_match_reference",
            "check_rv_size",
        ]

    class TestBetaMuSigma(BaseTestDistributionRandom):
        pymc_dist = pm.Beta
        pymc_dist_params = {"mu": 0.5, "sigma": 0.25}
        expected_alpha, expected_beta = pm.Beta.get_alpha_beta(
            mu=pymc_dist_params["mu"], sigma=pymc_dist_params["sigma"]
        )
        expected_rv_op_params = {"alpha": expected_alpha, "beta": expected_beta}
        checks_to_run = ["check_pymc_params_match_rv_op"]

pymc/distributions/continuous.py

codecov-commenter · 2024-04-28T20:39:04Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 92.35%. Comparing base (60a6314) to head (bcb1b5d).
Report is 4 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #7252      +/-   ##
==========================================
+ Coverage   91.67%   92.35%   +0.68%     
==========================================
  Files         102      102              
  Lines       17017    17069      +52     
==========================================
+ Hits        15600    15764     +164     
+ Misses       1417     1305     -112

Files	Coverage Δ
pymc/distributions/__init__.py	`100.00% <ø> (ø)`
pymc/distributions/continuous.py	`97.93% <100.00%> (+0.09%)`	⬆️

... and 4 files with indirect coverage changes

ricardoV94 · 2024-04-29T06:23:47Z

pymc/distributions/continuous.py

+
+    @classmethod
+    def rng_fn(cls, rng, a, b, mu, sigma, size=None) -> np.ndarray:
+        return np.asarray(stats.jf_skew_t.rvs(a=a, b=b, size=size, random_state=rng)) * sigma + mu


Can't sigma and mu be passed as arguments to the rvs?

The way you wrote it has a bug similar to the one addressed by a recent bugfix (#7288), which shows up when sigma and mu are batched and size is None.

ricardoV94 · 2024-04-29

You marked this as resolved, but I still see the old code?

fonnesbeck · 2024-05-03

Sorry, I had forgotten to push.
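The batching problem raised in this review thread can be illustrated with a small sketch. It assumes NumPy is available and uses NumPy's normal sampler as a stand-in for `stats.jf_skew_t.rvs`. The arrays `mu` and `sigma` are made-up batched parameters. This is not the code that was merged.

```python
import numpy as np

mu = np.array([0.0, 10.0, 20.0])   # batched location (illustrative values)
sigma = np.array([1.0, 2.0, 3.0])  # batched scale (illustrative values)

# Pattern from the reviewed code: draw first, rescale afterwards.
# With size=None the sampler never sees the batch shape, so it returns ONE
# standardized draw, which is then broadcast across all three entries.
rng = np.random.default_rng(0)
rescaled = rng.standard_normal(size=None) * sigma + mu

# Pattern the review asks for: pass loc and scale to the sampler, which
# broadcasts them and makes one independent draw per batch entry.
rng = np.random.default_rng(0)
direct = rng.normal(loc=mu, scale=sigma, size=None)

z_rescaled = (rescaled - mu) / sigma  # the same number three times
z_direct = (direct - mu) / sigma      # three different numbers

print(z_rescaled)
print(z_direct)
```

Both results have shape `(3,)`, so a shape check alone would not catch the issue. The difference is that the rescaled draws are perfectly dependent across the batch.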

michaelosthege · 2024-05-06T07:21:27Z

Looks like test_skewstudentt_logp is failing on main, or is at least flaky?

ricardoV94 · 2024-05-06T08:41:20Z

These tests run a random subset of 100 combinations. We should set n_samples=-1 locally and see whether every combination passes (in both float64 and float32). I guess it does not.

CC @fonnesbeck
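The difference between a random subset and the full grid explains why such a failure can look flaky. The sketch below is a standalone illustration, not PyMC's test utility: `iter_param_combinations` and the `domains` values are hypothetical, and the convention that `n_samples=-1` means "everything" is mirrored from the comment above.

```python
import itertools
import random


def iter_param_combinations(domains, n_samples=100, seed=0):
    """Yield parameter dicts from the Cartesian product of `domains`.

    n_samples=-1 (or any value >= the grid size) yields every combination;
    otherwise a random subset of n_samples combinations is drawn.
    """
    names = list(domains)
    grid = list(itertools.product(*(domains[name] for name in names)))
    if 0 <= n_samples < len(grid):
        grid = random.Random(seed).sample(grid, n_samples)
    for combo in grid:
        yield dict(zip(names, combo))


# Hypothetical edge-heavy domains, loosely modelled on the failing point.
domains = {
    "a": [0.01, 1.0, 100.0],
    "b": [0.01, 1.0, 100.0],
    "mu": [-2.1, 0.0, 2.1],
    "sigma": [0.01, 1.0, 100.0],
    "value": [-2.1, -0.01, 0.01, 2.1],
}

subset = list(iter_param_combinations(domains, n_samples=100))
full = list(iter_param_combinations(domains, n_samples=-1))
print(len(subset), len(full))
```

With 324 combinations in this toy grid and 100 sampled per run, a single bad corner is hit in only about a third of runs, which matches the "failing, or at least flaky" symptom.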

michaelosthege · 2024-05-07T12:04:47Z

Running with float64 gives:

E           AssertionError: 
E           Arrays are not almost equal to 6 decimals
E           {'a': array(0.01), 'b': array(100.), 'mu': array(-2.1), 'sigma': array(0.01), 'value': array(-0.01)}
E           x and y -inf location mismatch:
E            x: array(-751.339449)
E            y: array(-inf)

ricardoV94 · 2024-05-07T12:08:36Z

Which underflows? PyMC or Scipy?

michaelosthege · 2024-05-07T12:09:51Z

> Which underflows? PyMC or Scipy?

SciPy produces the -inf.

What's the fix? I have it open and can push a branch real quick (if it's simple)

ricardoV94 · 2024-05-07T12:11:01Z

> > Which underflows? PyMC or Scipy?
>
> SciPy produces the -inf.
>
> What's the fix? I have it open and can push a branch real quick (if it's simple)

Try slightly less extreme parameter domains
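The size of the mismatch is consistent with floating-point underflow. The sketch below uses the finite log-density from the assertion output above to show that a density this small cannot be represented in float64, so any code path that forms the density first and takes its logarithm afterwards would return -inf. Whether SciPy takes exactly that path here is an assumption, not something established in this thread.

```python
import math

logp = -751.339449  # finite value reported by PyMC in the failing assertion

# Log of the smallest positive (subnormal) float64, about -744.44.
# Any log-density below this floor has a density that rounds to zero.
log_floor = math.log(5e-324)

density = math.exp(logp)  # underflows silently to 0.0
roundtrip = math.log(density) if density > 0.0 else -math.inf

print(log_floor, density, roundtrip)
```

Narrowing the tested parameter domains, as suggested, keeps the reference value above this floor so that both implementations can be compared.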

fonnesbeck added 2 commits April 13, 2024 13:02

Added skewed T distribution

1fca6c6

Merge branch 'main' into skewed_t

27d3180

Bugfixes to pdf

a97b37f

fonnesbeck requested review from jessegrabowski and aloctavodia April 13, 2024 18:40

ricardoV94 reviewed Apr 14, 2024

tests/distributions/test_continuous.py (outdated, resolved)

Removed superfluous test, added support point test

641895d

fonnesbeck added 2 commits April 14, 2024 16:22

Fixed test bug in skew student t

58e69f1

Removed superfluous test

e9a2e69

aloctavodia reviewed Apr 15, 2024

pymc/distributions/continuous.py (outdated, resolved)

fonnesbeck added 5 commits April 15, 2024 11:42

Added RV test

8bde1d8

typo;

834759b

Use betaln

821a5fc

Merge branch 'main' into skewed_t

58393b9

Fixed logp scaling bug

bcb1b5d

fonnesbeck requested a review from aloctavodia April 28, 2024 20:10

ricardoV94 requested changes Apr 29, 2024

Use explicit loc and scale arguments for rvs

92b1833

fonnesbeck requested a review from ricardoV94 April 29, 2024 16:27

ricardoV94 approved these changes May 4, 2024

ricardoV94 merged commit 606d4ff into pymc-devs:main May 4, 2024
20 checks passed

ricardoV94 added the enhancements label May 4, 2024

fonnesbeck deleted the skewed_t branch May 4, 2024 22:31

michaelosthege mentioned this pull request May 7, 2024

Fix test_skewstudentt_logp #7301

Merged
