Fix bug whereby partial traces have fewer draws than would be available #4318

MarcoGorelli · 2020-12-09T12:25:18Z

Adding a quick test for _choose_chains, which is reached if there's a keyboard interrupt during parallel sampling.

something I'm confused about is this part of pymc3/sampling.py :

    final_length = l_sort[0]
    last_total = 0
    for i, length in enumerate(l_sort):
        total = (i + 1) * length
        if total < last_total:
            use_until = i
            break
        last_total = total
        final_length = length
    else:
        use_until = len(lengths)

So, we iterate through the traces (from the longest one to the shortest one), and stop when the total ((i + 1) * length is smaller than it was for the previous trace.

For example, if we have traces of length 10, 7, 3, then we would get:

with i=0, length=10: total=10, last_total=0 : as total >= last_total, we continue
with i=1, length=7: total=14, last_total=10: as total >= last_total, we continue
with i=2, length=3: total=9, last_total=14: as total < last_total, we break

I just don't see why we'd do that - what's the significance of (i + 1) * length? This function comes from #3011 (cc @aseyboldt in case you could offer any help understanding this)

codecov · 2020-12-09T12:54:59Z

Codecov Report

Merging #4318 (79c7e87) into master (2a38198) will increase coverage by 0.11%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #4318      +/-   ##
==========================================
+ Coverage   87.54%   87.66%   +0.11%     
==========================================
  Files          88       88              
  Lines       14272    14264       -8     
==========================================
+ Hits        12495    12504       +9     
+ Misses       1777     1760      -17

Impacted Files	Coverage Δ
pymc3/sampling.py	`89.66% <100.00%> (+1.99%)`	⬆️

ColCarroll · 2020-12-09T13:14:01Z

Wow, good catch! I think this is clever code, and maybe has an error in it. It could use a comment, in any case.

The goal here is to "trim the traces", while keeping as many draws as possible. Recall that the traces are drawn using multiprocessing, so they will have various lengths when a keyboard interrupt hits. I think the intended algorithm is:

Sort lengths in a descending manner, so the longest is first.
for j=0...len(lengths)
we can trim the first j traces, and have a total of (j + 1) * lengths[j] draws
choose j to maximize this total

I think the correct implementation is

use_until = np.argmax(l_sort * np.arange(1, l_sort.shape[0] + 1))

ColCarroll · 2020-12-09T13:17:28Z

Two side notes:

This code path is not used overly often, since no one has spotted this bug.
In our heart of hearts we want to maximize the effective sample size (not the number of draws), and would prefer to keep the number of chains specified by the user.

Given side note 1, I don't think spending the time to get side note 2 is worth it.

MarcoGorelli · 2020-12-09T15:03:29Z

Thanks @ColCarroll for your quick review!

Your explanation of the goal to trim the traces makes sense to me, and I think your np.argmax implementation is clearer. It's also better, as it passes the first tests case I added (which failed on master).

e.g. if we have traces of lengths 5, 2, 2, then the implementation on master would've done:

with i=0, length=5: total=5, last_total=0 : as total >= last_total, we continue
with i=1, length=2: total=4, last_total=5: as total < last_total, we break

However, if it had continued, it would've found

with i=2, length=2: total=6

which would be the actual maximum. np.argmax would find this

michaelosthege

Great test!
I just requested a change on a docstring that is a bit misleading. Otherwise ✔️

pymc3/sampling.py

add test for _choose_chains

669ba45

MarcoGorelli requested review from junpenglao, aseyboldt and michaelosthege December 9, 2020 12:29

MarcoGorelli added 2 commits December 9, 2020 14:54

fix bug - choose overall maximum

2523001

update release notes

3281215

MarcoGorelli changed the title ~~Add test for uncovered _choose_chains~~ Fix bug whereby partial traces have fewer draws than would be available Dec 9, 2020

📝

67a795f

MarcoGorelli added 3 commits December 9, 2020 15:04

🎨

541b888

minimise diff

b42241e

minimise diff

5478a73

michaelosthege requested changes Dec 12, 2020

View reviewed changes

pymc3/sampling.py Outdated Show resolved Hide resolved

MarcoGorelli commented Dec 12, 2020

View reviewed changes

pymc3/sampling.py Outdated Show resolved Hide resolved

MarcoGorelli added 2 commits December 12, 2020 15:14

Update pymc3/sampling.py

6d264f2

Merge branch 'master' into cover-choose-chains

79c7e87

michaelosthege approved these changes Dec 12, 2020

View reviewed changes

twiecki merged commit 6f15cbb into pymc-devs:master Dec 12, 2020

MarcoGorelli deleted the cover-choose-chains branch December 12, 2020 18:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix bug whereby partial traces have fewer draws than would be available #4318

Fix bug whereby partial traces have fewer draws than would be available #4318

MarcoGorelli commented Dec 9, 2020

codecov bot commented Dec 9, 2020 •

edited

Loading

ColCarroll commented Dec 9, 2020

ColCarroll commented Dec 9, 2020

MarcoGorelli commented Dec 9, 2020 •

edited

Loading

michaelosthege left a comment

Fix bug whereby partial traces have fewer draws than would be available #4318

Fix bug whereby partial traces have fewer draws than would be available #4318

Conversation

MarcoGorelli commented Dec 9, 2020

codecov bot commented Dec 9, 2020 • edited Loading

Codecov Report

ColCarroll commented Dec 9, 2020

ColCarroll commented Dec 9, 2020

MarcoGorelli commented Dec 9, 2020 • edited Loading

michaelosthege left a comment

Choose a reason for hiding this comment

codecov bot commented Dec 9, 2020 •

edited

Loading

MarcoGorelli commented Dec 9, 2020 •

edited

Loading