nlive calculation bug fix #119

williamjameshandley · 2020-08-18T13:16:28Z

Description

There is an off-by-one nlive bug that has been lurking for a while. This is most clearly seen in likelihoods with plateau regions, but has also been occuring in the computation of decreasing nlive values for the final set of live points.

In anesthetic, nlive is a column in the table, and therefore 'belongs' to the point corresponding to a row. It is used to compute the volume compression via the probability distribution:

P(t_i) = n_i t_i**(n_i -1)

n_i therefore corresponds to the number of live points for which row i is the lowest likelihood live point. This means for example that the final point at the end of a run should have nlive=1, not 0.

This PR adjusts the tests so that the values are correct, introduces a failing test for a 'wedding cake' likelihood, which is then fixed by subsequent commits

Fixes Possible issue in dlogX calculation #83 as nlive is now never zero

Checklist:

I have performed a self-review of my own code
My code is PEP8 compliant (flake8 anesthetic tests)
My code contains compliant docstrings (pydocstyle --convention=numpy anesthetic)
New and existing unit tests pass locally with my changes (python -m pytest)
I have added tests that prove my fix is effective or that my feature works

williamjameshandley · 2020-08-18T13:19:11Z

@lukashergt, this likely has implications for your work concerning re-weighting/slicing of nested sampling chains. It would likely be worth your pushing a PR with a failing test that captures that behaviour, which we can check if merging this provides a fix.

codecov · 2020-08-18T13:21:08Z

Codecov Report

Merging #119 into master will increase coverage by 0.74%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #119      +/-   ##
==========================================
+ Coverage   91.98%   92.72%   +0.74%     
==========================================
  Files          16       16              
  Lines        1460     1458       -2     
==========================================
+ Hits         1343     1352       +9     
+ Misses        117      106      -11

Impacted Files	Coverage Δ
anesthetic/gui/plot.py	`97.05% <100.00%> (+2.88%)`	⬆️
anesthetic/samples.py	`100.00% <100.00%> (+1.63%)`	⬆️
anesthetic/utils.py	`97.32% <100.00%> (ø)`
anesthetic/gui/widgets.py	`98.86% <0.00%> (+4.54%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c1202ce...a2e74f2. Read the comment docs.

anesthetic/utils.py

lukashergt · 2020-08-18T14:07:56Z

@lukashergt, this likely has implications for your work concerning re-weighting/slicing of nested sampling chains. It would likely be worth your pushing a PR with a failing test that captures that behaviour, which we can check if merging this provides a fix.

I'm not sure I can turn this into a failing test.

What I have done is infer the evidence in two separate ways:

by masking my NestedSamples data frame and recomputing live points:
```
ns_masked._compute_nlive(ns_masked.logL_birth)
```
which then let's me run ns_masked.ns_output() again to get a distribution for my logZ_new1
by calculating prior and posterior volumes from the masked weights and doing
```
logZ_new2 = logZ_old.mean() + np.log(V_post) - np.log(V_prior)
```

Should I expect that logZ_new1.mean() == logZ_new2 ?

williamjameshandley · 2020-08-18T14:19:06Z

Should I expect that logZ_new1.mean() == logZ_new2 ?

Or at least that they're close to within the error margins of ns_output.

Basically it would be good to do this for a standard example (e.g. on one of the ones in tests/example_data).

andrewfowlie · 2020-08-19T03:49:53Z

I'm not that up to speed with the anesthetic codebase, but I have checked that the logZ method now gives an acceptable answer on a MultiNest run with plateaus.

Let me know if you want the MN run files or more details.

andrewfowlie

loooks good, a couple of very small comments

bin/wedding_cake.py

williamjameshandley · 2020-08-19T07:44:05Z

@andrewfowlie if you're happy with the trimming down of the code, could you re-approve the PR so it can be merged?

@lukashergt let me know if you have any problem with/comments on this merge.

williamjameshandley · 2020-08-19T07:45:04Z

as the coverage checks were failing with the minor reorganisation I also added a couple of tests to bring us up to 100% coverage of samples.

lukashergt

I think it would be good to have a version update for this PR (2.0.0b2) such that we can easier reference these changes later.

Other than that I'm not sure whether this indicates that there might be an over-correction, such that there still is an off-by-one error and whether that should be addressed here, or in #120.

williamjameshandley · 2020-08-19T10:14:38Z

I think it would be good to have a version update for this PR (2.0.0b2) such that we can easier reference these changes later.

Done, and I made 2.0.0-beta.1 an actual beta release with a tag.

Other than that I'm not sure whether this indicates that there might be an over-correction, such that there still is an off-by-one error and whether that should be addressed here, or in #120.

I plan to look quite carefully now at #120, as it's not impossible that off-by-one errors remain.

As @andrewfowlie is in NZ and likely asleep by now, could @lukashergt re-approve the version bump.

williamjameshandley added 3 commits August 18, 2020 12:13

Added a wedding cake example likelihood

a0c1e9d

Adjusted tests to fail correctly

d00e1e2

fix to nlive computation

450b0f6

williamjameshandley requested review from andrewfowlie and lukashergt August 18, 2020 13:17

Removed numpy error guards as initially noticed in #83 and #81

aa5fad2

williamjameshandley commented Aug 18, 2020

View reviewed changes

anesthetic/utils.py Show resolved Hide resolved

lukashergt mentioned this pull request Aug 18, 2020

logzero masking at prior and likelihood level #120

Merged

5 tasks

williamjameshandley added this to the 2.0.0 milestone Aug 18, 2020

andrewfowlie previously approved these changes Aug 19, 2020

View reviewed changes

bin/wedding_cake.py Outdated Show resolved Hide resolved

bin/wedding_cake.py Outdated Show resolved Hide resolved

williamjameshandley added 2 commits August 19, 2020 08:07

Removed redundant txt file, as it can be generated on demand efficiently

52aaae3

Made samples 100% coverage

a299ba2

williamjameshandley dismissed andrewfowlie’s stale review via a299ba2 August 19, 2020 07:09

Added docstrings to the wedding cake function

65d80ab

andrewfowlie previously approved these changes Aug 19, 2020

View reviewed changes

lukashergt requested changes Aug 19, 2020

View reviewed changes

beta version bumped

a2e74f2

williamjameshandley dismissed andrewfowlie’s stale review via a2e74f2 August 19, 2020 10:06

lukashergt approved these changes Aug 19, 2020

View reviewed changes

williamjameshandley merged commit f546bc7 into master Aug 19, 2020

williamjameshandley deleted the nlive_bug_fix branch August 19, 2020 10:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nlive calculation bug fix #119

nlive calculation bug fix #119

williamjameshandley commented Aug 18, 2020 •

edited

Loading

williamjameshandley commented Aug 18, 2020

codecov bot commented Aug 18, 2020 •

edited

Loading

lukashergt commented Aug 18, 2020

williamjameshandley commented Aug 18, 2020

andrewfowlie commented Aug 19, 2020 •

edited

Loading

andrewfowlie left a comment

williamjameshandley commented Aug 19, 2020

williamjameshandley commented Aug 19, 2020

lukashergt left a comment •

edited

Loading

williamjameshandley commented Aug 19, 2020

nlive calculation bug fix #119

nlive calculation bug fix #119

Conversation

williamjameshandley commented Aug 18, 2020 • edited Loading

Description

Checklist:

williamjameshandley commented Aug 18, 2020

codecov bot commented Aug 18, 2020 • edited Loading

Codecov Report

lukashergt commented Aug 18, 2020

williamjameshandley commented Aug 18, 2020

andrewfowlie commented Aug 19, 2020 • edited Loading

andrewfowlie left a comment

Choose a reason for hiding this comment

williamjameshandley commented Aug 19, 2020

williamjameshandley commented Aug 19, 2020

lukashergt left a comment • edited Loading

Choose a reason for hiding this comment

williamjameshandley commented Aug 19, 2020

williamjameshandley commented Aug 18, 2020 •

edited

Loading

codecov bot commented Aug 18, 2020 •

edited

Loading

andrewfowlie commented Aug 19, 2020 •

edited

Loading

lukashergt left a comment •

edited

Loading