
Fixing negative binomial phi cutoff #1497

Merged

Conversation


@martinmodrak martinmodrak commented Dec 9, 2019

Summary

This introduces more tests of the behavior of the negative binomial 2 distribution when phi is large and fixes issues discovered along the way. Currently:

  • improved numerical stability of the density by using binomial_coefficient_log and rearranging some operations
  • improved numerical stability of the derivative w.r.t. phi
  • this removed the need to delegate to the Poisson density for large phi, so that whole code branch was deleted
  • addressed a potential size mismatch noticed in Cleanup code in neg_binomial_2_lpmf and neg_binomial_2_log_lpmf #1531
  • renamed local variables that had double underscores in their names

This PR requires the improvements from PR #1614 to be merged before it is merged.
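For reference, the NB2 density being stabilized is C(n + phi - 1, n) · (mu/(mu+phi))^n · (phi/(mu+phi))^phi, with the binomial coefficient evaluated on the log scale. A minimal Python sketch of that value via lgamma (illustrative only — the actual Stan math code works on autodiff types and avoids recomputation via operand partials):

```python
from math import lgamma, log, exp

def neg_binomial_2_lpmf(n, mu, phi):
    """Log NB2 pmf: C(n + phi - 1, n) * (mu/(mu+phi))^n * (phi/(mu+phi))^phi.
    The log binomial coefficient is computed via lgamma, mirroring the role of
    binomial_coefficient_log in Stan math."""
    log_binom = lgamma(n + phi) - lgamma(n + 1) - lgamma(phi)
    return (log_binom
            + n * (log(mu) - log(mu + phi))
            + phi * (log(phi) - log(mu + phi)))

# sanity check: probabilities sum to 1 and the mean is mu
mu, phi = 1.5, 2.0
probs = [exp(neg_binomial_2_lpmf(n, mu, phi)) for n in range(2000)]
assert abs(sum(probs) - 1.0) < 1e-12
assert abs(sum(n * p for n, p in enumerate(probs)) - mu) < 1e-10
```

The side-effect noted below comes from this form: mu and phi meet inside log(mu + phi) in every term, so fewer pieces can be dropped when phi is constant and propto=true.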

Tests

New tests:

  • Created a set of test values and derivatives in Mathematica (Wolfram Cloud) and test against those
  • Changed the finite differences test for derivatives to cover more of the parameter space and to use complex-step differentiation instead of finite diffs. This test requires a somewhat looser tolerance for the derivative w.r.t. phi for n > 100000 and large phi. Importantly, the sign of the derivative does not always match (e.g. expected = -3.24e-10, actual = 9e-12); I am not sure whether the complex step or the function is to blame, so I assume it is OK for now.
  • Extended the test for extreme phi values to cover more of the parameter space
  • Tests of values and derivatives against analytical solutions for n = 0 and n = 1 (this allows me to test even mu and phi values where Mathematica will not calculate exact solutions).
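The n = 0 and n = 1 closed forms fall straight out of the density (the binomial coefficient is 1 and phi, respectively). A Python sketch of that comparison with a mixed relative/absolute tolerance — the function names and parameter grid are illustrative, not the actual test code:

```python
from math import lgamma, log

def lpmf(n, mu, phi):
    # general NB2 log density via lgamma
    return (lgamma(n + phi) - lgamma(n + 1) - lgamma(phi)
            + n * (log(mu) - log(mu + phi))
            + phi * (log(phi) - log(mu + phi)))

def lpmf_n0(mu, phi):
    # closed form for n = 0: the binomial coefficient is 1
    return phi * (log(phi) - log(mu + phi))

def lpmf_n1(mu, phi):
    # closed form for n = 1: the binomial coefficient reduces to phi
    return log(phi) + log(mu) - log(mu + phi) + phi * (log(phi) - log(mu + phi))

for mu in (1e-8, 1.5, 1e6):
    for phi in (1e-4, 10.0, 1e5):
        tol0 = 1e-8 * max(1.0, abs(lpmf_n0(mu, phi)))
        tol1 = 1e-8 * max(1.0, abs(lpmf_n1(mu, phi)))
        assert abs(lpmf(0, mu, phi) - lpmf_n0(mu, phi)) < tol0
        assert abs(lpmf(1, mu, phi) - lpmf_n1(mu, phi)) < tol1
```

Even in this sketch, a purely relative tolerance would fail for phi = 1e5 at n = 1, because lgamma(1 + phi) - lgamma(phi) subtracts two values around 1e6; that is the same class of cancellation this PR is chasing.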

Side Effects

  • The way the density for neg_binomial_2_lpmf is computed has changed to improve stability, at the cost of tangling mu and phi together earlier, so less computation can be avoided when propto=true and phi is fixed.
  • Generally I would expect a very minor speed drop for some parameter values (where we previously delegated to Poisson, which is probably cheaper).

Checklist

By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses:
- Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
- Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)

  • the basic tests are passing

    • unit tests pass (to run, use: ./runTests.py test/unit)
    • header checks pass, (make test-headers)
    • docs build, (make doxygen)
    • code passes the built in C++ standards checks (make cpplint)
  • the code is written in idiomatic C++ and changes are documented in the doxygen

  • the new changes are tested

@martinmodrak (Contributor Author)

So, fixing the phi cutoff triggered some failures in the finite differences test. I tried to fix those and wrote more tests to make sure I am on the right track. Those tests revealed other issues. I failed to fix all of them so far.

In particular, there are numerical differences between the values we compute and what I get from Mathematica and the finite differences test does not pass.

@bob-carpenter (Contributor) commented Dec 10, 2019 via email

@martinmodrak commented Dec 11, 2019

Will continue the discussion here and not at the issue (hope that's OK).

There are currently two failing tests: a precomputed test using numbers from Mathematica, and the original finiteDiffs test, which I expanded to cover a wider range of values (but it fails even for the original values tested).

The numbers I get from Mathematica for the precomputed test are computed in this notebook: https://www.wolframcloud.com/obj/martin.modrak/Published/NegBinomial2_Tests.nb - I am still a Mathematica beginner, so I hope I made no silly mistake, but since I am using what they call "Exact numbers", the outputs should allow arbitrary precision (I instruct it to work with 40 digits, then convert to what they call "CForm", which hopefully stores as much precision as C would accept; number-to-string formatting in Mathematica is complicated). The precomputed test fails for some values, with relative errors up to 1e-5, all happening for either small mu or small phi. Here is a sample of the test output:

test/unit/math/rev/scal/prob/neg_binomial_2_test.cpp:84: Failure
The difference between gradients[1] and t.grad_phi is 1.7763568394002505e-015, which exceeds fabs(t.grad_phi * 1e-8), where
gradients[1] evaluates to -2.6984512402350447e-009,
t.grad_phi evaluates to -2.6984530165918841e-009, and
fabs(t.grad_phi * 1e-8) evaluates to 2.6984530165918841e-017.
grad_phi n = 14, mu = 1.5, phi = 162345

test/unit/math/rev/scal/prob/neg_binomial_2_test.cpp:82: Failure
The difference between gradients[0] and t.grad_mu is 4.0061531662412956e-017, which exceeds fabs(t.grad_mu * 1e-8), where
gradients[0] evaluates to -1.2600009924312872e-009,
t.grad_mu evaluates to -1.2600010324928188e-009, and
fabs(t.grad_mu * 1e-8) evaluates to 1.2600010324928188e-017.
grad_mu n = 10233, mu = 10586, phi = 0.00040000000000000002

The finiteDiffs test, on the other hand, fails with many values being off by 1 or more. I added one of the failing values (n = 7, mu = 8, phi = 150000) to the precomputed test; there it passes for mu but not for phi, though the error for phi is in the 6th digit, so I am starting to think this might be a problem with the finite diff.

Here is the relevant error output from the precomputed test:

gradients[1] evaluates to 1.333262389380252e-010,
t.grad_phi evaluates to   1.333280152948646e-010

And here is from the finiteDiffs test:

gradients[0] evaluates to -0.12499333368886989,
finite_diffs[0] evaluates to -1.4533085845869209, and

gradients[1] evaluates to 1.333262389380252e-010,
finite_diffs[1] evaluates to 1.1641532182693481, and

The test passed before my code changes, but all the failures are for phi values that used to delegate to Poisson, but are now below the cutoff.

The full test output is seen in the failed Jenkins build: https://jenkins.mc-stan.org/blue/organizations/jenkins/Math%20Pipeline/detail/PR-1497/9/pipeline/.
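For context on why the PR description mentions replacing finite differences with a complex step: a forward difference loses roughly half its significant digits to subtractive cancellation, while the complex step imag(f(x + ih))/h involves no subtraction at all, so h can be taken absurdly small. A Python sketch on the n = 0 density (parameter values illustrative, not the test's):

```python
import cmath
import math

def lpmf_n0(mu, phi):
    # NB2 log density at n = 0; holomorphic in phi, so a complex step applies
    return phi * (cmath.log(phi) - cmath.log(mu + phi))

mu, phi = 8.0, 10.0
analytic = math.log(phi / (mu + phi)) + mu / (mu + phi)  # exact d/dphi

# complex step: imag(f(phi + i*h)) / h, free of subtractive cancellation
h = 1e-20
cs = lpmf_n0(mu, phi + 1j * h).imag / h

# forward finite difference for comparison
eps = 1e-8
fd = (lpmf_n0(mu, phi + eps).real - lpmf_n0(mu, phi).real) / eps

err_cs = abs(cs - analytic)
err_fd = abs(fd - analytic)
assert err_cs < err_fd   # complex step is markedly more accurate
assert err_cs < 1e-12
```

When the true derivative itself is tiny (as in the failing large-phi cases above), even the complex step loses relative accuracy to rounding in the real parts, which could plausibly explain the reported sign mismatches.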

@martinmodrak (Contributor Author)

Also, I've run out of both ideas and time to improve the actual computation (if the test failures above are convincing enough to show that it needs improvement). I will be able to work on this more after New Year (unless I need very hard to procrastinate on the stuff I should be doing :-) ).

@martinmodrak (Contributor Author)

OK, obviously, I can't stop procrastinating :-) For the differences in the derivative w.r.t. phi, at least part of the problem is differing digamma results between Boost, R, and Mathematica. R code to show the differences:

n <- 7
mu <- 0.0000256
phi <- 15324.0

digamma_phi_boost <-        9.6371428769091523 
digamma_phi_mathematica <-  9.6371428769091527082 
digamma_n_phi_boost <-        9.6375995872973039
digamma_n_phi_mathematica <-  9.6375995872973037537

base <- 1.0 - (n + phi) / (mu + phi) + log(phi) - log(mu + phi)

#Expected value (by mathematica)
-8.9402263370175206e-008
#Computed by Stan math
-8.9402261593818366e-008

base - digamma_phi_boost + digamma_n_phi_boost #Matches Stan
base - digamma(phi) + digamma(n + phi) #Matches mathematica 
base - digamma_phi_mathematica + digamma_n_phi_mathematica #Matches mathematica 
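For integer n, one way to sidestep this digamma sensitivity entirely is the recurrence psi(x + 1) = psi(x) + 1/x, which turns the difference into an exact finite sum with no cancellation. A Python sketch checking that sum against the Mathematica digamma values quoted above (the identity is standard; whether Stan math should use it here is a separate question):

```python
# For integer n, psi(phi + n) - psi(phi) = sum_{k=0}^{n-1} 1/(phi + k),
# which avoids subtracting two nearly equal digamma values.
n, phi = 7, 15324.0
stable_diff = sum(1.0 / (phi + k) for k in range(n))

# difference of the Mathematica digamma values quoted in the R snippet above
mathematica_diff = 9.6375995872973037537 - 9.6371428769091527082
assert abs(stable_diff - mathematica_diff) < 1e-13
```

The direct subtraction loses about four digits here (both digammas are near 9.637, the difference is near 4.6e-4), so ULP-level disagreements between Boost and Mathematica get amplified exactly as observed.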

@martinmodrak commented Dec 11, 2019

So it turns out I was chasing very small errors. Taking absolute tolerance into account (i.e. making the tolerance max(true_value * 1e-8, 1e-14)) made the tests pass for almost all values. Slightly rearranging the computation of the density removed the remaining test failures, at the cost of saving less computation when propto=true and phi is constant.
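The mixed tolerance described here is just a relative check with an absolute floor, so near-zero expected values are not held to an impossible relative standard. A tiny Python sketch (names hypothetical):

```python
def tolerance(true_value, rel=1e-8, abs_floor=1e-14):
    # relative tolerance with an absolute floor: max(|true| * rel, abs_floor)
    return max(abs(true_value) * rel, abs_floor)

assert tolerance(0.0) == 1e-14      # floor kicks in at zero
assert tolerance(2.0) == 2e-8       # relative part dominates otherwise
assert tolerance(-3e-10) == 1e-14   # 3e-18 would be unattainable
```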

So the only remaining failing test is the finite diff. test. I am not sure what to do about this one.

The PR description has been updated to reflect the current situation.

martinmodrak and others added 3 commits December 11, 2019 13:22
Implemented gradient wrt. phi for Poisson approximation to make the test pass
@martinmodrak martinmodrak changed the title [WIP] Fixing negative binomial phi cutoff Fixing negative binomial phi cutoff Dec 11, 2019
@mcol mcol linked an issue Mar 13, 2020 that may be closed by this pull request
@martinmodrak (Contributor Author)

@bob-carpenter So that code evolved quite a bit and the required changes to binomial_coefficient_log have been merged. I believe this is now in good shape for a re-review.

@bbbales2 (Member)

I can take the review over if that's convenient, since I did the last binomial one.

@mcol (Contributor) left a comment

I had a quick look and noted a few duplicate variables and wrongly removed headers.

(Review comments on stan/math/prim/prob/neg_binomial_2_lpmf.hpp — outdated, resolved)
@martinmodrak (Contributor Author)

@mcol thanks for catching those - I was obviously sloppy with the merge, sorry for that. I resolved those.

@stan-buildbot (Contributor)


| Name | Old Result | New Result | Ratio | Performance change (1 - new/old) |
|---|---|---|---|---|
| gp_pois_regr/gp_pois_regr.stan | 4.89 | 4.88 | 1.0 | 0.17% faster |
| low_dim_corr_gauss/low_dim_corr_gauss.stan | 0.02 | 0.02 | 0.97 | -2.99% slower |
| eight_schools/eight_schools.stan | 0.09 | 0.09 | 0.99 | -0.6% slower |
| gp_regr/gp_regr.stan | 0.22 | 0.22 | 1.0 | 0.04% faster |
| irt_2pl/irt_2pl.stan | 6.44 | 6.46 | 1.0 | -0.31% slower |
| performance.compilation | 87.82 | 86.25 | 1.02 | 1.79% faster |
| low_dim_gauss_mix_collapse/low_dim_gauss_mix_collapse.stan | 7.58 | 7.58 | 1.0 | 0.08% faster |
| pkpd/one_comp_mm_elim_abs.stan | 20.75 | 21.07 | 0.98 | -1.57% slower |
| sir/sir.stan | 93.74 | 93.14 | 1.01 | 0.64% faster |
| gp_regr/gen_gp_data.stan | 0.05 | 0.05 | 0.98 | -2.03% slower |
| low_dim_gauss_mix/low_dim_gauss_mix.stan | 2.95 | 2.95 | 1.0 | -0.13% slower |
| pkpd/sim_one_comp_mm_elim_abs.stan | 0.34 | 0.31 | 1.06 | 6.04% faster |
| arK/arK.stan | 1.74 | 1.73 | 1.0 | 0.43% faster |
| arma/arma.stan | 0.65 | 0.66 | 1.0 | -0.18% slower |
| garch/garch.stan | 0.51 | 0.51 | 1.0 | 0.09% faster |

Mean result: 1.00136891454

Commit hash: 6cfff52


Machine information ProductName: Mac OS X ProductVersion: 10.11.6 BuildVersion: 15G22010

CPU:
Intel(R) Xeon(R) CPU E5-1680 v2 @ 3.00GHz

G++:
Configured with: --prefix=/Applications/Xcode.app/Contents/Developer/usr --with-gxx-include-dir=/usr/include/c++/4.2.1
Apple LLVM version 7.0.2 (clang-700.1.81)
Target: x86_64-apple-darwin15.6.0
Thread model: posix

Clang:
Apple LLVM version 7.0.2 (clang-700.1.81)
Target: x86_64-apple-darwin15.6.0
Thread model: posix

@bbbales2 (Member)

@bob-carpenter can I take this over?

@bob-carpenter (Contributor) commented Mar 17, 2020 via email

@bbbales2 (Member) left a comment

Both the comments are optional. I'll merge if you say so. I think it's super adequately tested, but it looked like there were a couple of things you meant to have there that aren't.

@martinmodrak (Contributor Author)

Thanks @bbbales2, will try to resolve shortly. Note to self: #1679 was fixed and merged, can also expand the lbeta tests.

@bbbales2 (Member)

Quick ping.

can also expand the lbeta tests.

I don't think there's a need to add many new tests here if that's what you're getting at! Just those couple things, and really I'd be fine with the pull as-is.

@martinmodrak (Contributor Author)

OK, just added the test for d/dphi at n = 1. Also, the lbeta thing was literally that I had a line of code saying "Delete this once #1679 is merged", so I deleted it :-) Tests pass on my computer; hopefully they pass on Jenkins as well.
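The d/dphi check at n = 1 can be cross-validated independently: the closed form of the log density and its hand-derived phi-derivative should agree with a complex step. A Python sketch (my own derivation from the standard NB2 density, not the code from this PR):

```python
import cmath
import math

def lpmf_n1(mu, phi):
    # NB2 log density at n = 1, written complex-safe for complex-step checking
    return (cmath.log(mu) + cmath.log(phi) + phi * cmath.log(phi)
            - (1 + phi) * cmath.log(mu + phi))

def dphi_n1(mu, phi):
    # hand-derived d/dphi of the n = 1 closed form
    return (1 / phi + math.log(phi) + 1
            - math.log(mu + phi) - (1 + phi) / (mu + phi))

mu, phi = 3.0, 7.0
h = 1e-20
cs = lpmf_n1(mu, phi + 1j * h).imag / h
assert abs(cs - dphi_n1(mu, phi)) < 1e-10 * max(1.0, abs(dphi_n1(mu, phi)))
```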

@stan-buildbot (Contributor)


| Name | Old Result | New Result | Ratio | Performance change (1 - new/old) |
|---|---|---|---|---|
| gp_pois_regr/gp_pois_regr.stan | 4.81 | 4.93 | 0.98 | -2.45% slower |
| low_dim_corr_gauss/low_dim_corr_gauss.stan | 0.02 | 0.02 | 1.0 | -0.4% slower |
| eight_schools/eight_schools.stan | 0.09 | 0.09 | 1.04 | 3.57% faster |
| gp_regr/gp_regr.stan | 0.22 | 0.22 | 0.99 | -1.03% slower |
| irt_2pl/irt_2pl.stan | 6.5 | 6.47 | 1.0 | 0.47% faster |
| performance.compilation | 89.41 | 86.72 | 1.03 | 3.01% faster |
| low_dim_gauss_mix_collapse/low_dim_gauss_mix_collapse.stan | 7.53 | 7.52 | 1.0 | 0.11% faster |
| pkpd/one_comp_mm_elim_abs.stan | 20.89 | 20.69 | 1.01 | 0.96% faster |
| sir/sir.stan | 94.21 | 91.13 | 1.03 | 3.27% faster |
| gp_regr/gen_gp_data.stan | 0.05 | 0.05 | 1.01 | 1.21% faster |
| low_dim_gauss_mix/low_dim_gauss_mix.stan | 2.95 | 2.99 | 0.99 | -1.24% slower |
| pkpd/sim_one_comp_mm_elim_abs.stan | 0.33 | 0.33 | 0.98 | -2.06% slower |
| arK/arK.stan | 1.74 | 1.75 | 0.99 | -0.84% slower |
| arma/arma.stan | 0.66 | 0.66 | 1.01 | 0.85% faster |
| garch/garch.stan | 0.51 | 0.51 | 0.99 | -0.87% slower |

Mean result: 1.00337330892

Commit hash: 4b2f032



@bob-carpenter (Contributor)

Sorry, I missed that this was blocked on me.


Successfully merging this pull request may close these issues.

The way neg_binomial_2_lpmf delegates to Poisson is broken