Compound Gamma-Poisson Distribution #2775
Conversation
Looks great. The only thing is removing the _log suffix files. Those are not required anymore in stanc3 so there is no reason to add them in Math.
This is just another parameterization of the negative binomial and I think it should be named as such. If we have gamma-Poisson and negative-binomial, it'll be confusing, because they're effectively synonyms. I don't know how to resolve the naming---I'd name after parameters if possible rather than with numbering as we've done so far.
I might disagree about the naming here actually. My intention was for the distribution to be viewed analogously to the

It's also likely that some less-statistical users might not be aware of the relationship with the negative binomial, so I think this also provides a clearer implementation for unfamiliar users.
I'd rather mention in the doc that we have a negative binomial with standard gamma prior parameterization. Then someone could find it. I very much do not want to introduce gamma-poisson as a new distribution name given that it really is just another way of saying "negative binomial". I don't think we'll have many statistically unsophisticated users thinking they need a marginalized gamma-Poisson but walking away because they've never heard of the negative binomial.

But the real question is why are we adding this distribution? I would think the mean + overdispersion parameterization given by negative-binomial-2 would be the most natural to parameterize. Having the mean be

So I think we disagree here and need a way to resolve.
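For reference, the relationship at issue can be written out explicitly. This is a sketch of the marginalization, assuming the Gamma prior uses the usual shape/rate convention (the PR does not spell out its parameter convention here):

```latex
% Marginalizing a Gamma(alpha, beta) prior (shape alpha, rate beta; assumed convention)
% over the rate lambda of a Poisson observation y:
\begin{aligned}
p(y \mid \alpha, \beta)
  &= \int_0^\infty \operatorname{Poisson}(y \mid \lambda)\,
     \operatorname{Gamma}(\lambda \mid \alpha, \beta)\, \mathrm{d}\lambda \\
  &= \frac{\Gamma(y + \alpha)}{y!\,\Gamma(\alpha)}
     \left(\frac{\beta}{\beta + 1}\right)^{\alpha}
     \left(\frac{1}{\beta + 1}\right)^{y},
\end{aligned}
```

which is a negative binomial with mean α/β and variance α/β + α/β². Matching moments against the mean/dispersion parameterization gives `neg_binomial_2(mu, phi)` with μ = α/β and φ = α, which is the reparameterization being debated above.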
We need to include autodiff tests even if we think we know they're going to work for some external reason (like delegating to neg-binomial-2). That way, if someone adds analytic gradients or otherwise monkeys with the function, the tests will be in place. Also, it's surprisingly easy to write things that autodiff fails for even when we think they should work (like
I'm not in any way determined on having this included. If there's a preference against it, I'm happy to close this PR and instead add a section to the docs or similar explaining the option.
Hi, @andrjohns. I was just expressing my own preference---I don't claim to speak for everyone. So you might want to ask others if they have a use for it and what they'd like it to be called. |
I’ll add a vote against including this because it’s right on the precipice of a very slippery slope — many density functions can be written as mixtures of two other density functions. There’s the compound Poisson-gamma giving a negative binomial but also a compound normal-exponential giving a Laplace, a compound normal-gamma giving a Student t, etc, etc.
The beta-binomial is a little different in that the mixture doesn’t result in a common probability mass function, so it has to be implemented as its own family.
Personally I think that focusing on implementing families of density functions that are not yet implemented, and avoiding redundant families that are just reparameterizations of each other, will help keep the language from being overburdened. Relationships like the compound Poisson-gamma and the existing negative binomial (either of them) would be excellent contributions to the documentation so that users could learn about the relationships and how to implement the compound distributions using the existing families. But there is definitely a trade-off in user effort and language compactness.
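A minimal sketch of what such copy-pasteable documentation could look like, as a hypothetical Stan program rather than anything in this PR; the names `N`, `y`, `alpha`, and `beta` are illustrative, and the shape/rate Gamma convention from the derivation above is assumed:

```stan
// Gamma-Poisson compound model with the rate marginalized out:
//   lambda ~ gamma(alpha, beta);  y ~ poisson(lambda);
// is equivalent to
//   y ~ neg_binomial_2(alpha / beta, alpha);
data {
  int<lower=0> N;
  array[N] int<lower=0> y;
}
parameters {
  real<lower=0> alpha;  // Gamma shape
  real<lower=0> beta;   // Gamma rate
}
model {
  // priors on alpha and beta omitted for brevity
  y ~ neg_binomial_2(alpha / beta, alpha);  // mean = alpha / beta, dispersion = alpha
}
```

If I have the definitions right, Stan's existing `neg_binomial(alpha, beta)` is this same shape/rate parameterization, which is presumably the "negative binomial with standard gamma prior parameterization" mentioned earlier in the thread, so the documentation could also simply point readers at that.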
I'd like to deal with these on a case by case basis. I think having too many distributions of the same kind (like multiple gamma-Poisson, i.e., neg binomial) is confusing, but on the other hand, it's error prone to reparameterize yourself, especially around distributions with multiple standard parameterizations like normal and gamma and negative binomial. I like that we have the beta proportion distribution in addition to the standard one, because the new one's more natural to formulate priors for. I like having the Cholesky factor multi-normals because they're more efficient than the covariance version even though we could feed L * L' into the regular one and get the same result (up to numerical issues and speed).
> I'd like to deal with these on a case by case basis. I think having too many distributions of the same kind (like multiple gamma-Poisson, i.e., neg binomial) is confusing, but on the other hand, it's error prone to reparameterize yourself, especially around distributions with multiple standard parameterizations like normal and gamma and negative binomial.

No disagreement on the error proneness of implementing reparameterizations by hand. I think the best trade-off is implementing reparameterized density families or documenting the reparameterizations with code that can be copy-pasted.
> I like having the Cholesky factor multi-normals because they're more efficient than the covariance version even though we could feed L * L' into the regular one and get the same result (up to numerical issues and speed).
Isn’t this just an implementation problem? Technically, if the multi_normal calls the efficient Cholesky code it should be just as efficient as Cholesky’ing the covariance matrix and then passing that to multi_normal_cholesky. The real difference is when modeling an unstructured covariance matrix itself, in which case modeling the Cholesky factor directly, and plugging it into multi_normal_cholesky, will be more efficient.
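For concreteness, a small sketch of that second case, with hypothetical names throughout: the Cholesky factor of an unstructured correlation matrix is modeled directly and passed to `multi_normal_cholesky`, so no covariance decomposition happens inside the density evaluation.

```stan
data {
  int<lower=1> N;
  int<lower=1> K;
  array[N] vector[K] y;
}
parameters {
  vector[K] mu;
  vector<lower=0>[K] sigma;    // marginal scales
  cholesky_factor_corr[K] L;   // Cholesky factor of the correlation matrix
}
model {
  mu ~ normal(0, 1);
  sigma ~ normal(0, 1);        // half-normal via the lower bound
  L ~ lkj_corr_cholesky(2);
  // the covariance matrix is never formed or decomposed explicitly
  y ~ multi_normal_cholesky(mu, diag_pre_multiply(sigma, L));
}
```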
> I like that we have the beta proportion distribution in addition to the standard one, because the new one's more natural to formulate priors for.
> I'm not convinced we need the compound gamma-Poisson because the gamma distribution doesn't lead to natural priors (it's like the beta that way in how it tangles the parameters), so I don't see the point. I absolutely don't think it should be our goal to implement as many known distributions as we can.
I would argue that beta_proportion is more commonly applied to constrained regression models, i.e.
y ~ beta_proportion( logistic(alpha + beta * x) , psi)
than non-uniform prior modeling for unit-interval constrained parameters, but that’s a judgment call.
The complication is that this is only one parameterization of the beta family that isolates the mean — for example in https://betanalpha.github.io/assets/case_studies/probability_densities.html#24_the_beta_family I argue that another parameterization (using the inverse of the beta_proportion scale parameter) is more useful in practice. A similar argument can be made for the second argument of the neg_binomial_2 family. Transforming between these two is relatively straightforward because the reparameterization is limited to a single parameter, but it’s still annoying.
On the prior modeling side I hesitate because I think that there is too much heterogeneity in the community. For example, when using a beta prior density I use quantile constraints, and then transform them into alpha and beta parameters, rather than mean +/- std_dev heuristics. This avoids the need for beta_proportion entirely. Using prior modeling to motivate which families to implement can also lead to detritus in the language as strategies change, for example the rarely used neg_binomial in Stan (which, frustratingly, is not equal to the standard negative binomial process definition, which makes implementing negative binomial models a pain).
All of that is to say that I think the only “fair” restriction to make is structural rather than interpretational. For example, allowing the implementation of a “standard/conventional” and a “mean” parameterization of families (the latter covering regression and some prior modeling strategies) but requiring users to reparameterize for any others themselves. Exceptions could be made for convolutions that can’t be implemented with a simple reparameterization of an existing family.
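For reference, the single-parameter transform being discussed, written with what I take to be Stan's mean/concentration convention for beta_proportion (μ in (0, 1), κ > 0):

```latex
% beta_proportion in terms of the standard shape/shape beta family
\mathrm{beta\_proportion}(\theta \mid \mu, \kappa)
  = \mathrm{Beta}\!\bigl(\theta \mid \alpha = \mu\kappa,\; \beta = (1 - \mu)\kappa\bigr)
```

So moving between this form, the standard shape/shape form, or a variant that uses 1/κ only requires transforming that single second parameter, which is the point made above about the reparameterization being straightforward but annoying.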
Summary
This PR adds a new compound distribution: `poisson_gamma()`, as the likelihood for a Poisson random variate with a Gamma prior for the rate parameter, after marginalising out the rate parameter. All distribution functions are implemented using the existing `neg_binomial_2` distribution, and reference values for testing calculated using the `GammaPoiss` family in the `extraDistr` R package: https://rdrr.io/cran/extraDistr/man/GammaPoiss.html

Tests
`prim` tests added only, as the gradients are handled by the `neg_binomial_2_` functions

Side Effects
N/A
Release notes
Introduced new `gamma_poisson()` compound distribution

Checklist
Math issue: New compound distribution - `poisson_gamma` #2772

Copyright holder: Andrew Johnson
The copyright holder is typically you or your assignee, such as a university or company. By submitting this pull request, the copyright holder is agreeing to the license the submitted work under the following licenses:
- Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
- Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
- the basic tests are passing (`./runTests.py test/unit`, `make test-headers`, `make test-math-dependencies`, `make doxygen`, `make cpplint`)
- the code is written in idiomatic C++ and changes are documented in the doxygen
- the new changes are tested