Ensure that `binomlogpdf` returns non-positive values, t distributions with infinite parameter are supported, and add integration tests #126
Conversation
Codecov Report

```diff
@@            Coverage Diff             @@
##           master     #126      +/-   ##
==========================================
+ Coverage   37.41%   37.55%   +0.14%
==========================================
  Files          12       12
  Lines         417      418       +1
==========================================
+ Hits          156      157       +1
  Misses        261      261
```

Continue to review full report at Codecov.
Changed the title: "`binomlogpdf` returns non-positive values and add integration tests" → "`binomlogpdf` returns non-positive values, t distributions with infinite parameter are supported, and add integration tests"
I'm not sure why those examples would give nonzero values. Aren't the two examples

```julia
julia> Distributions.betalogpdf(1, 6, 0.0) - log(6)
0.0

julia> Distributions.betalogpdf(6, 1, 1.0) - log(6)
0.0
```
This is what one gets with StatsFuns 0.9.10, in which

```julia
julia> Distributions.betalogpdf(1, 6, 0.0) - log(6)
4.440892098500626e-16

julia> Distributions.betalogpdf(6, 1, 1.0) - log(6)
4.440892098500626e-16
```

Apparently the Rmath implementation of the density of the Beta distribution handles the cases, whereas the new Julia implementation returns

```julia
julia> -SpecialFunctions.logbeta(1, 6)
1.7917594692280554

julia> -SpecialFunctions.logbeta(6, 1)
1.7917594692280554

julia> log(6)
1.791759469228055
```

I guess the more general (and better?) fix would be to handle this special case (one argument equal to 1) in the implementation of
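A minimal sketch of what such a special-case branch might look like (illustrative only; `logabsbeta_with_unit_argument` is a hypothetical wrapper, not the actual SpecialFunctions code, and it assumes positive arguments):

```julia
using SpecialFunctions

# Hypothetical sketch: B(1, b) = 1/b, so log|B(1, b)| = -log(b) exactly.
# Short-circuiting this case avoids the ~1 ulp error of the generic path.
function logabsbeta_with_unit_argument(a::Real, b::Real)
    if a == 1
        return (-log(b), 1)
    elseif b == 1
        return (-log(a), 1)
    else
        return logabsbeta(a, b)  # fall back to the generic implementation
    end
end
```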
I opened a PR with the more general fix of handling this special case in SpecialFunctions.
```
@@ -0,0 +1,52 @@
name: IntegrationTest
```
What's this file?
It runs integration tests. Currently only with Distributions.
The same action is e.g. used by ChainRulesCore, DiffRules, and many Turing and SciML packages.
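Conceptually, such an integration test uses the local checkout of this package together with the downstream package and runs the downstream test suite against it. A minimal Julia sketch of that idea (assuming it is run from the package root; this is not the actual workflow script):

```julia
using Pkg

# Conceptual sketch of a downstream integration test (not the actual
# workflow script): develop the local checkout of this package into a
# temporary environment and run the downstream package's tests against it.
Pkg.activate(; temp=true)
Pkg.develop(path=pwd())      # local checkout, e.g. StatsFuns
Pkg.add("Distributions")     # downstream package under test
Pkg.test("Distributions")
```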
I've been back and forth on this tonight. I'm not sure it's appropriate to add the extra branches in SpecialFunctions. Any idea why the PoissonBinomial tests are failing in Distributions? The tolerance there does look a bit tight.
To me it seems it does improve the precision. Currently, we have e.g.

```julia
julia> beta(1.0, 200.0)
0.005000000000000002

julia> logabsbeta(1.0, 4.0)
(-1.3862943611198908, 1)

julia> -log(4.0)
-1.3862943611198906
```

whereas with the PR to SpecialFunctions one gets

```julia
julia> beta(1.0, 200.0)
0.005

julia> logabsbeta(1.0, 4.0)
(-1.3862943611198906, 1)

julia> -log(4.0)
-1.3862943611198906
```

However, I think the
I think it is just caused by slightly different values from the Julia implementation of
This PR to Distributions fixes the remaining test errors: JuliaStats/Distributions.jl#1398
My point is that it's just one ULP, so less than the general precision for the function, and therefore not worth the branches. Those branches then happen to give exact zeros in a downstream function, but I don't think that is really the right evaluation metric for these kinds of functions. Here, the simpler approach is to just cap the log-probabilities. Alternatively, we could add branches in the binomial pdf for the degenerate cases.
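For illustration, a minimal sketch of the capping approach (the wrapper name is hypothetical; the actual change in this PR may look different):

```julia
using StatsFuns

# Hypothetical sketch of capping: clamp the log-pmf at 0 so that tiny positive
# round-off (on the order of 4e-16) cannot produce a "probability" above 1.
capped_binomlogpdf(n, p, k) = min(binomlogpdf(n, p, k), 0.0)

capped_binomlogpdf(5, 0.0, 0)  # exactly 0.0 for this degenerate case
capped_binomlogpdf(5, 1.0, 5)  # exactly 0.0 for this degenerate case
```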
An additional advantage is that it is much faster for this special case but does not impact performance significantly in the other cases. With SpecialFunctions#master I get

```julia
julia> using SpecialFunctions, BenchmarkTools

julia> @btime beta(1.0, 200.0);
  47.907 ns (0 allocations: 0 bytes)

julia> @btime logabsbeta(1.0, 4.0);
  60.584 ns (0 allocations: 0 bytes)

julia> @btime beta(3.4, 200.0);
  66.080 ns (0 allocations: 0 bytes)

julia> @btime logabsbeta(3.4, 4.0);
  78.036 ns (0 allocations: 0 bytes)
```

whereas with the PR I get

```julia
julia> using SpecialFunctions, BenchmarkTools

julia> @btime beta(1.0, 200.0);  # seems it is compiled away completely...
  0.019 ns (0 allocations: 0 bytes)

julia> @btime logabsbeta(1.0, 4.0);
  1.451 ns (0 allocations: 0 bytes)

julia> @btime beta(3.4, 200.0);
  67.098 ns (0 allocations: 0 bytes)

julia> @btime logabsbeta(3.4, 4.0);
  79.259 ns (0 allocations: 0 bytes)
```

The timings are consistent with

```julia
julia> @btime inv(200.0);
  0.018 ns (0 allocations: 0 bytes)

julia> @btime -log(4.0);
  1.450 ns (0 allocations: 0 bytes)
```

We also handle special cases in other functions, e.g. https://github.com/JuliaMath/SpecialFunctions.jl/blob/0c4181254cf664923c1504de599c5d1f2c243831/src/gamma.jl#L234-L235, hence to me it does not seem completely unusual to exploit this mathematical property and handle this case separately.
It seems that #125 broke some tests in Distributions (e.g. https://github.com/JuliaStats/Distributions.jl/pull/1387/checks?check_run_id=3732360108) since `binomlogpdf(5, 0.0, 0)` and `binomlogpdf(5, 1.0, 5)` return values that are slightly above zero (around 4e-16). The PR sets an upper bound for these values and adds integration tests with Distributions.jl to ensure that we avoid such breakages in the future.

Edit: Additionally, the PR fixes the (log)pdf of a t distribution with infinite parameter, which Rmath handles and Distributions tests for.
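For context, as the degrees-of-freedom parameter tends to infinity the t distribution converges to the standard normal, so the infinite case can be delegated to the normal log-pdf. A hedged sketch of that idea (the wrapper name is hypothetical and this is not necessarily the exact code in the PR):

```julia
using StatsFuns

# Hypothetical sketch: with infinite degrees of freedom the t distribution
# coincides with the standard normal, so delegate its log-pdf accordingly.
tdistlogpdf_with_inf(ν, x) = isinf(ν) ? normlogpdf(x) : tdistlogpdf(ν, x)

tdistlogpdf_with_inf(Inf, 0.5)  # == normlogpdf(0.5)
tdistlogpdf_with_inf(3.0, 0.5)  # regular t log-pdf
```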