fixed floating point error #713

dnabanita7 · 2021-08-31T15:30:57Z

related to issue #712

nalimilan · 2021-08-31T20:29:56Z

Thanks for the PR! However, I'm not sure the docs should correspond exactly to the implementation. The docstring gives the mathematical formula, and the implementation is slightly different for performance reasons. Floating-point rounding differences are generally not considered significant. As was noted on Discourse, even 3/10 == 3*0.1 doesn't hold in floating point. As long as isapprox holds, everything is fine.
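To make the point above concrete, here is a minimal sketch using only Base Julia. The variable names a and b are illustrative and do not come from the PR.

```julia
# Two mathematically equal expressions that round differently in Float64.
a = 3 / 10    # division: the correctly rounded quotient
b = 3 * 0.1   # multiplication by 0.1, which is itself already rounded

println(a == b)          # false: the two results differ in the last bit
println(isapprox(a, b))  # true: equal within the default relative tolerance
println(a ≈ b)           # true: infix form of isapprox
```

Comparing with isapprox (or ≈) rather than == is the usual way to check floating-point results like these.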

dnabanita7 · 2021-08-31T23:26:37Z

Alright! Should I just mention floating-point precision in the docstrings? Or should I close the PR?

andreasnoack · 2021-09-01T11:52:37Z

I think the existing version is fine. Separately from that, I also think this should be implemented with division instead of multiplication with the inverse.
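A small sketch of the difference between the two approaches, assuming a count vector chosen to match the 7-element output shown in this thread. The names c, n, by_division and by_inverse are hypothetical and are not the library's code.

```julia
# Illustrative counts (an assumption); they sum to 10.
c = [1, 2, 1, 1, 3, 0, 2]
n = sum(c)

by_division = c / n        # each entry is one correctly rounded division
by_inverse = c .* inv(n)   # inv(n) is rounded first, then multiplied

println(by_division[5])            # 0.3
println(by_inverse[5])             # 0.30000000000000004
println(by_division ≈ by_inverse)  # true
```

Division rounds once per element, whereas multiplying by the inverse rounds twice (once for inv(n), once for the product), which is where the extra digit comes from.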

nalimilan · 2021-09-01T12:29:23Z

> Separately from that, I also think this should be implemented with division instead of multiplication with the inverse.

I also wondered whether using the inverse was really a good idea. I imagine it can only be faster if the range of levels is very wide compared with the number of values, but even then it's not obvious that it would make a big difference.

@dnabanita7 Would you feel like checking whether using / for the implementation rather than inv would be acceptable for performance?

dnabanita7 · 2021-09-01T15:18:03Z

I don't think it makes much difference with shorter collections.

julia> @btime counts(x1, s1) .* inv(length(x1))
  546.541 ns (5 allocations: 368 bytes)
7-element Vector{Float64}:
 0.1
 0.2
 0.1
 0.1
 0.30000000000000004
 0.0
 0.2

julia> @btime counts(x1, s1) / length(x1)
  133.574 ns (2 allocations: 288 bytes)
7-element Vector{Float64}:
 0.1
 0.2
 0.1
 0.1
 0.3
 0.0
 0.2

With larger collections, / gives better performance.

julia> @btime counts(x1, s1) / length(x1)
  25.491 s (5 allocations: 7.45 GiB)
500000000-element Vector{Float64}:

julia> @btime counts(x1, s1) .* inv(length(x1))
  28.704 s (8 allocations: 7.45 GiB)
500000000-element Vector{Float64}:

nalimilan · 2021-09-02T19:37:56Z

What are x1 and s1 in these cases? I couldn't find cases where performance differs between inv and /.

dnabanita7 · 2021-09-03T05:44:03Z

For the first case, x1 and s1 are 7-element vectors, as in the output; for the second case, x1 and s1 are randomly generated vectors with 5 * 10^8 elements. As you can see, in the second case inv makes more memory allocations (8 versus 5) and takes about 3 seconds longer than /.

nalimilan · 2021-09-03T08:15:53Z

Hmm, I still don't get it. In general, posting code is more explicit.

Anyway, inv doesn't seem faster, so let's switch to /. Would you update the PR?

(BTW, be careful when looking at the number of allocations with @btime: it's safer to wrap the code in a function, in particular when it uses broadcasting. When doing that I see 4 allocations for both.)
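One way to follow this advice is sketched below, assuming the BenchmarkTools and StatsBase packages are installed. The helper names prop_div and prop_inv, the data, and the level range are all hypothetical and are not the code that was benchmarked in this thread.

```julia
using BenchmarkTools  # provides @btime
using StatsBase       # provides counts

# Hypothetical wrappers, so that the broadcast is compiled inside a function.
prop_div(x, levels) = counts(x, levels) / length(x)
prop_inv(x, levels) = counts(x, levels) .* inv(length(x))

x = rand(1:7, 10_000)  # illustrative data, an assumption
levels = 1:7

# Interpolate globals with $ so @btime does not measure global-variable access.
@btime prop_div($x, $levels);
@btime prop_inv($x, $levels);
```

Timings and allocation counts depend on the machine and package versions, so none are quoted here.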

nalimilan · 2021-09-05T16:57:14Z

I meant that we should switch the implementation, not the docs.

dnabanita7 · 2021-09-06T05:24:44Z

oh sure, right!

dnabanita7 · 2021-09-14T00:26:41Z

cc @nalimilan

nalimilan

Thanks!

fixed floating point error

651b14b

Update counts.jl

40beeae

dnabanita7 added 2 commits September 6, 2021 18:17

switched from inv() to /

e772ce1

changed inv() to /

b85e55e

nalimilan approved these changes Oct 5, 2021

nalimilan requested a review from andreasnoack October 5, 2021 18:23
