mass_absolute returns NaN for some negative time-series #286

plodocus · 2020-12-01T15:46:35Z

I found this while playing with the PAMAP dataset. Haven't been able to reproduce this with random numbers, so here's the relevant data and code.

import numpy as np
import stumpy

Q = np.array([-13.09, -14.1 , -15.08, -16.31, -17.13, -17.5 , -18.07, -18.07,
       -17.48, -16.24, -14.88, -13.56, -12.65, -11.93, -11.48, -11.06,
       -10.83, -10.67, -10.59, -10.81, -10.92, -11.15, -11.37, -11.53,
       -11.19, -11.08, -10.48, -10.14,  -9.92,  -9.99, -10.11,  -9.92,
        -9.7 ,  -9.47,  -9.06,  -9.01,  -8.79,  -8.67,  -8.33,  -8.  ,
        -8.26,  -8.  ,  -7.54,  -7.32,  -7.13,  -7.24,  -7.43,  -7.93,
        -8.8 ,  -9.71])
print(stumpy.core.mass_absolute(Q, Q))

I get nan, but of course it should be 0. The problem is floating-point imprecision in _mass_absolute: the squared distance comes out slightly negative, and np.sqrt returns nan for negative input.

Easy to fix, of course. A test for this edge case should be added as well.
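
For readers who want to see the failure in isolation, here is a minimal sketch. The array `D` holds made-up values standing in for squared distances (they are not taken from stumpy's internals); the first entry mimics a result that should be exactly 0 but ended up slightly negative through rounding.

```python
import numpy as np

# Illustrative values only: a squared distance that should be exactly 0
# but came out slightly negative because of rounding.
D = np.array([-1e-13, 0.0, 4.0])

# Suppress the "invalid value encountered in sqrt" RuntimeWarning for the demo.
with np.errstate(invalid="ignore"):
    raw = np.sqrt(D)  # first element is nan

# Replacing negative entries with 0 before the square root avoids the nan.
clamped = np.sqrt(np.where(D < 0, 0.0, D))

print(raw)
print(clamped)
```

Only the negative entry is affected by the clamp; non-negative entries pass through unchanged.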

seanlaw · 2020-12-01T18:51:00Z

@DanBenHa Thank you for this reproducer! You're right. It looks like it is trying to take the square root of a very small negative number, so maybe we should just replace the final step with:

D = Q_squared + T_squared - 2 * QT
D[D < 0] = 0.0
return np.sqrt(D)

And then, as you said, add this to our test suite. Would you like to submit a PR for this? Otherwise, I can do it.
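
A regression test along these lines might look like the sketch below. `distance_from_terms` is a hypothetical stand-in for the last step of `_mass_absolute` (the real function's signature is not shown in this thread), and the inputs are invented so that the first squared distance lands just below zero.

```python
import numpy as np


def distance_from_terms(Q_squared, T_squared, QT):
    """Hypothetical stand-in for the final step of _mass_absolute."""
    D = Q_squared + T_squared - 2 * QT
    D[D < 0] = 0.0  # clamp rounding residue before the square root
    return np.sqrt(D)


def test_tiny_negative_residue_is_not_nan():
    T_squared = np.array([2.0, 5.0])
    QT = np.array([2.0 + 1e-9, 1.0])  # first entry is slightly too large
    result = distance_from_terms(2.0, T_squared, QT)
    assert not np.any(np.isnan(result))
    assert result[0] == 0.0


test_tiny_negative_residue_is_not_nan()
```

A test against the real library would presumably call `stumpy.core.mass_absolute(Q, Q)` with the array from the report and assert that the result contains no nan.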

plodocus · 2020-12-01T20:08:52Z

Sure, I'll do it. My ad hoc solution was using more of numpy's syntactic sugar, i.e.
np.sqrt((Q_squared + T_squared - 2 * QT).clip(min=0))
Not sure if this is less maintainable, but I liked the one-liner ;)

seanlaw · 2020-12-01T20:19:06Z

Oooh, I like that (please go for it)! I learned something new too

plodocus · 2020-12-02T09:01:01Z

Ah, unfortunately numba doesn't support clip yet: numba/numba#3468
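
For reference, in plain NumPy the clamping variants discussed in this thread give the same result, as the sketch below shows on invented values. It also includes `np.maximum`, which was not proposed above. Whether any given spelling compiles under a particular numba version is not checked here and would need to be verified separately.

```python
import numpy as np

# Invented squared distances; the first one mimics negative rounding residue.
D = np.array([-2e-10, 0.0, 5.0, 9.0])

# Variant 1: boolean-mask assignment (done on a copy to keep D intact).
masked = D.copy()
masked[masked < 0] = 0.0

# Variant 2: the ndarray.clip one-liner.
clipped = D.clip(min=0)

# Variant 3: element-wise maximum against 0.
maxed = np.maximum(D, 0.0)

print(np.array_equal(masked, clipped), np.array_equal(masked, maxed))
print(np.sqrt(maxed))
```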

seanlaw · 2020-12-02T18:56:58Z

Here are some reference discussions (but no solution):

From what I can tell, the issue may be coming from imprecision in the sliding dot product.
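
To illustrate how a dot product computed two different ways can disagree in the last few bits, here is a rough sketch. It assumes an FFT-based convolution for the sliding dot product; `fft_dot` is a hypothetical helper written for this example and is not stumpy's implementation. The input is the first eight values of `Q` from the report.

```python
import numpy as np


def fft_dot(Q, T):
    """Dot product of equal-length Q and T via FFT convolution (illustrative)."""
    n = len(Q)
    size = 2 * n  # long enough to hold the full linear convolution
    conv = np.fft.irfft(np.fft.rfft(T, size) * np.fft.rfft(Q[::-1], size), size)
    return conv[n - 1]  # the entry where Q and T overlap completely


Q = np.array([-13.09, -14.1, -15.08, -16.31, -17.13, -17.5, -18.07, -18.07])

direct = np.dot(Q, Q)    # plays the role of Q_squared and T_squared
via_fft = fft_dot(Q, Q)  # plays the role of QT

# Mathematically 0; in floating point it may be a tiny value of either sign.
residue = direct + direct - 2 * via_fft
print(residue)
```

The sign and size of `residue` depend on the platform and FFT backend, so no particular value is claimed here. When it happens to be negative, taking the square root directly yields nan.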

plodocus · 2020-12-03T08:44:54Z

Thanks for posting the references!

seanlaw added the bug label Dec 1, 2020

plodocus added a commit to plodocus/stumpy that referenced this issue Dec 2, 2020

Min-clip values at 0 in _mass_absolute

593098f

Fixes TDAmeritrade#286

plodocus mentioned this issue Dec 2, 2020

Fix mass absolute nan #288

Merged

seanlaw closed this as completed in #288 Dec 2, 2020

seanlaw pushed a commit that referenced this issue Dec 2, 2020

Fixed #286 mass absolute nan (#288)

94cbda2

* test mass_absolute for nan caused by float imprecision
* Min-clip values at 0 in _mass_absolute

Fixes #286

seanlaw assigned plodocus Dec 8, 2020
