Diff transformer #9

solalatus · 2019-10-13T17:36:15Z

Hi,

Pdpipe is marvelous, very nice tool!

One possible addition I was wondering about:
For time series, de-trending is a common operation, done by taking the first difference of the data, e.g. with pd.DataFrame.diff(1). The problem is that the initial value gets dropped and there is no easy way to "back-transform".
A "fittable", scikit-learn-like transformer could come in handy here.
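
To make the idea concrete, here is a minimal sketch of what such a transformer might look like. It is not the code from the gist and not part of pdpipe; the class name `DiffTransformer` and its method names are assumptions that simply mirror the scikit-learn fit/transform convention, and it only handles a lag of 1.

```python
import pandas as pd


class DiffTransformer:
    """Hypothetical first-difference transformer that can undo itself."""

    def fit(self, df):
        # Remember the first row, since diff() would otherwise lose it.
        self.first_row_ = df.iloc[:1].copy()
        return self

    def transform(self, df):
        # First difference; drop the leading row, which diff() fills with NaN.
        return df.diff(1).iloc[1:]

    def fit_transform(self, df):
        return self.fit(df).transform(df)

    def inverse_transform(self, diffed):
        # Put the stored first row back, then accumulate the differences.
        return pd.concat([self.first_row_, diffed]).cumsum()


# Toy series, made up for illustration only.
df = pd.DataFrame({"value": [10.0, 12.0, 15.0, 11.0]})

differ = DiffTransformer()
diffed = differ.fit_transform(df)
restored = differ.inverse_transform(diffed)
```

The only state needed for the back-transform is the first row, which is why a fit step is enough to make the operation reversible.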

I have sketched such a thing for myself here: https://gist.github.com/solalatus/9a3fc5330e7c0cd83e61094db75d2dc3

Can this be interesting as an addition?
Many thanks!


shaypal5 · 2019-12-03T08:30:55Z

Yes, definitely! Though I have to say no pdpipe stage at the moment has an inverse_transform method, so you still won't have invertible pipelines... :|

solalatus · 2019-12-03T10:49:41Z

Well, the scikit-learn-dependent ones might. Or am I mistaken?

shaypal5 · 2019-12-21T19:40:55Z

Yep, the sklearn ones can definitely be made invertible.

The NLTK ones for sure can't. For example, if you drop rare tokens or stem words, you have no way to go back, as these are transformations that map many different inputs to the same output (e.g. "grabbing" and "grabbed" to "grab").
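
The many-to-one point can be shown with a small sketch. The `toy_stem` function below is a deliberately crude, hypothetical stand-in and not NLTK's stemming algorithm; it only serves to show that once several words collapse to one stem, no inverse can tell them apart.

```python
from collections import defaultdict


def toy_stem(word):
    """Hypothetical, crude stemmer used only for illustration."""
    for suffix in ("ing", "ed"):
        if word.endswith(suffix):
            stem = word[: -len(suffix)]
            # Undo consonant doubling, e.g. "grabb" becomes "grab".
            if len(stem) >= 2 and stem[-1] == stem[-2]:
                stem = stem[:-1]
            return stem
    return word


words = ["grabbing", "grabbed", "grab"]

# Group the original words by the stem they map to.
preimages = defaultdict(set)
for w in words:
    preimages[toy_stem(w)].add(w)

# A stem with more than one preimage cannot be mapped back unambiguously.
ambiguous = {stem for stem, originals in preimages.items() if len(originals) > 1}
```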

shaypal5 added the enhancement, help wanted, and good first issue labels on Dec 3, 2019
