Walkthrough entropy #70

Open · kahaaga wants to merge 13 commits into main
Conversation

@kahaaga (Member) commented Aug 25, 2022

What is this PR?

This PR implements the walkthrough entropy (Stoop et al., 2021) for a symbol sequence x.

Walkthrough entropy is the first step towards implementing excess entropy (#69), which is just a normalized and averaged version of the walkthrough entropy, but it is a useful method in itself - hence this PR.

Excess entropy will be pursued in another PR. The reason is that I'm having some trouble understanding the implementation of the normalization step (see the comment in the _walkthrough_entropy function docstring). I will investigate this further and submit a PR when ready (if you, @Datseris, or anyone else has input, feel free to comment).

Interface

  • walkthrough_entropy(x, n) computes the walkthrough entropy for x at position(s) n, where 1 <= n <= length(x).
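
A minimal usage sketch (the data and positions are made up; only the call signature follows the bullet above, and whether a range of positions is accepted is an assumption):

```julia
using Entropies  # assuming this PR's target package

x = rand(1:3, 500)                        # some symbol sequence
h5 = walkthrough_entropy(x, 5)            # walkthrough entropy at position n = 5
hs = walkthrough_entropy(x, 1:length(x))  # at every position, if a range is accepted
```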

Internal changes

  • Implemented a generic EntropyGenerator struct with a corresponding entropygenerator(x, method [, rng]) (like we do in TimeseriesSurrogates); see the sketch after this list. Why? Walkthrough entropy is a function of the position n, and without a generator, we'd need to repeat the initial calculations (histogram estimation) for every call, at a cost that grows linearly with length(x).
  • For completeness, other methods should implement entropygenerator(x, method [, rng]) too, but I haven't done so yet, pending feedback on this approach.
  • Added another histogram method, vec_countmap(x), which returns both the unique elements of x and their frequencies. The element type of the frequencies can be customized (it defaults to BigInt, because the binomial calculations for the walkthrough entropy overflow machine integers for large n/N).
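
To make the generator idea concrete, here is a rough, self-contained sketch of the pattern. The method type WalkthroughEntropy, the field names, and the vec_countmap implementation are placeholders for illustration; the PR's actual definitions may differ.

```julia
struct WalkthroughEntropy end  # placeholder method type, for illustration only

# Count the unique elements of x and their frequencies. BigInt counts keep the
# binomial coefficients needed by the walkthrough entropy exact for long sequences.
function vec_countmap(x, T::Type = BigInt)
    u = unique(x)
    freqs = [T(count(==(xi), x)) for xi in u]
    return u, freqs
end

# The generator caches the expensive one-off work (the histogram), so evaluating
# the measure at many positions n does not redo it.
struct EntropyGenerator{M, X, U, F}
    method::M
    x::X
    unique_symbols::U
    frequencies::F
end

function entropygenerator(x, method::WalkthroughEntropy)
    u, freqs = vec_countmap(x)  # histogram estimated once, up front
    return EntropyGenerator(method, x, u, freqs)
end

g = entropygenerator(rand(1:3, 1000), WalkthroughEntropy())
# The entropy at each position n can now be computed from g.unique_symbols and
# g.frequencies without re-estimating the histogram; the actual formula lives in
# src/walkthrough/walkthrough_entropy.jl.
```

This mirrors the generator approach in TimeseriesSurrogates: pay the setup cost once, then query cheaply for many n.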

(Currently) Unused files

  • walkthrough_prob.jl is currently unused, but is included for completeness for reproducing Stoop et al. (2021). It is conceivable that these methods become useful in some future algorithm. Note: the factorials quickly blow up, so these functions should only be used for experimentation (see the example below).
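
To illustrate the blow-up (numbers chosen arbitrarily): Julia's machine-integer factorial and binomial overflow quickly, which is why the BigInt path exists.

```julia
factorial(20)                  # 2432902008176640000, still fits in Int64
# factorial(21)                # throws OverflowError on 64-bit integers
factorial(big(21))             # exact, as a BigInt

binomial(66, 33)               # still fits in Int64
# binomial(67, 33)             # throws OverflowError
binomial(big(1000), big(500))  # exact, but slow and allocation-heavy
```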

Potential future improvements

  • Use specialized methods for the BigInt calculations. These are quite slow and allocate a lot at the moment.

Testing

The original paper doesn't provide any concrete examples to test against, so the tests are generic. However, in the documentation example, I successfully reproduce the basic examples in Figure 1 of Stoop et al. (2021).

References

  • Stoop, R. L., Stoop, N., Kanders, K., & Stoop, R. (2021). Excess entropies suggest the physiology of neurons to be primed for higher-level computation. Physical Review Letters, 127(14), 148101.

codecov bot commented Aug 25, 2022

Codecov Report

Merging #70 (dd10654) into main (3c95c69) will increase coverage by 1.81%.
The diff coverage is 91.83%.

@@            Coverage Diff             @@
##             main      #70      +/-   ##
==========================================
+ Coverage   79.32%   81.13%   +1.81%     
==========================================
  Files          21       25       +4     
  Lines         624      721      +97     
==========================================
+ Hits          495      585      +90     
- Misses        129      136       +7     
Impacted Files Coverage Δ
src/walkthrough/walkthrough_entropy.jl 87.75% <87.75%> (ø)
src/histogram_estimation.jl 95.91% <92.00%> (-0.09%) ⬇️
src/api.jl 100.00% <100.00%> (ø)
src/walkthrough/walkthrough.jl 100.00% <100.00%> (ø)
src/walkthrough/walkthrough_prob.jl 100.00% <100.00%> (ø)


@kahaaga kahaaga requested a review from Datseris August 25, 2022 08:19
@Datseris (Member)

I'll need to review this in detail before we merge, but a detailed review may take a month or two...

@kahaaga (Member, Author) commented Aug 25, 2022

> I'll need to review this in detail before we merge, but a detailed review may take a month or two...

No pressure! I'm just tagging you for review here so you're aware of the PR.

I will probably be able to figure out the missing normalization step in the meantime, so that the excess entropy estimator can also be included by then.

@kahaaga (Member, Author) commented Dec 29, 2023

After the package redesign, it is clear that the walkthrough entropy isn't really an entropy, but it is an information measure of some sort according to our definition, since it is a function of probabilities. It is possible to create an OutcomeSpace that follows the reasoning in the paper and to define counts over the outcomes, and this outcome space is worth having. However, it isn't clear to me how these counts would relate to the probabilities used in the paper to define the "walkthrough entropy". I have to dig a bit deeper.

This PR will be modified accordingly.
