[WIP] An implementation of discontiguous sampling of the SRerf variant; we call it MTORF(?) #353

adam2392 · 2020-12-01T17:16:51Z

Summary

@ChesterHuynh and I were interested in extending the SRerf variant that seems work very well on low-sample image datasets to low-sample multivariate-time series (mts). A corresponding issue was created here: adam2392#1 to discuss and design how this might look. This PR addresses the issue raised and implements MTORF(?).

We would love some feedback and potentially get this merged in so that way we can "pip install" this variant.

Details of Implementation

Assuming that mts are structured as (S x T), where S are time series signals and T is time, then MTORF essentially discontiguizes the sampling along the row dimensions, while keeping contiguous chunks in time (T).

@ChesterHuynh did a c++ implementation in the code that is attached and we have been running experiments to further some studies we have. I will summarize them here below.

Studies to Back it up

Simulation of a Multivariate Gaussian With Noisy Samples in Between
First, we did a simulation study that takes a 3-dim Gaussian and then generate 3 white noise signals. We generate ~1000 samples of each. Then we stack them as such:

signal = 3-dim Gaussian
noise_1 = white noise
noise_2 = white noise
noise_3 = white noise

# this is now a 6 x 1000 array
noisy_signal = np.concatenate((signal[0], noise_1, signal[1], noise_2, signal[2], noise_3), axis=0)

This was the result:

This essentially demonstrates when MTORF vs SRERF is desirable. This motivated us to then proceed w/ some real data.

Classification task for epilepsy:
I used this variant when I set up an epilepsy outcome classification task based on the quantiles of features computed from iEEG data around a seizure onset. It was very helpful because I was able to utilize the fact that my input matrix was correlated in time, but I did not have to impose that each of the quantiles were correlated to its neighboring quantiles (SRerf vs MTORF). This example is a bit difficult to explain, so happy to add more details if desired.
motor decoding from iEEG data:
Chester and I are currently working on a research project trying to decode motor movements (L, R, Up, Down) from iEEG signals. We hypothesize that a subset of the iEEG data that we recorded is actually useful for decoding movement, and hence the MTORF variant is particularly useful.

Additional Information

Jesse helped me navigate where we might want to make the code change back in Feb 2020(?). Lol sorry for the delay in floating this back up. Jovo initially showed me the SRerf variant during I think a summer workshop he hosted. I prolly should do more tests comparing the different variants, but haven't found the time. I also briefly discussed things w/ Ronan and Hayden a long long time ago, so just trying to get this back on track :p.

Any critiques are appreciated.

…umber generation

…random number generation" This reverts commit dd381ae.

…eeding

…atrix and tried switching linear systems

git push

…v/mtorf

…pectral analysis

…_card for efri07

…v/mtorf

netlify · 2020-12-01T17:17:02Z

Deploy preview for rerf failed.

Built with commit fcfabaa

https://app.netlify.com/sites/rerf/deploys/5fc67a8a27fa4f00074b09d8

adam2392 · 2020-12-01T17:17:20Z

Currently some tests failed for me due to:

    def test_urerf(projection_matrix):
        n_samples = 100
        n_classes = 2
        X, y = make_blobs(
            n_samples=n_samples, centers=n_classes, n_features=2, random_state=2 ** 4
        )
    
        clf = UnsupervisedRandomForest(projection_matrix=projection_matrix)
        clf.fit(X)
        sim_mat = clf.transform()
    
        assert np.array_equal(sim_mat.diagonal(), np.ones(n_samples))
    
        cluster = AgglomerativeClustering(n_clusters=n_classes).fit(sim_mat)
        predict_labels = cluster.fit_predict(sim_mat)
        score = adjusted_rand_score(y, predict_labels)
>       assert score > 0.9
E       assert 0.48526863084922006 > 0.9

Not sure if this is related to us tho.

adam2392 · 2020-12-01T17:25:23Z

packedForest/src/forestTypes/binnedTree/processingNodeBin.h

+				} // END randMatStructured
+
+
+				inline void randMatMultivariateTimePatchv2(std::vector<weightedFeature>& featuresToTry, std::vector<std::vector<int> > patchPositions){


this version can be safely ignored. I need to get rid of this one.

ChesterHuynh and others added 30 commits March 18, 2020 16:21

Implemented multivariate time series projection. TODO: Check random n…

dd381ae

…umber generation

Revert "Implemented multivariate time series projection. TODO: Check …

1abf75c

…random number generation" This reverts commit dd381ae.

Implemented multivariate time series projection. TODO: Check random s…

5896228

…eeding

Adding mtorf demo.

e99fcc3

Adding demo of mtorf.

5335e6a

Adding updatees to mtorf.

2cb4a96

Adding mtorf.

2c1990d

Adding simulation for mtorf as a function of covariance factor.

d1edc00

Adding deescription to the notebook.

c638142

Ran a few more experiments.

bfe4082

Adding MT-MORF.

8bc0ee0

Adding kuramoto model.

e80f0db

Fixing the demo to include more info.

36cccfc

Adding deepESN model

ff9ab77

Removing .DS_Store

9085695

Adding MVAR(1) experiments

d60c905

Adding updated mts morf autoregressive exploration.

22c3373

Run ar experiment.

4d8d60c

Adding experiment output.

cedd168

Adding experiment output.

e23ed7a

Adding updated experimeents.

d94179c

added experiments with substituting white noise channels with 6x6 A m…

9eabe54

…atrix and tried switching linear systems

Adding bids conversion and io to read in epochs for decision making.

cddc8ce

Adding updated simulation experiment.

b525ef0

Adding refactored reading funcs.

f590f26

Changing demo to use discrete linear time system.

7a78e13

Adding temporarily bette rexperimeint.'

6acd870

git push

Adding updated example to load in data.

538041f

Merge branch 'dev/mtorf' of https://github.com/adam2392/SPORF into de…

c022cef

…v/mtorf

Running experiment and demoing loading of data.

7295eaf

adam2392 and others added 17 commits April 15, 2020 11:01

Running experiment and demoing loading of data.

3ee5dbd

Here's the interesting sim.

a6afafc

adding high-low card experiment and refactor simulation notebook

ec954b8

Code to run high-low card experiment on efri06. Added notebook with s…

2ce56ee

…pectral analysis

fixed seeding for shuffling row indices in mtsmorf, changing high_low…

8b225eb

…_card for efri07

Adding read and utils for cv experiment.

19d0ea9

Merge branch 'dev/mtorf' of https://github.com/adam2392/SPORF into de…

fcc829b

…v/mtorf

Adding read and utils for cv experiment.

b4d5681

Moving label binarizer.

a989056

Adding updated notebook to run through analyses.

fdc660d

Removing data files.

490e7e2

Syncing to run MOVE multiclass experiment with feature importance.

d0b1a96

Adding MOVE experiment scripts

a0ac907

Adding code changes to install venv.

cfb59ce

Adding updated exp.

51c0f1f

Fixing plot issues.

e08bf1a

Deleting all experiment files and putting into separate repository.:

fcfabaa

adam2392 commented Dec 1, 2020

View reviewed changes

adam2392 mentioned this pull request Mar 16, 2021

Scope out/write pseudocode for network-based sampling adam2392/motor-decoding#9

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] An implementation of discontiguous sampling of the SRerf variant; we call it MTORF(?) #353

[WIP] An implementation of discontiguous sampling of the SRerf variant; we call it MTORF(?) #353

adam2392 commented Dec 1, 2020 •

edited

netlify bot commented Dec 1, 2020 •

edited

adam2392 commented Dec 1, 2020

adam2392 Dec 1, 2020

		} // END randMatStructured


		inline void randMatMultivariateTimePatchv2(std::vector<weightedFeature>& featuresToTry, std::vector<std::vector<int> > patchPositions){

[WIP] An implementation of discontiguous sampling of the SRerf variant; we call it MTORF(?) #353

Are you sure you want to change the base?

[WIP] An implementation of discontiguous sampling of the SRerf variant; we call it MTORF(?) #353

Conversation

adam2392 commented Dec 1, 2020 • edited

Summary

Details of Implementation

Studies to Back it up

Additional Information

netlify bot commented Dec 1, 2020 • edited

adam2392 commented Dec 1, 2020

adam2392 Dec 1, 2020

Choose a reason for hiding this comment

adam2392 commented Dec 1, 2020 •

edited

netlify bot commented Dec 1, 2020 •

edited