[WIP] Refactor MSM #222

rmcgibbo · 2014-07-23T00:59:24Z

No description provided.

kyleabeauchamp · 2014-07-23T20:35:52Z

Am I supposed to be reviewing this yet?

rmcgibbo · 2014-07-23T20:42:17Z

Not yet. I'll let you know when I'm ready.

On Wed, Jul 23, 2014 at 1:35 PM, kyleabeauchamp notifications@github.com
wrote:

Am I supposed to be reviewing this yet?

—
Reply to this email directly or view it on GitHub
#222 (comment).

mpharrigan · 2014-07-23T21:55:52Z

Why not use reversible_type.lower() and use 'none' instead of None to simplify case-insensitivity

rmcgibbo · 2014-07-23T21:57:31Z

Sure. I'll use str(self.reversible_type).lower(), which should cover it.

rmcgibbo · 2014-07-24T00:20:43Z

Okay. Any review would be helpful at this point.

rmcgibbo · 2014-07-24T00:24:23Z

The major changes are

All matrices are dense
Explicitly support input data that is not integers in (0, ..., n_states). Many of the tests now use input data that are strings, for example (e.g. msm.fit([['a', 'b', 'a', 'a', ...]]). This means never assuming that n_states == np.max(sequences)+1

Also, I added an option to use more aggressive ergodic trimming, with a threshold higher than just a single count in both direction. This has come up at Pande group meeting.

mpharrigan · 2014-07-24T00:40:54Z

Issues with the docs:

default n_timescales is n_states - 3, this was from the sparse implementation
countsmat_ says it returns the symmetrized counts, whereas I think it returns raw counts

Thought:

Maybe don't have ergodic_trimming (bool) and trim_weight as two different parameters. Setting trim_weight to 0 would turn off ergodic trimming

rmcgibbo · 2014-07-24T01:02:14Z

Maybe don't have ergodic_trimming (bool) and trim_weight as two different parameters. Setting trim_weight to 0 would turn off ergodic trimming

+1. This is a good idea, I think.

Thanks for the other comments too.

rmcgibbo · 2014-07-24T06:48:45Z

@kyleabeauchamp: What do you think of the API for transform I implemented? https://github.com/rmcgibbo/mixtape/pull/222/files#diff-b4b9f1ef5ddb392515f54dc4b10fb560R183

rmcgibbo · 2014-07-28T23:33:54Z

I rebased onto master.

[WIP] Refactor MSM

kyleabeauchamp · 2014-07-29T18:56:44Z

Nice.
On Jul 29, 2014 2:56 PM, "Robert McGibbon" notifications@github.com wrote:

Merged #222 #222.

—
Reply to this email directly or view it on GitHub
#222 (comment).

rmcgibbo · 2014-07-29T18:59:37Z

There is one bug in the tests, on py3, which is from a numpy py3 regression (numpy/numpy#641), but this shouldn't effect practical use (only present when the sequences argument to fit contains sequences with mixed types, such as [1, 1, None, 'hello', 0, 1, ...]

rmcgibbo added 6 commits July 21, 2014 11:10

Started Prinz MLE

c050047

New MLE

3eefa15

tests

7511a0a

fix compile-time warning

20b322a

Add default to cython

64ad78e

Refactoring MSM class

d3e14fd

rmcgibbo added 3 commits July 23, 2014 14:22

add two files

46e7588

test with strings

5c931a2

more tests

0d2071e

rmcgibbo added 3 commits July 23, 2014 17:10

lots of stuff

9ac9afb

docstring

ef6085c

remove unused import

3585e26

a couple of @mpharrigan's comments and WIP stuff

9973d8c

rmcgibbo mentioned this pull request Jul 24, 2014

msm.timescales_ fails when n_states = 2 #218

Closed

rmcgibbo added 4 commits July 23, 2014 22:50

Fixes and tests

b05c3b6

Found bug when Nan is in the input sequence

82a73fa

Fix handling of nans in input data

12914f8

Document transform

cc0f382

rmcgibbo added 3 commits July 24, 2014 02:29

Some new stuff

b42c4c4

make it possible to just use 1d inputs, without wrapping in list

a95eb3c

add todo

6557614

rmcgibbo added 19 commits July 28, 2014 16:32

Fix output argument

0b39fae

Already normalized

bcb6b35

add test

0c739b2

added some trimming to test_12

c098408

Ergodic cutoff

794ef55

flake8

3ba95a0

futurize fixes

bbf4421

Mark some slow tests for skipping

1e35d8c

add verbose flag

8fa7019

more

51ecc49

fix

3d48a85

disable two tests

e1dee51

minor

8c10cea

more small fixes

838eae8

fitghmm doesnt need to reference featurizer.n_features

8533b07

fix csr_matrix import

735916f

xrange->range

4d6c43d

Merge branch 'master' of github.com:rmcgibbo/mixtape into refactor-msm

8c83a60

merge

ace4cd2

rmcgibbo added 2 commits July 28, 2014 16:45

fix FloatingPointError?

376645d

py3

9943efb

rmcgibbo added a commit that referenced this pull request Jul 29, 2014

Merge pull request #222 from rmcgibbo/refactor-msm

8d98fc7

[WIP] Refactor MSM

rmcgibbo merged commit 8d98fc7 into master Jul 29, 2014

rmcgibbo deleted the refactor-msm branch July 29, 2014 18:56

This was referenced Jul 30, 2014

Make timescales a @property? #113

Closed

Segfault in Reversibility Solver #163

Closed

msm.mapping_ #186

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Refactor MSM #222

[WIP] Refactor MSM #222

rmcgibbo commented Jul 23, 2014

kyleabeauchamp commented Jul 23, 2014

rmcgibbo commented Jul 23, 2014

mpharrigan commented Jul 23, 2014

rmcgibbo commented Jul 23, 2014

rmcgibbo commented Jul 24, 2014

rmcgibbo commented Jul 24, 2014

mpharrigan commented Jul 24, 2014

rmcgibbo commented Jul 24, 2014

rmcgibbo commented Jul 24, 2014

rmcgibbo commented Jul 28, 2014

kyleabeauchamp commented Jul 29, 2014

rmcgibbo commented Jul 29, 2014

[WIP] Refactor MSM #222

[WIP] Refactor MSM #222

Conversation

rmcgibbo commented Jul 23, 2014

kyleabeauchamp commented Jul 23, 2014

rmcgibbo commented Jul 23, 2014

mpharrigan commented Jul 23, 2014

rmcgibbo commented Jul 23, 2014

rmcgibbo commented Jul 24, 2014

rmcgibbo commented Jul 24, 2014

mpharrigan commented Jul 24, 2014

rmcgibbo commented Jul 24, 2014

rmcgibbo commented Jul 24, 2014

rmcgibbo commented Jul 28, 2014

kyleabeauchamp commented Jul 29, 2014

rmcgibbo commented Jul 29, 2014