Question: Why is d = k + 1 for Kaiser and Broken Stick? #11

danlurie · 2022-03-01T01:19:34Z

I noticed that the ID estimates provided by the Kaiser and broken_stick methods in id.lPCA are k + 1, where k is the number of components to be kept according to the most commonly used implementations of these rules (i.e. keep only components with an eigenvalue > 1 [Kaiser], or keep only components with greater than expected explained variance [broken stick]).

I'm wondering what the thinking was behind this choice, and if there are any papers I can cite justifying this modification.

Thanks!

The text was updated successfully, but these errors were encountered:

j-bac · 2022-04-01T09:58:41Z

I think this is non-standard indeed. It is mentioned in the docstring but might be confusing, I can make the change.
There are some alternative/modified versions of Kaiser, e.g. https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008591

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question: Why is d = k + 1 for Kaiser and Broken Stick? #11

Question: Why is d = k + 1 for Kaiser and Broken Stick? #11

danlurie commented Mar 1, 2022

j-bac commented Apr 1, 2022

Question: Why is d = k + 1 for Kaiser and Broken Stick? #11

Question: Why is d = k + 1 for Kaiser and Broken Stick? #11

Comments

danlurie commented Mar 1, 2022

j-bac commented Apr 1, 2022