Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Output GEM format is not tidy #265

Open
JohnHadish opened this issue Oct 31, 2022 · 0 comments
Open

Output GEM format is not tidy #265

JohnHadish opened this issue Oct 31, 2022 · 0 comments

Comments

@JohnHadish
Copy link
Collaborator

JohnHadish commented Oct 31, 2022

Description of feature

The output GEM format is not in the tidy format recognized by most other programs (everything except base r and KINC). We should consider changing this format. The only change that would be needed would be to change the first column from a rowname format to one where the gene name is considered a column as well. We would want to coordinate KINC with this as well.

While this would still not technically be True Tidy format, I think it would make it easier for users wanting to use the GEM in other programs.

Current format:

           Sample1 Sample 2 Sample3
Gene1       1              2            3
Gene2       2              3            4
Gene3       5              6            7

Proposed format:

gene    Sample1 Sample 2 Sample3
Gene1       1              2            3
Gene2       2              3            4
Gene3       5              6            7

True Tidy Format:

gene     sample     count
Gene1  Sample1     1
Gene2  Sample1     2
Gene3  Sample1     3
Gene1  Sample1     2
Gene2  Sample1     3
Gene3  Sample1     4
Gene1  Sample1     5
Gene2  Sample1     6
Gene3  Sample1     7
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant