Not all attributes are being exported #210

Open
returnOfTheYeti opened this issue Feb 12, 2024 · 6 comments
Labels
enhancement New feature or request

Comments

@returnOfTheYeti

In the SRA database, both the run info and the XML show attributes such as "GISAID_Accession" and "SARS-CoV-2_diagnostic_pcr_Ct_value_1" for certain samples (see the link below).

https://www.ncbi.nlm.nih.gov/sra/?term=SRR15168846

But when I extract the detailed data for this sample using:

pysradb metadata SRR15168847 --detailed | head

the attributes mentioned above are missing from the pysradb output. Is there any way to retrieve ALL of the metadata, or at least specific attributes that are not included in the "detailed" setting?

I downloaded pysradb on Feb 12, 2024 via conda.

returnOfTheYeti added the "bug" (Something isn't working) label on Feb 12, 2024
saketkc added the "enhancement" (New feature or request) label and removed the "bug" (Something isn't working) label on Feb 14, 2024
@saketkc
Owner

saketkc commented Feb 14, 2024

Thank you for the suggestion! It will take me a while to get to this, but we always encourage PRs, especially since you already know what is going on in the XML. Let me know if you need any help.

@returnOfTheYeti
Author

returnOfTheYeti commented Feb 21, 2024 via email

@arcones

arcones commented Feb 21, 2024

If you don't want to use grep, you could write a Python script that selects the fields you want, for example:

from pysradb import SRAweb

[...]
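srp = 'SRP481544'
# srp_to_srr returns a pandas DataFrame with one row per run
raw_pysradb_data_frame = SRAweb().srp_to_srr(srp)
# Select the column you want; replace 'run_accession' with any other column name
srrs = list(raw_pysradb_data_frame['run_accession'])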
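
The same selection can be done without pandas if the metadata table has already been saved as tab-separated text. The following is only a minimal sketch under that assumption: the helper `select_fields`, the column `hypothetical_attribute`, and every value in the table are made up for illustration and do not come from a real project.

```python
import csv
import io

# Placeholder table standing in for a saved, tab-separated metadata file.
# The column "hypothetical_attribute" and all values are invented.
EXAMPLE_TSV = (
    "run_accession\tstudy_accession\thypothetical_attribute\n"
    "SRR0000001\tSRP0000001\tvalue_a\n"
    "SRR0000002\tSRP0000001\tvalue_b\n"
)


def select_fields(tsv_text, wanted):
    """Keep only the wanted columns that actually exist in the table."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    # Columns that are requested but absent are silently skipped.
    present = [name for name in wanted if name in (reader.fieldnames or [])]
    return [{name: row[name] for name in present} for row in reader]


rows = select_fields(
    EXAMPLE_TSV,
    ["run_accession", "hypothetical_attribute", "not_a_column"],
)
for row in rows:
    print(row)
```

Skipping absent columns instead of raising an error is a deliberate choice here, since non-standard attributes may exist for some projects and not for others.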

@saketkc
Owner

saketkc commented Feb 22, 2024

Hi @returnOfTheYeti, I would go with @arcones' recommendation here.

@returnOfTheYeti
Author

returnOfTheYeti commented Feb 28, 2024 via email

@saketkc
Owner

saketkc commented Feb 29, 2024

These fields are not standard: they are not defined for every project, and hence they are currently not supported.
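
For anyone who wants to pull such per-project attributes out of the XML themselves in the meantime, here is a rough sketch using only the Python standard library. It assumes the record stores each attribute as a TAG/VALUE pair inside a SAMPLE_ATTRIBUTE element. The XML snippet, the accession, and the values are hypothetical placeholders, and `sample_attributes` is an illustrative helper, not part of pysradb.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed-down stand-in for an SRA record.
# The accession and values are placeholders, not real data for any run.
EXAMPLE_XML = """
<EXPERIMENT_PACKAGE_SET>
  <EXPERIMENT_PACKAGE>
    <SAMPLE accession="SRS0000000">
      <SAMPLE_ATTRIBUTES>
        <SAMPLE_ATTRIBUTE>
          <TAG>GISAID_Accession</TAG>
          <VALUE>EPI_ISL_PLACEHOLDER</VALUE>
        </SAMPLE_ATTRIBUTE>
        <SAMPLE_ATTRIBUTE>
          <TAG>SARS-CoV-2_diagnostic_pcr_Ct_value_1</TAG>
          <VALUE>0.0</VALUE>
        </SAMPLE_ATTRIBUTE>
      </SAMPLE_ATTRIBUTES>
    </SAMPLE>
  </EXPERIMENT_PACKAGE>
</EXPERIMENT_PACKAGE_SET>
"""


def sample_attributes(xml_text):
    """Return {TAG: VALUE} for every SAMPLE_ATTRIBUTE element in the XML."""
    root = ET.fromstring(xml_text)
    attributes = {}
    for attribute in root.iter("SAMPLE_ATTRIBUTE"):
        tag = attribute.findtext("TAG")
        if tag is None:
            continue  # skip malformed entries that have no TAG
        # VALUE may be missing; store an empty string in that case
        attributes[tag.strip()] = (attribute.findtext("VALUE") or "").strip()
    return attributes


attrs = sample_attributes(EXAMPLE_XML)
for name in ("GISAID_Accession", "SARS-CoV-2_diagnostic_pcr_Ct_value_1"):
    print(name, "=", attrs.get(name))
```

Because the attribute names are read from the record rather than hard-coded, the same function collects whatever tags a given project happens to define.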
