Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat-stoichiometry coefficients for subunits in the enzyme complex #297

Open
feiranl opened this issue Jan 31, 2022 · 10 comments
Open

Feat-stoichiometry coefficients for subunits in the enzyme complex #297

feiranl opened this issue Jan 31, 2022 · 10 comments

Comments

@feiranl
Copy link
Collaborator

feiranl commented Jan 31, 2022

Description of the issue:

  • Stoichiometry coefficients are missing in current GEM but available in Complex Portal and PDB database. Should we include this information and which field is suitable to store such info?
@hongzhonglu
Copy link
Collaborator

Maybe firstly store such information in a tsv file firstly? @feiranl @edkerk

@edkerk
Copy link
Member

edkerk commented Mar 29, 2022

That's a good idea. It's not that straightforward where this information should be stored. In the long run it should ideally be kept in the model file, and not just as a separate TSV. But this raises three issues:

  • Neither RAVEN nor COBRA MATLAB structures have a dedicated field for this. Of course we could define a new model field/structure, but this should be done smartly.
  • It should also be properly recorded in the SBML file. I don't think the FBCv2 package has a way to record this.
  • RAVEN functions should be modified to consider this new field when making modifications to the model structure. For instance, if a reaction associated to a complex is deleted, then this complex should also be deleted. Similar as how removeReactions also removes genes and annotations when required.

This is not of the highest priority, but would be good to start a discussion on how to overcome especially the first two issues.

@hongzhonglu
Copy link
Collaborator

hongzhonglu commented Mar 29, 2022

After we have the curated datasets. we can put it under this folder https://github.com/SysBioChalmers/yeast-GEM/tree/main/data/databases?

@mihai-sysbio
Copy link
Member

mihai-sysbio commented Mar 29, 2022

  • Stoichiometry coefficients are missing in current GEM but available in Complex Portal and PDB database.

I'm really unsure about which coefficients this discussion is about, could you give an example?

@edkerk
Copy link
Member

edkerk commented Mar 29, 2022

The number of protein subunits that make up a protein complex. It is not always just 1 copy of each subunit. See for instance pyruvate dehydrogenase: https://www.ebi.ac.uk/complexportal/complex/CPX-3207:

image

This would for instance make a difference if the model would be turned into an ec-model, but there are likely also other use cases.

@mihai-sysbio
Copy link
Member

This would for instance make a difference if the model would be turned into an ec-model, but there are likely also other use cases.

Thanks for the example @edkerk. Wouldn't then this be more useful to be dealt with by a future version of GECKO instead? I'm thinking that it would make more sense for this information to be following the same structure regardless of the model.

@edkerk
Copy link
Member

edkerk commented Mar 29, 2022

But that is if GECKO is the only purpose. GECKO should be modified to be able to deal with such information anyway (it currently assumes 1 copy per subunit), but should that information only be provided in GECKO, or distributed as part of the generic yeast-GEM, also ready for other applications?

@hongzhonglu
Copy link
Collaborator

generic yeast-GEM, also ready for other applications

Seems there no direct applications of this information in yeast-GEM?

@mihai-sysbio
Copy link
Member

I'm very hesitant to "copy" external data in any repository, unless it would be useful for a good chunk of the userbase.

also ready for other applications?

I see (but I don't know how often this would be the case). Perhaps what can be stored in this repository is a script that fetches the data via an API, and maybe does some reformatting, but not the data itself, since that will by default get stale so it requires work to keep up to date.

@feiranl
Copy link
Collaborator Author

feiranl commented Mar 29, 2022

I agree with Mihail. But here it is not just "copy". We get this stoi info from ComplexPortal and information from PDB database. The script extracts protein structures from PDB database, mapping the subunit through sequence alignment and then find the stoi.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants