Map all annotations to BiGG #20

draeger · 2017-03-01T11:00:52Z

Suggested enhancement by @tpfau: Look over all present annotations and map every annotation that can be mapped to BiGG. For instance, if there is a KEGG compound annotation that compound will be assigned its corresponding BiGG id along with all other annotations available in BiGG. Especially since that annotation data is already present in the BiGG Models Database, this would make ModelPolisher much more useful.

As long as ModelPolisher only relies on BiGG ids as an input this will always require manual matching of the original id used to BiGG ids or assume that the model originally used BiGG ids. It would be much better to make it database dependent.

The text was updated successfully, but these errors were encountered:

mephenor · 2020-01-27T16:15:21Z

While this has been implemented during GSoC19, proper testing of the feature has not taken place yet.
As discussed some models containing annotations from BioModels could be used for initial manual testing and converted into test cases later on, after validating that 1) additional annotations are obtained and 2) those annotations are in fact accurate.

mephenor · 2020-03-29T04:08:05Z

Finding a good BioModels subset is a task in itself, so this should likely be done differently.
Polishing one model with BiGGIds twice, once with the correct id and once with a scrambled variant should be a valid test for this functionality.
Setting up a database for this testing procedure is currently the problem here, as discussed.
This will be done after the beta release.

Schmoho · 2022-08-02T11:46:04Z

For species this seems to work as expected:

ModelPolisher/src/test/java/edu/ucsd/sbrg/bigg/annotation/SpeciesAnnotationTest.java

Lines 61 to 88 in 876bec7

    
           @Test 
        
           public void unknownMetaboliteCanBeInferredFromCV() { 
        
               var m = new Model(3, 2); 
        
               var s = m.createSpecies("big_chungus"); 
        
               var cvTerm = new CVTerm(); 
        
               cvTerm.setQualifier(CVTerm.Qualifier.BQB_IS); 
        
               cvTerm.addResource("http://identifiers.org/reactome.compound/113592"); 
        
               s.addCVTerm(cvTerm); 
        
               var annotator = new SpeciesAnnotation(s); 
        
               annotator.annotate(); 
        
               assertEquals("big_chungus", s.getId()); 
        
               assertEquals("ATP C10H12N5O13P3", s.getName()); 
        
               assertEquals("SBO:0000240", s.getSBOTermID()); 
        
               assertEquals(1, s.getCVTermCount()); 
        
               assertEquals(30, s.getCVTerm(0).getNumResources()); 
        
               assertCVTermIsPresent(s, 
        
                       CVTerm.Type.BIOLOGICAL_QUALIFIER, 
        
                       CVTerm.Qualifier.BQB_IS, 
        
                       "http://identifiers.org/reactome.compound/113592"); 
        
               assertCVTermsArePresent(s, 
        
                       CVTerm.Type.BIOLOGICAL_QUALIFIER, 
        
                       CVTerm.Qualifier.BQB_IS, 
        
                       expectedATPAnnotations, 
        
                       "Expected uris are not present."); 
        
           }

Schmoho · 2022-08-02T12:12:01Z

For reactions it also kind of works like expected, however there is an issue with foreign IDs that map to more than one BiGG-ID: those are discarded.

ModelPolisher/src/test/java/edu/ucsd/sbrg/bigg/annotation/ReactionAnnotationTest.java

Lines 21 to 78 in 62b6b21

    
           @Test 
        
           public void getBiGGIdFromResourcesTest() { 
        
               initParameters(); 
        
               var m = new Model("iJO1366", 3, 2); 
        
               var r1 = m.createReaction("some_name"); 
        
               var r2 = m.createReaction("some_other_name"); 
        
               var r3 = m.createReaction("some_third_name"); 
        
               r1.addCVTerm(new CVTerm( 
        
                       CVTerm.Type.BIOLOGICAL_QUALIFIER, 
        
                       CVTerm.Qualifier.BQB_IS, 
        
                       "http://identifiers.org/biocyc/META:ACETATEKIN-RXN")); 
        
               r2.addCVTerm(new CVTerm( 
        
                       CVTerm.Type.BIOLOGICAL_QUALIFIER, 
        
                       CVTerm.Qualifier.BQB_IS, 
        
                       "http://identifiers.org/metanetx.reaction/MNXR103371")); 
        
               r3.addCVTerm(new CVTerm( 
        
                       CVTerm.Type.BIOLOGICAL_QUALIFIER, 
        
                       CVTerm.Qualifier.BQB_IS, 
        
                       "http://identifiers.org/kegg.reaction/R00299")); 
        
               var gPlugin = (GroupsModelPlugin) m.getPlugin(GroupsConstants.shortLabel); 
        
               assertEquals(0, gPlugin.getGroupCount()); 
        
               new ReactionAnnotation(r1).annotate(); 
        
               new ReactionAnnotation(r2).annotate(); 
        
               new ReactionAnnotation(r3).annotate(); 
        
               var r1FbcPlugin = (FBCReactionPlugin) r1.getPlugin(FBCConstants.shortLabel); 
        
               var gpa1 =  r1FbcPlugin.getGeneProductAssociation(); 
        
               assertNull(gpa1); 
        
               assertEquals(false, r1.isSetCompartment()); 
        
               assertEquals("", r1.getName()); 
        
               assertEquals(1, r1.getCVTermCount()); 
        
               assertEquals(1, r1.getCVTerm(0).getNumResources()); 
        
               assertEquals(1, r2.getCVTermCount()); 
        
               assertEquals(1, r2.getCVTerm(0).getNumResources()); 
        
               var r3FbcPlugin = (FBCReactionPlugin) r3.getPlugin(FBCConstants.shortLabel); 
        
               var gpa3 =  r3FbcPlugin.getGeneProductAssociation(); 
        
               assertNotNull(gpa3); 
        
               assertEquals("G_b2388", ((GeneProductRef) gpa3.getAssociation()).getGeneProduct()); 
        
               assertEquals(false, r1.isSetCompartment()); 
        
               assertEquals("", r1.getName()); 
        
               assertEquals(1, r3.getCVTermCount()); 
        
               assertEquals(11, r3.getCVTerm(0).getNumResources()); 
        
               assertEquals(1, gPlugin.getGroupCount()); 
        
               assertEquals("glycolysis/gluconeogenesis", gPlugin.getGroup(0).getName()); 
        
               assertEquals(Set.of("some_third_name"), gPlugin.getGroup(0) 
        
                       .getListOfMembers().stream().map(Member::getIdRef).collect(Collectors.toSet())); 
        
               assertFalse(r3.isSetListOfReactants()); 
        
               assertFalse(r3.isSetListOfProducts()); 
        
           }

Schmoho · 2022-08-02T15:30:19Z

Running

select distinct r.bigg_id as reaction_bigg_id, c.bigg_id as compartment_bigg_id, c.name as compartment_name
from reaction_matrix rm, compartmentalized_component cc, compartment c, reaction r
where rm.reaction_id in (select ome_id
                      from synonym
                      where synonym ilike '%ACETATEKIN-RXN%')
           and rm.compartmentalized_component_id = cc.id
           and cc.compartment_id = c.id
           and rm.reaction_id = r.id;

yields

"reaction_bigg_id"	"compartment_bigg_id"	"compartment_name"
"ACKr"	                "c"	"cytosol"
"ACKrh"	                "h"	 "chloroplast"
"ACKrm"	                "m"	  "mitochondria"

Schmoho · 2022-08-02T15:37:46Z

The offending code is here:

ModelPolisher/src/main/java/edu/ucsd/sbrg/db/BiGGDB.java

Lines 753 to 758 in 62b6b21

    
           results = results.stream().filter(biggId -> biggId != null && !biggId.isEmpty()).collect(Collectors.toSet()); 
        
           if (results.size() == 1) { 
        
             return Optional.of(results.iterator().next()); 
        
           } else { 
        
             return Optional.empty(); 
        
           }

Unfortunately this is somewhat deep in the stack and embedded in creative attempts at code deduplication.

getBiggIdFromParts:329, BiGGAnnotation (edu.ucsd.sbrg.bigg.annotation)
lambda$getBiGGIdFromResources$1:306, BiGGAnnotation (edu.ucsd.sbrg.bigg.annotation)
apply:-1, 28318221 (edu.ucsd.sbrg.bigg.annotation.BiGGAnnotation$$Lambda$607)
flatMap:294, Optional (java.util)
getBiGGIdFromResources:306, BiGGAnnotation (edu.ucsd.sbrg.bigg.annotation)
checkId:91, ReactionAnnotation (edu.ucsd.sbrg.bigg.annotation)
annotate:58, ReactionAnnotation (edu.ucsd.sbrg.bigg.annotation)
getBiGGIdFromResourcesTest:50, ReactionAnnotationTest (edu.ucsd.sbrg.bigg.annotation)

Schmoho · 2022-08-02T18:43:42Z

Last commit introduced a change to the reaction annotations.
We now consider all potential reaction hits from foreign IDs and filter on matching compartment.
I.e. even if a foreign ID (e.g. a kegg ID) is associated with multiple BiGG-IDs, we only discard those that don't match the compartment of the reaction.
On the flip side, this will no longer annotate in case there is only a single hit but no matching compartment.

ModelPolisher/src/main/java/edu/ucsd/sbrg/db/BiGGDB.java

Lines 725 to 738 in 8e2b3e5

    
           var query = "SELECT R.BIGG_ID AS REACTION_BIGG_ID, " 
        
                   + "C.BIGG_ID AS COMPARTMENT_BIGG_ID, " 
        
                   + "C.NAME AS COMPARTMENT_NAME " 
        
                   + "FROM REACTION R " 
        
                   + "left join REACTION_MATRIX RM " 
        
                   + "on RM.REACTION_ID = R.ID " 
        
                   + "left join COMPARTMENTALIZED_COMPONENT CC " 
        
                   + "on RM.COMPARTMENTALIZED_COMPONENT_ID = CC.ID " 
        
                   + "left join COMPARTMENT C " 
        
                   + "on CC.COMPARTMENT_ID = C.ID " 
        
                   + "join synonym s " 
        
                   + "on synonym = ? and r.id = s.ome_id " 
        
                   + "join data_source d " 
        
                   + "on s.data_source_id = d.id and d.bigg_id = ?";

draeger added the enhancement label Mar 1, 2017

draeger assigned draeger and mephenor Mar 1, 2017

mephenor added this to Close open issues in Release 2.1 Nov 7, 2019

mephenor moved this from Close open issues to Backlog in Release 2.1 Jan 31, 2020

mephenor moved this from Backlog to Started in Release 2.1 Jan 31, 2020

mephenor moved this from Started to Mostly finished in Release 2.1 Jan 31, 2020

Schmoho closed this as completed May 3, 2022

Schmoho reopened this May 3, 2022

Release 2.1 automation moved this from Mostly finished to Backlog May 3, 2022

Schmoho moved this from Backlog to Mostly finished in Release 2.1 May 3, 2022

Schmoho moved this from Mostly finished to Started in Release 2.1 May 10, 2022

Schmoho added feature Issues that aim to introduce new feature in ModelPolisher. and removed enhancement labels May 10, 2022

Schmoho unassigned mephenor May 10, 2022

Schmoho moved this from In Progress to Todo in Release 2.1 Jul 26, 2022

Schmoho added a commit that referenced this issue Aug 2, 2022

issue #121 test #20

876bec7

Schmoho added a commit that referenced this issue Aug 2, 2022

issue #121 test #20

2e1e7c6

Schmoho added a commit that referenced this issue Aug 2, 2022

isse #121 test #20

57c0d9e

Schmoho added a commit that referenced this issue Aug 2, 2022

isse #121 test #20

62b6b21

Schmoho added a commit that referenced this issue Aug 2, 2022

issue #121 test #20

43136ee

Schmoho added a commit that referenced this issue Aug 2, 2022

issue #121 test #20

8e2b3e5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Map all annotations to BiGG #20

Map all annotations to BiGG #20

draeger commented Mar 1, 2017

mephenor commented Jan 27, 2020

mephenor commented Mar 29, 2020

Schmoho commented Aug 2, 2022 •

edited

Schmoho commented Aug 2, 2022 •

edited

Schmoho commented Aug 2, 2022

Schmoho commented Aug 2, 2022 •

edited

Schmoho commented Aug 2, 2022

Map all annotations to BiGG #20

Map all annotations to BiGG #20

Comments

draeger commented Mar 1, 2017

mephenor commented Jan 27, 2020

mephenor commented Mar 29, 2020

Schmoho commented Aug 2, 2022 • edited

Schmoho commented Aug 2, 2022 • edited

Schmoho commented Aug 2, 2022

Schmoho commented Aug 2, 2022 • edited

Schmoho commented Aug 2, 2022

Schmoho commented Aug 2, 2022 •

edited

Schmoho commented Aug 2, 2022 •

edited

Schmoho commented Aug 2, 2022 •

edited