Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change prefix of OMIM cross-references to MIM #1301

Closed
allenbaron opened this issue Feb 8, 2024 · 12 comments · Fixed by #1334
Closed

Change prefix of OMIM cross-references to MIM #1301

allenbaron opened this issue Feb 8, 2024 · 12 comments · Fixed by #1334
Assignees
Labels

Comments

@allenbaron
Copy link
Collaborator

OMIM made a request that we use MIM as a prefix instead of OMIM:

We prefer that resources use MIM as the CURIE for our numbers. The "O" stands for "online" which is redundant (see
sections 1.8 and 1.9 at https://omim.org/help/linking). We are linking to the Alliance of Genome Resources and when we requested this change, they directed us to you as the source of MIM versus OMIM prefix. Any chance you would change this?

I'm opening this issue so we can discuss and resolve this.

@allenbaron
Copy link
Collaborator Author

A few thoughts:

  1. Their team has worked hard to consolidate prefixes for their resource to MIM, instead of OMIM.
  2. It would be easy to implement this change. The biggest issue is how it might affect downstream users of the DO.

Here's my own personal opinion:

TL;DR - I wholly support this change and doing whatever work is needed to make it happen.

Reasoning - The creator and maintainer of a resource should be the one to define it and it is best to ask that/those individual(s) when there is some decision on how their resource will be used. I believe some of the messiness in data and proliferation of competing resources exists because individuals do not think often enough about working together or are unwilling to do the work needed to discuss issues and come to a consensus. I see this as another unrecorded, aspect of FAIRness/TRUST-worthiness: users should communicate with the maintainers of a resource. If all OMIM users respected OMIM's desire to consolidate to the MIM prefix there would be only 1 prefix and no need for any programmatic efforts to correct them, making all data annotated with their identifiers more FAIR.

@allenbaron
Copy link
Collaborator Author

Tagging @fschiettecatte so he's aware of this discussion and can chime in if needed (I hope you don't mind).

@allenbaron
Copy link
Collaborator Author

@sbello what might the Alliance need to make this change?

@lschriml, the only thing I can think of that we need to change, beyond the data in the ontology, is the loader that supports the tree views of the DO at https://disease-ontology.org/do. Can you help me identify other resources beyond the Alliance who might benefit from a heads up and help me set a timeframe for this change?

@allenbaron
Copy link
Collaborator Author

@fschiettecatte does capitalization matter? Any preference for MIM vs mim?

@allenbaron
Copy link
Collaborator Author

Do we want to change all references from OMIM to MIM? I'm wondering about the name of some files, e.g. DOreports/OMIMinDO.tsv, and references to OMIM in src/ontology/README_DO_Files.

@sbello
Copy link
Collaborator

sbello commented Feb 9, 2024

@allenbaron MGI and the Alliance and I suspect most other DO users would need to modify our loaders to recognize the new prefix. I don't think this would break the loads but it would mean that we would fail to load any OMIM IDs until the loader was updated so early warning would be good.
I think once we did that we wouldn't need to change anything on the front end.
Given that the OMIM website is called OMIM.org I don't know that you would need to change anything else.

@lschriml
Copy link
Contributor

lschriml commented Feb 9, 2024

Items to change in the DO files:
-- only change the xref prefix from OMIM to MIM

On the DO website
-- Advance Search
-- SPARQL sandbox
-- the loader that supports the tree views of the DO

Code:
-- SPARQL queries (src/sparql)
-- make test/make release
-- UMLS updating

OMIM susceptibility import

File names:
-- I would keep the file names OMIM

@allenbaron allenbaron added this to the Mar 2024 release milestone Feb 9, 2024
@allenbaron
Copy link
Collaborator Author

allenbaron commented Feb 9, 2024

-- I would keep the file names OMIM

I agree that it's probably best to change only the prefixes.

-- make test/make release

Nothing needs to be done for the DO build files/make rules since we aren't changing file names.

OMIM susceptibility import

This is one I'd forgotten about and it probably presents the biggest challenge because both the IDs and URIs are based on "OMIM" (e.g. OMIM:106300, http://purl.obolibrary.org/obo/OMIM_106300). It seems like we should change the URIs along with the IDs. I guess that's one more place that users may need to update but maybe it's not as big a deal since the URIs are not currently resolvable anyway? @lschriml @sbello thoughts?

On the DO website
-- Advance Search
-- SPARQL sandbox

Neither of these is dependent on prefixes, so we shouldn't need any updates for them. I'll only have to check the example SPARQL query examples we provide with the sandbox.

I decided to identify the use of "OMIM" for all non-OWL/OBO/JSON files in this repo (excluding files in the build, deprecated, and hidden directories). The files identified using this command are listed below along with the ontology edit/import files that need review/modification.

find . ! -name "*.owl" ! -name "*.obo" ! -name "*.json*" ! -path "*build*" ! -path "*deprecated*" ! -path "./.*" -type f -print0 | 
xargs -0 grep -c "OMIM" | 
grep -E "[1-9]+$"

Files in this repo:

  • src/sparql/DOreports/OMIMinDO.rq
  • src/ontology/imports/omim_susceptibility.owl
  • src/ontology/doid-edit.owl
    • DOreports/allXREFinDO.tsv - will be automatically fixed when updating doid-edit.owl file
    • DOreports/OMIMinDO.tsv - will be automatically fixed when updating doid-edit.owl file
  • src/ontology/README_DO_Files - only mentions that DO has OMIM xrefs, no need to update
  • CONTRIBUTING.md - only mentions that DO has OMIM xrefs, no need to update
  • RELEASES.md - OMIM in prior Github release notes, no need to update

Files outside this repo

  • disease-ontology.org OWL/OBO tree loader
  • UMLS updater (this one we can probably do as part of planned improvements)

allenbaron added a commit that referenced this issue Feb 9, 2024
allenbaron added a commit to DiseaseOntology/SPARQLqueries that referenced this issue Feb 9, 2024
In preparation for change in DO outlined in
DiseaseOntology/HumanDiseaseOntology#1301.
@fschiettecatte
Copy link

fschiettecatte commented Feb 12, 2024

@fschiettecatte does capitalization matter? Any preference for MIM vs mim?

Apologies for the slow reply, I was out through today.

We want upper case for 'MIM', see our linking help sections 1.8 and 1.9.

@lschriml
Copy link
Contributor

lschriml commented Feb 12, 2024 via email

@allenbaron
Copy link
Collaborator Author

We've decided to push this change back one additional month to the April release to provide more time for the many resources using DO to prepare for this change.

@allenbaron
Copy link
Collaborator Author

A test release, equivalent to official release v2024-02-28, with "OMIM:" prefixes replaced as "MIM:" has been created on the mim_test_release branch.

Test files are available at https://github.com/DiseaseOntology/HumanDiseaseOntology/tree/mim_test_release/src/ontology

Please distribute this information as widely as possible so that all DO users are prepared for the April release in which this change will be implemented. Please reach out if you need any assistance.

allenbaron added a commit to DiseaseOntology/DO.utils that referenced this issue May 28, 2024
plot_xref_counts() and get_url() will now be able to use
either "OMIM:" or "MIM:" input and still provide the expected
output.

inventory_omim() and generify_obo() will now forcibly convert
"OMIM:" to the preferred "MIM:" prefix in their output.

Changes prompted by DiseaseOntology/HumanDiseaseOntology#1301.
allenbaron added a commit to DiseaseOntology/DO.utils that referenced this issue May 28, 2024
Will ensure compatibility with future inventories of OMIM in the
DO, now that DO has moved to OMIM's preferred prefix.

See DiseaseOntology/HumanDiseaseOntology#1301.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment