Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

**DON'T MERGE -- ARCHIVE ** Build overhaul v4.0.0 #122

Draft
wants to merge 115 commits into
base: master
Choose a base branch
from

Conversation

callahantiff
Copy link
Owner

@callahantiff callahantiff commented Dec 8, 2021


🛑 DO NOT USE THIS BRANCH OR MODIFY THIS PR -- CONTENT IS KEPT FOR NOTES 🛑


Purpose

This PR addresses several issues and overhauls many aspects of the current build, which is described in more detail below. The primary changes made impact the amount, type, and storage of metadata at both the node- and triple-level.

Issues Addressed by PR

Scripts Impacted

  • owlnets.py.
    • Updated to fix the prior bad assumption about classes and axioms built using UnionOf constructors
  • metadata.py
    • Get new functionality for processing Biolink types
  • edge_list.py
    • Get new functionality for adding Bioregistry identifiers
  • utils/data_utils.py

Data Sources/Documentation Impacted

  • edge_source_list.txt
    • Added back chemical-rna edge data
  • resource_info.txt
    • Updated metadata for many of the edges, most often in an effort to soften the initial formatting that was applied to the data (i.e., having a more liberal and inclusive build, but providing the user with the ability to enforce specific filtering choices)
  • added back information for the chemical-rna edge

Notebooks Impacted

  • OWLNETS_Example_Application.ipynb
  • Data_Preparation.ipynb

Output Impacted

  • All output files will be g-zipped in order to improve resource use

Other Updates

  • The following Wiki pages have been udated:
    • v2-Data-Sources
      • Updated to included better descriptions
    • KG Construction
      • Section describing the KG output has been updated to note that all output are g-zipped
    • OWL-NETS 2.0
      • Section describing the KG output has been updated to note that all output are g-zipped

@sonarcloud
Copy link

sonarcloud bot commented Apr 4, 2022

Kudos, SonarCloud Quality Gate passed!    Quality Gate passed

Bug A 0 Bugs
Vulnerability A 0 Vulnerabilities
Security Hotspot A 0 Security Hotspots
Code Smell A 3 Code Smells

95.7% 95.7% Coverage
0.0% 0.0% Duplication

@callahantiff callahantiff changed the title Build overhaul v4.0.0 **DON'T MERGE -- ARCHIVE ** Build overhaul v4.0.0 Mar 12, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
coding enhancement New feature or request release v4.0.0 noting work and issues related to release v4.0.0
Projects
Knowledge Graph tasks
  
Awaiting triage
1 participant