Skip to content
@Imageomics

Imageomics Institute

Imageomics: Bringing machine learning to life.

The Imageomics Institute GitHub organization hosts the development and distribution of a collection of open-source ML tools used to study the biological information encoded in images and videos integrated with structured biological knowledge.

What is the Imageomics Institute?

The Imageomics Institute is funded by the US National Science Foundation's Harnessing the Data Revolution (HDR) program under Award #2118240 (Imageomics: A New Frontier of Biological Information Powered by Knowledge-Guided Machine Learning). It started in Oct 2021.

You can find a full mission, vision, and abstract under the Imageomics website's About page. In short, the vision of the Institute is to "establish a new scientific field called imageomics that harnesses revolutions in data science and computing, as well as the rapidly expanding collections of biological image data, in order to accelerate biological understanding of phenotypic traits extracted from images of organisms."

History

The inception and research of the Imageomics Institute builds heavily on the "Biology-Guided Neural Networks for Discovering Phenotypic Traits" (BGNN) project, also funded by the US National Science Foundation. BGNN itself built in part on the Phenoscape project (funded by NSF multiple times), which started in 2007 and was incubated at the NSF-funded National Evolutionary Synthesis Center (NESCent).

Code repositories overview

Due to the history (see above) and highly collaborative and cross-disciplinary nature of the Institute, important software products and other code repositories are distributed over several organizations in GitHub, in addition to the ones found here. The following gives an overview and useful links.

Imageomics Institute

Institute collaborators


Disclaimer: Any opinions, findings and conclusions or recommendations expressed in the materials here are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Pinned

  1. INTR INTR Public

    This is an official implementation for [ICLR'24] INTR: Interpretable Transformer for Fine-grained Image Classification.

    Jupyter Notebook 25 2

  2. bioclip bioclip Public

    This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral].

    Python 63 1

  3. Andromeda Andromeda Public

    A website that enables users to explore high-dimensional image data

    Jupyter Notebook 2 1

  4. dashboard-prototype dashboard-prototype Public

    Prototype data dashboard for Imageomics Data

    Python 5 2

  5. pybioclip pybioclip Public

    Python package that simplifies using the BioCLIP foundation model.

    Python 4

  6. data-workshop-AH-2024 data-workshop-AH-2024 Public

    Repository for Mastering Data Management Workshop at the All Hands 2024.

    Jupyter Notebook 1 2

Repositories

Showing 10 of 46 repositories
  • distributed-downloader Public

    MPI-based distributed downloading tool for retrieving data from diverse domains.

    1 MIT 0 2 0 Updated May 22, 2024
  • dashboard-prototype Public

    Prototype data dashboard for Imageomics Data

    Python 5 MIT 2 8 (1 issue needs help) 1 Updated May 22, 2024
  • sum-buddy Public

    Package to generate CSV with filepath, filename, checksum for all contents of given directory.

    Python 1 MIT 0 5 0 Updated May 21, 2024
  • char-sim Public

    Pipeline to create model for comparing character state descriptions including ontology similarity

    Python 0 MIT 1 0 0 Updated May 21, 2024
  • LatLonCover Public

    Land usage descriptions for neighborhoods around given lat/long

    Jupyter Notebook 2 MIT 0 4 1 Updated May 21, 2024
  • snakemake-workshop Public

    Lesson on creating a Snakemake Workflow with an Imageomics emphasis

    0 0 4 0 Updated May 20, 2024
  • bioclip Public

    This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral].

    Python 63 1 2 0 Updated May 17, 2024
  • parquet-cli Public

    An Apptainer image for more convenient use of Parquet files at the command line.

    0 MIT 0 2 0 Updated May 16, 2024
  • pybioclip Public

    Python package that simplifies using the BioCLIP foundation model.

    Python 4 MIT 0 4 0 Updated May 16, 2024
  • Andromeda Public

    A website that enables users to explore high-dimensional image data

    Jupyter Notebook 2 MIT 1 7 1 Updated May 10, 2024