Skip to content

cns-iu/ccf-research-kaggle-2021

Repository files navigation

Segmentation of human functional tissue units at scale

The Human BioMolecular Atlas Program aims to compile a reference atlas for the healthy human adult body at the cellular level. Functional tissue units (FTU, e.g., renal glomeruli and colonic crypts) are of pathobiological significance and relevant for modeling and understanding disease progression. Yet, annotation of FTUs is time consuming and expensive when done manually and existing algorithms achieve low accuracy and do not generalize well. This paper compares the five winning algorithms from the “Hacking the Kidney” Kaggle competition to which more than a thousand teams from sixty countries contributed. We compare the accuracy and performance of the algorithms on a large-scale renal glomerulus Periodic acid-Schiff stain dataset and their generalizability to a colonic crypts hematoxylin and eosin stain dataset. Results help to characterize how the number of FTUs per unit area differs in relationship to their position in kidney and colon with respect to age, sex, BMI, and other clinical data and are relevant for advancing pathology, anatomy, and surgery.

The repo is structured in the following way:

├── models
│   ├── 1-Tom
│   └── 2-Gleb
│   └── 3-Whats goin on
│   └── 4-Deeplive.exe
│   └── 5-Deepflash2
├── supporting-information
├── thumbnails
├── utils

Data

All data (Images, ground truth masks, and predictions) are available as a Zenodo Dataset and can be downloaded from https://doi.org/10.5281/zenodo.7729609.

The trained models are available as a Zenodo Dataset and can be downloaded from https://doi.org/10.5281/zenodo.7730027.

The HuBMAP kidney data (30 WSIs) is also available as a HuBMAP collection.

In addition to this Guthub repo, a version of code (at the time of publication) is also available on Zenodo at https://doi.org/10.5281/zenodo.7730067.

Models

The repository contains 5 models:

  1. Tom (1st prize)
  2. Gleb (2nd prize)
  3. Whats goin on (3rd prize)
  4. Deeplive.exe (1st Judges prize)
  5. Deepflash2 (2nd Judges prize)

Visualization

The ground truth and predictions from the five winning models are visualized using Vitessce for 10 PAS-stained kidney tissue images from the test set at the HuBMAP Data Portal's Publication Page.