ssl-for-echocardiograms

Code for the MLHC 2021 paper: A New Semi-supervised Learning Benchmark for Classifying View and Diagnosing Aortic Stenosis from Echocardiograms

Demo

Setup

Download dataset

Please visit our website https://tmed.cs.tufts.edu/index.html and download the data

Install Anaconda

Follow the instructions here: https://conda.io/projects/conda/en/latest/user-guide/install/index.html

Create environment

conda env create -f environment.yml

Process the dataset

Folder Data_Processing provides script to process the raw png files into tfrecords, which can then be loaded to train the model. For example, if you want to create the tfrecord according to our suggested split1 on TMED-18-18

bash process_TMED_18_18_fold0.sh

Running experiments

The commands for reproducing major tables in the paper are provided in runs

Image-level predictions

For example, if you want to run a fully supervised baseline on our suggested split1 on the TMED-18-18

bash launch_fs_fold0.sh run_here

Patient-level predictions

We have provided the saved image level predictions in folder predictions, so that the patient level prediction code can run smoothly to reproduce our results in the paper.

For example, if you want to run the patient level prediction for MixMatch on our suggested split1 on TMED-156-52, go to runs/table7

bash launch_MixMatch_fold0.sh run_here

In practice, the user can save their own image level predictions to the folder and run the script instead.

Analysing results

The results including the training curves, validation and test balanced accuracy with and without ensemble etc, will be automatically saved under the results_analysis folder (created during training) under your specified train_dir in the bash script.

A note on reproducibility

While the focus of our paper is reproducibility, ultimately exact comparison to the results in our paper will be conflated by subtle differences such as the version of TensorFlow used, random seeds, etc.

Citing this work

@inproceedings{huangSemisupervisedEchocardiogramBenchmark2021, title = {A New Semi-supervised Learning Benchmark for Classifying View and Diagnosing Aortic Stenosis from Echocardiograms}, booktitle = {Machine Learning for Healthcare}, author = {Huang, Zhe and Long, Gary and Wessler, Benjamin and Hughes, Michael C.}, year = {2021}, }

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
Data_Processing		Data_Processing
models/TMED-156-52		models/TMED-156-52
predictions/ImageLevel_predictions		predictions/ImageLevel_predictions
runs		runs
sample		sample
split_info/TMED-156-52		split_info/TMED-156-52
src		src
.gitignore		.gitignore
LICENSE		LICENSE
LoadCheckpoint_Demo.ipynb		LoadCheckpoint_Demo.ipynb
LoadData_Visualize.ipynb		LoadData_Visualize.ipynb
README.md		README.md
environment.yml		environment.yml

License

tufts-ml/ssl-for-echocardiograms

Folders and files

Latest commit

History

Repository files navigation