Skip to content

stuchalk/fair-health-data

Repository files navigation

FAIR Health Data

Personal health data for Stuart J. Chalk made available using the FAIR principles.

What is this data?

As an example of creating FAIR (Findable, Accessible, Interoperable, Reusable) data for a seminar I presented on April 21st, 2021 (included PowerPoint file), I decided to collect and publish my health data, starting January 1, 2021. The data consists of:

  • body weight
  • body temperature (with room temperature as a 'control')
  • resting heart rate
  • data from my daily runs including
    • VO2 max
    • calories burned
    • distance
    • average pace
    • a link to the activity (only one is open access due to privacy concerns)

The data is collected in an MS Excel spreadsheet exported to a .csv text file and then processed through a Python script run in a Juypter notebook.

The generated JSON-LD (JSON for Linked Data) files are uploaded every month, and a new tagged version of the data is generated. Using Zenodo, each release of this repository gets an automatically assigned Digital Object Identifier (DOI).

The .csv data file, Juypter notebook, and python scripts are available as part of the dataset. Finally, a list of FAIR resources are available here.

For Findability

Each dataset has a DOI assigned through Zenodo:

  • January 2021 dataset DOI
  • January-Feburary 2021 dataset DOI
  • January-March 2021 dataset DOI

For Accessibility

The data is made available for free via this GitHub repository

For Interoperability

The data is made available in JSON-LD format, a text-based human-readable encoding of RDF, easily ingested via any scripting language.

For Reusabililty

The data is made available under the Creative Commons CC-BY-NC-SA 4.0 International License.