RAP parallelisation orchestration engine template: optimised driver and worker scripts for genomic analyses on the cloud-based UK Biobank (UKBB) Research Analysis Platform (RAP)

aehrc/RAPpoet

RAPpoet

RAP parallelisation orchestration engine template: optimised driver and worker scripts for genomic analyses on the cloud-based UK Biobank (UKBB) Research Analysis Platform (RAP)

This repository contains template driver and worker scripts for steps 1-3 of a genomic analysis pipeline on RAP.

Included Scripts

For each step, a driver script (drive_N.sh) and a worker script (worker_N.sh) are included.

Driver Script: This script orchestrates the execution of the genomic analysis pipeline.

Worker Script: This script contains the necessary commands and functions for performing steps 1-3 of the pipeline.
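
The driver/worker split can be sketched as follows. This is a minimal illustration, not the repository's drive_N.sh: the per-chromosome fan-out, the use of the Swiss Army Knife app via `dx run`, the instance type, and the function names `build_job_cmd` and `plan_jobs` are all assumptions. The sketch only prints the submission commands so the plan can be inspected before anything is launched.

```shell
#!/usr/bin/env bash
# Hypothetical driver sketch: plans one worker job per chromosome.
# All names and values below are illustrative assumptions.
set -eu

STEP=1                        # pipeline step handled by this driver (assumed)
WORKER="worker_${STEP}.sh"    # worker naming follows the README's worker_N.sh
INSTANCE="mem1_ssd1_v2_x4"    # assumed instance type; adjust to the workload

# Print the job-submission command for one chromosome instead of running it.
build_job_cmd() {
  local chrom="$1"
  printf 'dx run swiss-army-knife -iin=%s -icmd="bash %s %s" --instance-type %s -y\n' \
    "$WORKER" "$WORKER" "$chrom" "$INSTANCE"
}

# Emit one command per autosome (1-22).
plan_jobs() {
  local chrom
  for chrom in $(seq 1 22); do
    build_job_cmd "$chrom"
  done
}

plan_jobs
```

Keeping the driver as a command planner means the same script can be reviewed as plain text first and only submitted once the printed commands look right.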

Pipeline Steps

Quality Control Step 1: This step involves sample filtering, variant filtering, normalisation, and variant renaming.
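
One way such a step could look is sketched below. The README does not name the tool used, so the choice of bcftools, the PASS-only variant filter, the `CHROM:POS:REF:ALT` naming scheme, and every file name are assumptions. The function `build_qc1_cmd` is hypothetical and only prints the pipeline.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of QC step 1 using bcftools (an assumed tool choice).
set -eu

# Print a three-stage pipeline: keep listed samples and PASS variants,
# split multiallelic sites and left-align, then rename variant IDs.
build_qc1_cmd() {
  local vcf="$1" samples="$2" ref="$3" out="$4"
  printf 'bcftools view -S %s -f PASS -Ou %s' "$samples" "$vcf"
  printf ' | bcftools norm -m -any -f %s -Ou' "$ref"
  printf " | bcftools annotate --set-id '%%CHROM:%%POS:%%REF:%%ALT' -Oz -o %s\n" "$out"
}

# Example with placeholder file names.
build_qc1_cmd chr22.vcf.gz keep_samples.txt ref.fa chr22.qc1.vcf.gz
```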

Quality Control Step 2: This step involves chunking VCFs, standard filtering (geno, maf, hwe), and generating PLINK format files.
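
The chunk-then-filter idea can be sketched as below. The thresholds, the 1 Mb window size, the region-based chunking through plink2's `--chr`/`--from-bp`/`--to-bp`, and the function names `chunk_ranges` and `build_qc2_cmd` are assumptions made for illustration; the repository's worker may chunk differently.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of QC step 2: split a chromosome into windows and
# print one plink2 filtering command per window.
set -eu

GENO=0.05   # assumed maximum per-variant missingness
MAF=0.01    # assumed minimum minor allele frequency
HWE=1e-6    # assumed Hardy-Weinberg p-value threshold

# Print "start end" pairs covering positions 1..length in windows of `size`.
chunk_ranges() {
  local length="$1" size="$2" start=1 end
  while [ "$start" -le "$length" ]; do
    end=$((start + size - 1))
    if [ "$end" -gt "$length" ]; then
      end="$length"
    fi
    echo "$start $end"
    start=$((end + 1))
  done
}

# Print the plink2 command that filters one window and writes bed/bim/fam.
build_qc2_cmd() {
  local vcf="$1" chrom="$2" start="$3" end="$4"
  printf 'plink2 --vcf %s --chr %s --from-bp %s --to-bp %s --geno %s --maf %s --hwe %s --make-bed --out chr%s_%s_%s\n' \
    "$vcf" "$chrom" "$start" "$end" "$GENO" "$MAF" "$HWE" "$chrom" "$start" "$end"
}

# Example: a 2.5 Mb region in 1 Mb windows gives three commands.
chunk_ranges 2500000 1000000 | while read -r s e; do
  build_qc2_cmd chr22.qc1.vcf.gz 22 "$s" "$e"
done
```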

Merging Files and Logistic Regression with PLINK2: This step involves merging the QC-filtered files into one file and running a plink2 logistic regression analysis.
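
A possible shape for this step is sketched below, assuming the per-chunk outputs are PLINK 1 binary filesets and that the phenotype is binary (plink2's `--glm` then fits a logistic model). The file names, the `hide-covar` modifier, and the helper functions are assumptions; the commands are printed rather than executed.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of step 3: merge QC-filtered filesets, then run --glm.
set -eu

# Write one fileset prefix per line, the layout --pmerge-list expects.
write_merge_list() {
  local list_file="$1"
  shift
  printf '%s\n' "$@" > "$list_file"
}

# Print the plink2 merge command for a list of bed/bim/fam prefixes.
build_merge_cmd() {
  local list_file="$1" out="$2"
  printf 'plink2 --pmerge-list %s bfile --make-bed --out %s\n' "$list_file" "$out"
}

# Print the association command; covariate rows are hidden in the output.
build_glm_cmd() {
  local bfile="$1" pheno="$2" covar="$3" out="$4"
  printf 'plink2 --bfile %s --pheno %s --covar %s --glm hide-covar --out %s\n' \
    "$bfile" "$pheno" "$covar" "$out"
}

# Example with placeholder prefixes and file names.
LIST="$(mktemp)"
write_merge_list "$LIST" chr21_qc chr22_qc
build_merge_cmd "$LIST" merged_qc
build_glm_cmd merged_qc pheno.txt covar.txt assoc
```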
