Skip to content

yotammarton/bioinformatics_course

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

236523 - Introduction to Bioinformatics

image

Ben Filiarsky , Yotam Martin

Abstract

Schizophrenia (SZ) and Bipolar Disorder (BD) are serious neuropsychiatric disorders, severely harming patients' life quality. In this work, we perform a GWAS on RNAseq reads obtained from postmortem brain samples, from three different brain regions. We perform differential expression analysis (DEA) and successfully recover results of the paper our work is based on (Hu et al., 2016). As a part of our analysis, we compare DEA results with no normalization by housekeeping genes (HKG), with normalization by a general HKG set and with a gene set specifically tailored for postmortem brain samples. We show that using a custom-fit gene set for such correction is beneficial and encourage future usage of specifically tailored gene sets for correction. Significantly up and down regulated genes are then enriched using KEGG enrichment, resulting in a substantial enrichment of the MAPK signaling pathway in SZ samples in the BA9 brain region, a pathway that had been previously linked to SZ. An additional weighted gene co-expression analysis (WGCNA) combined with clustering was performed, followed by functional enrichment (GO and DO) of each cluster. In all the experiments we found a cluster with a significantly enriched term that had been tightly connected to SZ and BD development in previous works. We finish our work with raising ethical concerns regarding usage of such research in future applications. Our code repository and all the data for this work as well as results and plots are available via GitHub.

Project tree:

.
├── Differential Expression Analysis                              | DEA results
│   ├── Differential Expression results brain norm                | DEA results with brain HKG normalization
│   │   ├── exp1 brain schizophrenia VS. normal BA9 .csv          | 
│   │   ├── ...
│   │   └── exp6 brain bipolar_disorder VS. normal BA24 .csv
│   ├── Differential Expression results general norm              | DEA results with general HKG normailzation
│   │   ├── exp1 general schizophrenia VS. normal BA9 .csv
│   │   ├── ...
│   │   └── exp6 general bipolar_disorder VS. normal BA24 .csv
│   ├── Differential Expression results no norm                   | Initial results with no HKG correction
│   │   ├── exp1 no_norm schizophrenia VS. normal BA9 .csv
│   │   ├── ...
│   │   └── exp6 no_norm bipolar_disorder VS. normal BA24 .csv
│   └── DifferentialExpressionPlots                               | 📊📊 Plots for DEA
│       ├── Heatmap_down                                          | pheatmap of the down regulated significant genes
│       │   ├── exp1
│       │   │   ├── brain.tiff
│       │   │   ├── general.tiff
│       │   │   └── no_norm.tiff
│       │   └── ...
│       ├── Heatmap_up                                            | pheatmap of the up regulated significant genes
│       ├── Kmeans_tSNE                                           | Kmeans (k=2) results 
│       ├── QQ                                                    | QQ-plots of the adjusted pvalues
│       └── Volcano                                               | Volcano plots with annotation of significant genes
├── Differential Expression Analysis.Rmd                          | 💻💻 Rmd Code to reproduce DEA
├── E-GEOD-78936-atlasExperimentSummary.Rdata                     | 💿💿 Original data
├── E-GEOD-78936-query-results.tsv                                | 💿💿 Original data
├── E-GEOD-78936-raw-counts.tsv                                   | 💿💿 Original data
├── Pathway Enrichment Analysis                                   | Results of pathfindR enrichment results
│   └── pathfindR
│       ├── brain_norm
│       │   └── pathfindR_Results_exp_1_brain
│       ├── general_norm
│       │   └── pathfindR_Results_exp_1_general
│       └── no_norm
│           ├── pathfindR_Results_exp_1
│           ├── pathfindR_Results_exp_1_3_5
│           ├── pathfindR_Results_exp_3
│           └── pathfindR_Results_exp_5
├── Pathway Enrichment Analysis.Rmd                               | 💻💻 Rmd Code to reproduce enrichment analysis
├── Weighted Gene Co-Expression Analysis                          | 🌐🌐 Results for WGCNA 
│   ├── Nets
│   └── Plots                                                     | 📊📊 Plots for GO/DO enrichment
│       ├── DO
│       ├── GO
├── Weighted Gene Co-Expression Analysis.Rmd                      | 💻💻 Rmd Code to reproduce WGCNA
├── compare_results_to_paper.py                                   | 🐍 Comparing results to Hu et al. (2016) 
├── find_lncRNA.py                                                | 🐍 Arrange found genes table in DEA
├── genes_transcript_type_ncbi_kegg.csv                           | 🐍 Arrange found genes table in DEA
├── genes_transcript_type_ncbi_kegg_unique.csv                    | 🐍 Arrange found genes table in DEA
├── housekeeping_brain.csv                                        | Brain HKG list
├── housekeeping_general.csv                                      | General HKG list
├── general and brain keeping genes processing.py                 | 🐍 Organize HKG lists
├── significant_genes_by_experiment_brain.csv                     | Detailed table of significant genes (Exp1-6, brain HKG normalization)
├── significant_genes_by_experiment_general.csv                   | Detailed table of significant genes (Exp1-6, general HKG normalization)
├── significant_genes_by_experiment_no_norm.csv                   | Detailed table of significant genes (Exp1-6, no HKG normalization)
├── significant_genes_exp_1_to_6.py                               | 🐍 Find the significant genes from Exp1-6
├── up_down_reg_genes_per_experiment.csv                          | Table of significant genes (Exp1-6, no HKG normalization)
├── up_down_reg_genes_per_experiment_brain.csv                    | Table of significant genes (Exp1-6, brain HKG normalization)
└── up_down_reg_genes_per_experiment_general.csv                  | Table of significant genes (Exp1-6, general HKG normalization)

Releases

No releases published

Packages

No packages published