LCV (Latent Causal Variable model)

LCV is a method for inferring genetically causal relationships using GWAS data.

LCV is implemented in Matlab and R. In order to run LCV, you will need LD scores (non-stratified, with ancestry matching your GWAS data), which can be downloaded here. You can also compute your own LD scores using the LDSC software. You will also need signed summary statistics: either effect size estimates (in units of per-normalized-genotype effect size) or Z scores.

Usage of each function is described within the source code. There are example simulation scripts in Matlab and R, and an example script to run on real data in R.

Details and potential issues

The summary statistics and LD scores must be sorted by genomic position, as LCV uses a block-jackknife procedure to compute standard errors; if consecutive SNPs are not approximately contiguous, standard errors will be underestimated.
Your datasets should have approximately the same ancestry as each other, and with the LD scores. For example, it would be fine to use one UK Biobank dataset and one dataset which is a European meta-analysis, but don't try to use a European dataset with an East Asian one.
We recommend using SNPs with allele frequency greater than 0.05; adding additional SNPs will probably cause decreased power unless you assign them lower regression weights.
We recommend removing the MHC region in all analyses.

Changes in most recent update

A bug in the R implementation (specifically, the WeightedRegression function) was fixed. Previously this implementation would give incorrect estimates whenever the weights vector is not uniform.
The sign of the Z scores no longer depend on the sign of the genetic correlation. If you run RunLCV(LDscores,Z.1,Z.2), you will now get the same Z score (but opposite genetic correlation) as if you run RunLCV(LDscores,Z.1,-Z.2).
Error handling and warnings are now the same for both implementations.
The Matlab implementation now outputs a single data structure rather than a long list of output arguments. The R data structure output was also modified.
There is a new R example script - thank you to Katie Siewert for supplying it.
Removed run_LCV_parallel.m because it only runs a few times faster.

Reference:

O'Connor, L.J. and A.L. Price. "Distinguishing genetic correlation from causation across 52 diseases and complex traits." Nature genetics (2018).

Non-paywalled link: https://rdcu.be/bajzC

Contact: loconnor@broadinstitute.org

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Matlab		Matlab
R		R
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Matlab

Matlab

R

R

README.md

README.md

Repository files navigation

LCV (Latent Causal Variable model)

Contents:

Details and potential issues

Changes in most recent update

About

Releases

Packages

Languages

lukejoconnor/LCV

Folders and files

Latest commit

History

Repository files navigation

LCV (Latent Causal Variable model)

Contents:

Details and potential issues

Changes in most recent update

About

Resources

Stars

Watchers

Forks

Languages