cardioqtl

Installation

I used conda and bioconda to manage software dependencies. To replicate the computing environment, complete the following four steps. Note that this is only guaranteed to work on a 64-bit Linux (linux-64) system.

1. Download and install Miniconda (instructions)
2. Download environment file. If you cloned the Git repo, you already have the environment file.
   - conda install anaconda-client; anaconda download jdblischak/cardioqtl
3. Create conda environment:
   - conda env create -n cardioqtl --file environment.yaml
4. Activate conda environment:
   - To activate: source activate cardioqtl
   - To deactivate: source deactivate cardioqtl

Code

The code is freely available for reuse with attribution via the MIT license.

Snakefile - Implements the analysis pipeline
submit-snakemake.sh - Submits individual jobs produced in Snakefile to Slurm. If your cluster uses a different job scheduler, you'll need to edit this file and cluster.json.
scripts/ - R scripts called by Snakefile
scratch/ - Exploratory analyses written in R Markdown

Data

data/counts-subread.txt - Gene counts after mapping to GRCh37 with Subjunc and summing counts per gene with featureCounts (Subread 1.5.0p3). Includes all genes in Ensembl release 75 (i.e. protein coding plus all other biotypes; see scripts/create-exons.R).
data/counts-clean.txt - Gene counts after removing samples 26302, 110232, and 160001 and removing genes with log2 cpm less than 0 (see scripts/clean-counts.R).
data/counts-normalized.txt - Gene counts after normalizing to N(0,1) within each sample followed by normalizing to N(0,1) within each gene. Used the R function qqnorm (see scripts/normalize-counts.R).
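
The two-step normalization behind data/counts-normalized.txt can be sketched roughly as follows. This is an illustration in Python, not the code in scripts/normalize-counts.R, which uses R's qqnorm. The function names (ppoints, to_standard_normal, normalize_counts) and the list-of-lists layout (rows are genes, columns are samples) are assumptions made for this example. The rank-to-quantile mapping is meant to mirror qqnorm's default behavior, including breaking ties by position; whether the actual script handles ties the same way is not stated here.

```python
from statistics import NormalDist


def ppoints(n):
    """Plotting positions (i - a) / (n + 1 - 2a), following R's ppoints()."""
    a = 3 / 8 if n <= 10 else 1 / 2
    return [(i - a) / (n + 1 - 2 * a) for i in range(1, n + 1)]


def to_standard_normal(values):
    """Replace each value with the N(0,1) quantile of its rank.

    Ties are broken by position (Python's sort is stable), so every
    input value receives a distinct quantile.
    """
    n = len(values)
    quantiles = [NormalDist().inv_cdf(p) for p in ppoints(n)]
    order = sorted(range(n), key=lambda i: values[i])
    result = [0.0] * n
    for rank, idx in enumerate(order):
        result[idx] = quantiles[rank]
    return result


def normalize_counts(matrix):
    """Normalize within each sample (column), then within each gene (row).

    matrix is a hypothetical list of rows: one row per gene, one column
    per sample.
    """
    n_genes = len(matrix)
    n_samples = len(matrix[0])
    # Step 1: normalize across genes within each sample.
    columns = [
        to_standard_normal([matrix[g][s] for g in range(n_genes)])
        for s in range(n_samples)
    ]
    by_sample = [[columns[s][g] for s in range(n_samples)] for g in range(n_genes)]
    # Step 2: normalize across samples within each gene.
    return [to_standard_normal(row) for row in by_sample]


if __name__ == "__main__":
    toy = [[10, 200, 30], [5, 100, 60], [1, 300, 90]]
    for row in normalize_counts(toy):
        print([round(x, 3) for x in row])
```

After the second step, each gene's values across samples are exactly the standard normal quantiles for that number of samples, so every gene has the same marginal distribution.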
