Example of Snakemake Workflow Implementation for Transcriptomics Tasks

Github Pipeline:

Build Status

Implemented Tasks:

Performed FastQC
Performed MultiQC
Trimmed Barcodes
Performed FastQC on Trimmed Sequences
Performed MultiQC on Trimmed Sequences

The biggest indicator of successful removal of adapters is the Adapter Content plot:

Before Trimming	After Trimming

Used STAR Aligner
Indexed BAM with Samtools Index, and FeatureCounts

Here we specify strandness:
0: Not stranded
1: Stranded
2: Reversely stranded

For Collibri, use 1 as it is stranded.
For KAPA, use 2 as it is reversely stranded.

Used DESeq2 to Perform Differential Expression (DE) Analysis Comparing UHRR vs HBR

As a result, obtained DE genes with p-adjusted values:

For Collibri:

And KAPA:

Also, volcano plots:

For Collibri:

And KAPA:

It has a Snakemake file, which will create the plot.

Performed PCA Using DE Genes

Plot for Collibri:

And KAPA:

It is clear that HBR and UHRR samples are separated into two distinct clusters.

More details and figures in separate deseq2.md.

Performed GSEA Using fgsea

We should use shrunken fold changes.

Performing on Reactome pathways did not give promising results, so performed on KEGG pathways.

Collibri showed slightly better results (with smaller p-adjusted values) than KAPA but found similar pathways. Some pathways were related to cancer:

KEGG_BASAL_CELL_CARCINOMA
KEGG_COLORECTAL_CANCER
KEGG_ENDOMETRIAL_CANCER
KEGG_WNT_SIGNALING_PATHWAY
KEGG_PATHWAYS_IN_CANCER
KEGG_REGULATION_OF_ACTIN_CYTOSKELETON

For Collibri:

For KAPA:

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github/workflows		.github/workflows
data		data
deseq2_files/figure-gfm		deseq2_files/figure-gfm
gsea		gsea
pathways		pathways
qc_plots		qc_plots
.gitignore		.gitignore
CITATION.cff		CITATION.cff
README.md		README.md
Snakefile		Snakefile
adapters.fa		adapters.fa
condition_treated_results.csv		condition_treated_results.csv
condition_treated_results_kapa.csv		condition_treated_results_kapa.csv
config.yaml		config.yaml
dag.svg		dag.svg
deseq.R		deseq.R
deseq2.Rmd		deseq2.Rmd
deseq2.md		deseq2.md
environment.yml		environment.yml
gsea.R		gsea.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Example of Snakemake Workflow Implementation for Transcriptomics Tasks

Github Pipeline:

Implemented Tasks:

DAG of Snakemake Workflow:

About

Releases 1

Packages

Languages

jarekrzdbk/snakemake-example

Folders and files

Latest commit

History

Repository files navigation

Example of Snakemake Workflow Implementation for Transcriptomics Tasks

Github Pipeline:

Implemented Tasks:

DAG of Snakemake Workflow:

About

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages