forked from jmw86069/splicejam
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME.Rmd
148 lines (118 loc) · 5.14 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
---
output: github_document
---
<!-- README.md is generated from README.Rmd. Please edit that file -->
```{r knitr_init, echo=FALSE}
knitr::opts_chunk$set(
collapse=TRUE,
warning=FALSE,
message=FALSE,
comment="#>",
fig.path="man/figures/README-"
);
```
# splicejam <img src="man/figures/splicejam_logo.png" width="133px" height="154px" align="right" style="padding-left:10px;background-color:white;" />
Splicejam was created to analyze and visualize RNA-seq
and transcript isoform splicing data.
Splicejam aims to provide sashimi plots with enough
customizations to support publication-quality figures.
An example sashimi plot is shown below for the gene *Gria1*.
Each panel shows transcript expression in a region of mouse
hippocampus, where "CA1_CB" shows data that originated from
cell bodies of CA1, and "CA2_CB" shows corresponding data
from cell bodies of CA2.
<br>
```{r, gria1_sashimi, echo=FALSE, fig.height=8, fig.width=14}
if (suppressPackageStartupMessages(require(splicejam))) {
data(demodata);
## Prepare sashimi data
sashimi <- prepareSashimi(gene="Gria1",
flatExonsByGene=demodata$exons,
filesDF=demodata$filesDF,
tx2geneDF=demodata$tx2geneDF,
minJunctionScore=500,
colorSub=demodata$colorSub,
addGaps=FALSE);
## Create the sashimi plot
plotSashimi(sashimi) +
xlab("chr11")
}
```
<figure>
<img src="man/figures/mouse_hippocampus.png" width="25%" height="25%" align="right" style="padding-left:10px;background-color:white;" />
<figcaption>
A depiction of the CA1 and CA2 regions of mouse hippocampus
is shown in Fig 4a from
<a href="https://dx.doi.org/10.1038/nrn.2015.22">Dudek et al. Nature Reviews Neuroscience(2016)</a>
</figcaption>
</figure>
<br>
## Features of a splicejam sashimi plot:
* The dark color polygons represent RNA transcript
sequence coverage for exons in the Gria1 gene.
* Wide ribbons are drawn to indicate splice junctions,
areas where the sequences are aligned with a wide gap,
typically spanning two exons.
* The thickness of the ribbon (in height) represents
the number of aligned reads that span the gap,
usually about 70-80% the height of the adjacent exons.
* The ribbons are shaded light to dark based upon how
predominant the splice junction is from exon to exon.
The darkest ribbons represent the most predominant path
from exon to exon through the gene.
* Introns are drawn with a fixed width (along the x-axis)
and do not use genome coordinates. Introns are
normally about 100 times wider than exons, and when drawn
to scale obscure the detailed coverage of the exons.
The x-axis labels indicate genome coordinates.
## Sashimi plot at closer range
The x-axis range is adjusted in the following plot
to show more detail around the differentially spliced exons.
Notice the relative heights of the exons differ across CA1 and CA2,
consistent with the corresponding changes in splice junctions.
Clearly CA1 and CA2 favor different and mutually exclusive
exons for Gria1.
```{r, gria1_sashimi_zoom, echo=FALSE, fig.height=6, fig.width=8}
buffer <- 25;
new_coords <- c(57289248+buffer, 57317800-buffer);
plotSashimi(sashimi,
label_coords=new_coords) +
coord_cartesian(xlim=new_coords) +
xlab("chr11")
```
Incidentally, these two isoforms of Gria1 represent the
["flop" and "flip"](https://www.sciencedirect.com/topics/neuroscience/gria1)
forms of the AMPA receptor complex. In human, the genes Gria2,
Gria3, and Gria4 comprise the AMPA receptor complex, and each
genes has two isoforms with "flop" and "flip" designations.
## Package Reference
A full online function reference is available via the pkgdown
documentation:
Full splicejam command reference:
[https://jmw86069.github.io/splicejam](https://jmw86069.github.io/splicejam)
The splicejam package is part of a suite of R packages
called "jampack" which is available through
GitHub, see [https://github.com/jmw86069/jampack](https://github.com/jmw86069/jampack)
## How to install
Install using the R package `remotes` and this command:
```
remotes::install_github("jmw86069/splicejam")
```
## About sashimi plots
Sashimi plots were originally envisioned by MISO, which introduced
the innovative idea to compress intron width in order to reveal
the detailed exon sequence coverage, and to highlight changes in
splice junctions.
Katz, Y, Wang ET, Silterra J, Schwartz S, Wong B, Thorvaldsdóttir H, Robinson JT, Mesirov JP, Airoldi EM, Burge, CB.
*Sashimi plots: Quantitative visualization of alternative isoform expression from RNA-seq data.*
http://biorxiv.org/content/early/2014/02/11/002576
Splicejam re-envisions classic sashimi plots by displaying splice junctions
as wide arcs proportional in size to the number of junction reads,
making them directly comparable to exon coverages on the same plot.
Inspired by:
* [Sankey plots](https://en.wikipedia.org/wiki/Sankey_diagram),
* [flow diagrams](https://en.wikipedia.org/wiki/Flow_diagram),
* [Alluvial diagrams](https://en.wikipedia.org/wiki/Alluvial_diagram).
These ribbons were ultimately made feasible to implement in R thanks to
the package [ggforce](https://ggforce.data-imaginist.com/) which
implemented [geom_diagonal_wide](https://ggforce.data-imaginist.com/reference/geom_diagonal_wide.html).