Skip to content

A Bioconductor R-package for cell evaluation, characterisation and authentication using high throughput sequencing data.

License

Notifications You must be signed in to change notification settings

fasterius/seqCAT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

seqCAT

Overview

Anaconda Cloud version License: MIT Build status Coverage Status

The High Throughput Sequencing Cell Authentication Toolkit (seqCAT) is an R-package for authenticating, evaluating and characterisation of cells using single nucleotide variants (SNVs) from sequencing data. Its input data should be on the form of VCF files, i.e. output from variant callers such as the Genome Analysis ToolKit and annotated with software such as SnpEff.

Installation

The seqCAT package is available on both Bioconductor and here on GitHub. You can install the latest, stable version from Bioconductor like so:

# install.packages("BiocManager")
BiocManager::install("seqCAT")

If you are interested in the development version of seqCAT, you can install it from GitHub:

# install.packages("devtools")
devtools::install_github("fasterius/seqCAT")

You may also install seqCAT using Conda:

conda install -c bioconda bioconductor-seqcat

To list the versions of seqCAT available on Conda, you can use the search functionality:

conda search -c bioconda bioconductor-seqcat

Usage

The general workflow of seqCAT consists of three steps:

1.  Creation of SNV profiles
2.  Comparisons of SNV profiles
3.  Evaluation of profile comparisons
# Load the package
library("seqCAT")

# Path to the example VCF file
vcf <- system.file("extdata", "example.vcf.gz", package = "seqCAT")

# Create SNV profiles
hct116 <- create_profile(vcf, "HCT116")
hke3 <- create_profile(vcf, "HKE3")
rko <- create_profile(vcf, "RKO")

# Compare all profiles to each other
profiles <- list(hct116, hke3, rko)
comparisons <- compare_many(profiles)

# Create an heatmap of comparisons and their similarity scores
plot_heatmap(comparisons[[1]])

Example heatmap

For more detailed instructions on how to use seqCAT, please see the vignette.

Citation

If you are using seqCAT to analyse your data, please cite the following article:

seqCAT: a Bioconductor R-package for variant analysis of high throughput sequencing data
Fasterius E. and Al-Khalili Szigyarto C.
F1000Research (2018), 7:1466
https://f1000research.com/articles/7-1466

License

The seqCAT package is released with a MIT licence and is a free software: you may redistribute and/or modify it under the terms of the license. For more information, please see the LICENCE file.

About

A Bioconductor R-package for cell evaluation, characterisation and authentication using high throughput sequencing data.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages