
cutadapt removes adapter sequences from sequencing reads

mkierczak/cutadapt

 
 


Cutadapt

Cutadapt finds and removes adapter sequences, primers, poly-A tails and other types of unwanted sequence from your high-throughput sequencing reads.

Cleaning your data in this way is often required: Reads from small-RNA sequencing contain the 3’ sequencing adapter because the read is longer than the molecule that is sequenced. Amplicon reads start with a primer sequence. Poly-A tails are useful for pulling out RNA from your sample, but often you don’t want them to be in your reads.

Cutadapt helps with these trimming tasks by finding the adapter or primer sequences in an error-tolerant way. It can also modify and filter reads in various ways. Adapter sequences can contain IUPAC wildcard characters. Paired-end reads and even colorspace data are supported. If you want, you can also just demultiplex your input data without removing adapter sequences at all.
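To make "error-tolerant" matching with IUPAC wildcards more concrete, here is a minimal Python sketch of 3' adapter trimming. It is not Cutadapt's implementation: it counts substitutions only, whereas a real aligner would also need to handle insertions and deletions. The names `trim_3prime_adapter` and `count_mismatches` and the parameter values are hypothetical and chosen for illustration.

```python
# Illustrative sketch only; NOT Cutadapt's actual algorithm or API.
# Assumption: errors are substitutions only (no insertions/deletions).

# IUPAC nucleotide codes mapped to the bases each one may stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def count_mismatches(adapter_part, read_part):
    """Count positions where the read base is not allowed by the adapter code."""
    return sum(1 for a, r in zip(adapter_part, read_part) if r not in IUPAC[a])


def trim_3prime_adapter(read, adapter, max_error_rate=0.1, min_overlap=3):
    """Return the read with a 3' adapter (full or partial) removed.

    The adapter may be cut off by the end of the read, so at each start
    position only the overlapping prefix of the adapter is compared.
    """
    for start in range(len(read) - min_overlap + 1):
        overlap = min(len(adapter), len(read) - start)
        allowed = int(max_error_rate * overlap)  # errors tolerated for this overlap
        if count_mismatches(adapter[:overlap], read[start:start + overlap]) <= allowed:
            return read[:start]  # keep only the sequence before the adapter
    return read  # no adapter found; leave the read unchanged


if __name__ == "__main__":
    print(trim_3prime_adapter("ACGTACGTAGATCGGAAGAGC", "AGATCGGAAGAGC"))
    print(trim_3prime_adapter("ACGTACGTAGATCGGTAGAGC", "AGATCGGNAGAGC"))
```

Scaling the number of tolerated errors with the overlap length means short partial matches at the end of a read must be exact, while a full-length adapter may contain a mismatch.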

Cutadapt comes with an extensive suite of automated tests and is available under the terms of the MIT license.

If you use Cutadapt, please cite DOI:10.14806/ej.17.1.200.
