PARSEC (imPutAtion for spaRSE sequenCing) is a bioinformatics pipeline designed to genotype large populations using low coverage sequencing data.
It relies on bcftools mpileup
to detect SNP sites and stitch
to impute genotypes.
The pipeline is still in early development
- Index bams (
SAMtools
) - Prepare fixed size genomic chunks (
bedtools
) - Optionnal : call variants from sparse data
- Impute genotypes (
stitch
- Index vcf (
Tabix
) - Concatenate vcf files (
bcftools
) - Sort vcf (
bcftools
)
Note If you are new to Nextflow and nf-core, please refer to this page on how to set-up Nextflow. Make sure to test your setup with
-profile test
before running the workflow on actual data.
An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md
file.
This pipeline uses code and infrastructure developed and maintained by the nf-core community, reused here under the MIT license.
The nf-core framework for community-curated bioinformatics pipelines.
Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.
Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.