- Trim fastq files - Trimmomatic
- Reference based mapping - Bowtie2
- Add or replace read groups in bam file - Picard
- This tool enables the user to replace all read groups in the INPUT file with a single new read group and assign all reads to this read group in the OUTPUT BAM file
- Mark duplicates - Picard
- Identifies duplicate reads
- This tool locates and tags duplicate reads in a BAM or SAM file, where duplicate reads are defined as originating from a single fragment of DNA
- Bam file must be indexed for the following GATK command
- Merge bam files - samtools merge
- this command is necessary to run Freebayes
- Freebayes requires one bam file, but it can be merged
- Run Freebayes to call variants on bam files - Freebayes
- Filter SNPs and Indels from VCF file output from Freebayes
- gatk SelectVariants
- gatk Variant Filtration
- Recalibrate base quality score - gatk BaseRecalibrator
- Apply BQSR reports into the bam files - gatk ApplyBQSR
- Merge recalibrated bam files - samtools merge
- Run Freebayes to call variants on recalibrated bam files
- Filter vcf file from freebayes for SNPs only - vcffilter
- Compressing and indexing the snp only vcf & variant normalization of the snp only vcf & decompressing the vcf
- compress vcf - bgzip
- index vcf - tabix
- normalize snp vcf - bcftools norm
- decompress the normalized vcf - bgzip
- Filter normalized vcf file and create SNP alignment fasta file - used custom made scripts
- vcflib_pipeline_HPC2.sh script
- uses vcf_fa_extractor.py to filter and make fasta file from vcf
-
Notifications
You must be signed in to change notification settings - Fork 0
taylorpaisie/variant_calling_pipeline
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
My variant calling pipeline for bacteria
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published