btv_sequencing

This repository contains scripts used to generate consensus sequences for newly-sequenced bluetongue virus (BTV) isolates.

The entry point to the pipeline is the run_mapping_pipeline_one_sample bash script, which runs the mapping and consensus-calling steps described below for one isolate.

It expects Illumina paired-end data in two files named as follows:

<isolate_name>_R1.fastq
<isolate_name>_R2.fastq

Pass <isolate_name> to the script as a command-line argument.
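
The naming convention above can be illustrated with a small Python sketch. The function name paired_fastq_paths and the optional directory argument are assumptions made for this example; the pipeline script itself is a bash script and may locate its inputs differently.

```python
from pathlib import Path


def paired_fastq_paths(isolate_name, directory="."):
    """Return the expected R1 and R2 FASTQ paths for one isolate.

    Hypothetical helper: it only applies the <isolate_name>_R1.fastq /
    <isolate_name>_R2.fastq convention and does not check that files exist.
    """
    base = Path(directory)
    r1 = base / f"{isolate_name}_R1.fastq"
    r2 = base / f"{isolate_name}_R2.fastq"
    return r1, r2


if __name__ == "__main__":
    # "my_isolate" is a placeholder isolate name.
    for path in paired_fastq_paths("my_isolate"):
        print(path)
```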

The pipeline uses bowtie2 to map trimmed reads to a set of existing BTV reference sequences downloaded from GenBank.

It then uses the identify_best_segments_from_sam script to identify the best-matching existing reference sequence for each of the 10 segments. The best match is defined as the reference sequence with the most reads aligned to it.
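
The selection rule can be sketched in Python as below. This is not the code of identify_best_segments_from_sam; it is a minimal illustration that assumes reference names carry the s<segment>_ prefix used in this repository's reference FASTA and that unaligned reads are marked with SAM flag bit 0x4. The function name and the tie-breaking choice (the first reference seen wins) are assumptions.

```python
from collections import Counter


def best_reference_per_segment(sam_lines):
    """Pick, for each segment, the reference with the most aligned reads."""
    counts = Counter()
    for line in sam_lines:
        if line.startswith("@"):
            continue  # skip SAM header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 4:
            continue  # not a valid alignment record
        flag = int(fields[1])
        reference = fields[2]
        if flag & 0x4 or reference == "*":
            continue  # read did not align
        counts[reference] += 1

    best = {}
    for reference, n_reads in counts.items():
        # "s1_AY493686" -> segment key "s1"
        segment = reference.split("_", 1)[0]
        if segment not in best or n_reads > best[segment][1]:
            best[segment] = (reference, n_reads)
    return {segment: ref for segment, (ref, _) in best.items()}
```

A usage example, with made-up alignment records (the accession XX000001 is a placeholder, not a real reference):

```python
records = [
    "@HD\tVN:1.0",
    "r1\t0\ts1_AY493686\t10\t42\t4M\t*\t0\t0\tACGT\tIIII",
    "r2\t16\ts1_AY493686\t20\t42\t4M\t*\t0\t0\tACGT\tIIII",
    "r3\t0\ts1_XX000001\t5\t42\t4M\t*\t0\t0\tACGT\tIIII",
    "r4\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\tIIII",
]
print(best_reference_per_segment(records))
```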

It then uses a series of samtools and bcftools commands to create new consensus sequences for each of the 10 segments and finally remaps reads to these new draft consensus sequences.
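
The consensus step can be sketched as follows. This is not the command sequence used by run_mapping_pipeline_one_sample, which this README does not spell out; it is one commonly used bcftools recipe from recent releases, and the script's actual commands and flags may differ. The function name consensus_commands and all output file names are assumptions for illustration. The function only builds shell command strings and does not run them.

```python
import shlex


def consensus_commands(reference_fasta, sorted_bam, prefix):
    """Build (but do not run) shell commands for one segment's consensus.

    Assumes sorted_bam is a coordinate-sorted BAM of reads aligned to
    reference_fasta. Output names derived from prefix are hypothetical.
    """
    ref = shlex.quote(reference_fasta)
    bam = shlex.quote(sorted_bam)
    vcf = shlex.quote(f"{prefix}.vcf.gz")
    out = shlex.quote(f"{prefix}_consensus.fasta")
    return [
        # Call variants against the chosen reference.
        f"bcftools mpileup -f {ref} {bam} | bcftools call -mv -Oz -o {vcf}",
        # Index the compressed VCF so it can be read by bcftools consensus.
        f"bcftools index {vcf}",
        # Apply the called variants to the reference to get a draft consensus.
        f"bcftools consensus -f {ref} -o {out} {vcf}",
    ]


if __name__ == "__main__":
    for command in consensus_commands("s1_ref.fasta", "s1.sorted.bam", "s1"):
        print(command)
```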

Dependencies:

bowtie2
samtools
bcftools
the other scripts in this repository
An existing set of BTV reference sequences and the associated bowtie2 index (both included in this repository). Note that the FASTA sequence names are prepended with their corresponding segment number. For example, >s1_AY493686 is a segment 1 reference sequence with NCBI accession AY493686.
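
The reference naming convention can be parsed as in the sketch below. The function name parse_reference_name is hypothetical, and the pattern assumes every name has the form s<segment number>_<accession>, as in the example above.

```python
import re

# Matches names such as ">s1_AY493686" or "s10_<accession>".
REFERENCE_NAME = re.compile(r"^>?s(\d+)_(\S+)")


def parse_reference_name(header):
    """Split a reference sequence name into (segment number, accession)."""
    match = REFERENCE_NAME.match(header.strip())
    if match is None:
        raise ValueError(f"unexpected reference name: {header!r}")
    return int(match.group(1)), match.group(2)


if __name__ == "__main__":
    print(parse_reference_name(">s1_AY493686"))
```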

Mark Stenglein, Dec, 2015
