Metagenomic Intra-Species Diversity Analysis System (MIDAS)

MIDAS is an integrated pipeline that leverages >30,000 reference genomes to estimate bacterial species abundance and strain-level genomic variation, including gene content and SNPs, from shotgun metagnomes.

Applications

Profile bacterial species abundance: rapidly estimate the abundance of 5,952 bacterial species
Strain-level pan-genome profiling: estimate the gene content of populations based on mapping to genes from reference genomes
Single-nucleotide-polymorphism prediction: identify single-nucleotide polymorphisms (SNPs) of populations based on mapping to reference genomes
Phylogenetic inference: reconstruct the phylogeny of strains from metagenomes and reference genomes
Population genetic inference: quantify strain-level diversity, differentiation, and selection within and between metagenomes

[Dependencies, Download, Installation, Testing, and Updating] (https://github.com/snayfach/MIDAS/blob/master/docs/install.md)
[Reference database] (https://github.com/snayfach/MIDAS/blob/master/docs/ref_db.md)
[Tutorial] (https://github.com/snayfach/MIDAS/blob/master/docs/tutorial.md)
Scripts to run MIDAS on a single sample:

[Estimate species abundance] (https://github.com/snayfach/MIDAS/blob/master/docs/species.md)
[Predict pan-genome gene content] (https://github.com/snayfach/MIDAS/blob/master/docs/cnvs.md)
[Call single nucleotide polymorphisms] (https://github.com/snayfach/MIDAS/blob/master/docs/snvs.md)

Scripts to merge results across samples:

[Merge species abundance] (https://github.com/snayfach/MIDAS/blob/master/docs/merge_species.md)
[Merge gene content] (https://github.com/snayfach/MIDAS/blob/master/docs/merge_cnvs.md)
[Merge SNPs] (https://github.com/snayfach/MIDAS/blob/master/docs/merge_snvs.md)

Citation

If you use this tool, please cite: Nayfach, S. and Pollard, KS. "Population genetic analyses of metagenomes reveal extensive strain-level variation in prevalent human-associated bacteria". bioRxiv 2015.

Pipeline

**An integrated pipeline to estimate bacterial species abundance and strain-level genomic variation from shotgun metagnomes** _{**A) Metagenome species profiling.** Reads from a metagenomic sample are aligned against a database of phylogenetic marker genes and are assigned to species groups. Mapped reads are used to estimate the genome-coverage and relative abundance of 5,952 genome-clusters. **B) Metagenome pan-genome profiling.** A pan-genome database is dynamically constructed based on the subset of species that are present at high coverage (e.g. >1x) in the metagenome. Reads are mapped to the gene database using Bowtie2. Mapped reads are used to infer gene copy number and gene presence/absence. **C) Single-nucleotide variant prediction.** A representative genome database is constructed, as described in (B). Reads are globally aligned to the genome database using Bowtie2. Mapped reads are used to identify variants, predict consensus alleles, and estimate allele frequencies. **D) Merge results.** For each species, results are merged across one or more samples to generate several outputs, including: a gene presence/absence matrix, an allele frequency matrix, an approximate maximum-likelihood phylogenetic tree.}

Examples

**Comparative genomics of *Bacteroides ovatus* strains across host microbiomes** _{**A)** Presence or absence of genes in the *Bacteroides ovatus* pangenome across human faecal metagenomes. Column colors indicate whether a gene is core (blue; occurs in >95% of samples), auxiliary (red; occurs in 1-95% of samples ), or absent (green; occurs in < 1% of samples). **B)** Gene set enrichment analysis identifies functions overrepresented in the core genome, auxiliary genome, and genes that only occur in reference genomes.}

Name		Name	Last commit message	Last commit date
Latest commit History 177 Commits
bin		bin
docs		docs
images		images
midas		midas
scripts		scripts
test		test
.gitignore		.gitignore
CHANGES		CHANGES
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Metagenomic Intra-Species Diversity Analysis System (MIDAS)

Applications

Table of Contents

Citation

Pipeline

Examples

About

Releases

Packages

Languages

License

palc/MIDAS

Folders and files

Latest commit

History

Repository files navigation

Metagenomic Intra-Species Diversity Analysis System (MIDAS)

Applications

Table of Contents

Citation

Pipeline

Examples

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages