-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADMEWQMCPIPELINE
36 lines (23 loc) · 2.14 KB
/
READMEWQMCPIPELINE
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
Most summary methods use files of gene trees to produce a species trees. The script wQMCquartetsfromgenetreefile.sh
takes a file containing a list of newick gene trees as input,
and then makes a file that contains the output quartets formatted for input to wQMC.
The script wQMCquartetsfromgenetreefile.sh depends on scripts automatically included in the directory created by the
git clone or tarball from this github: moving files will create errors.
wQMCquartetsfromgenetreefile.sh takes one command line argument: the file path to the file containinga list of gene trees.
It outputs a file in the same directory containing the gene tree file called *.WQMCquartets where * represents the name of your
gene tree file. This file contains a list ofweighted quartets where the weight represents the sum,
over the set of gene trees in your file of the number of times each quartet appears in the efficientquartets
representation of each gene tree.
Example usage (from UIUC Taub campus cluster): if you have 50 gene tree files and you want to have an
efficient wQMC-input summary of each file, the loop
for ind in `seq 1`; do ./wQMCquartetsfromgenetreefile.sh home/redavid2/phylogenetics/DATA/ASTRAL2.50Taxa/model.50.2000000.0.000001.true/$ind/truegenetrees
will write a file truegenetrees.WQMCquartets in the same directory as truegenetrees
Note this example is using a data set available in the supporting materials for the paper S. Mirarab and T. Warnow:
"ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes" available at
http://bioinformatics.oxfordjournals.org/content/31/12/i44
File paths will have to be updated for your data set and note wQMC will only run on a linux operating system.
For the scripts to work on your machine, you must have dendropy installed, available at http://dendropy.org/
Dendropy also has a google users group and github with the most up-to-date information.
These scripts have been most recently tested on a Scientific Linux 6.1 (Linux 2.6.32) operating system. This is the Taub system.
Please contact [email protected] for bug reports or questions.
Last updated 9/27/16