This project aims to evaluate the interpretation of support values generated by Felsenstein's bootstrap (FBP; Felsenstein 1985), transfer bootstrap expectation (TBE; Lemoine et al. 2018) and other bootstrap methods (e.g. Minh et al. 2013). To do so we rely on simulations where the true trees are known. The main idea is to look at the correlation between branch supports inferred by each method and the probability that the branch is true. While FBP is well known to be conservative (i.e., FBP support of >=70% typically means that the branch is true with a probability of >=0.95) (Hillis and Bull 1993), none is known about TBE. Thus, we want to assess if the suggested threshold of 70% applies well to TBE or not.
To replicate the results follow these steps:
-
Run
generate_datasets.R
located inscripts/
folder (this relies on thefunctions.R
script). This generates datasets and runs IQ-TREE on them. -
Run
calculate_tbes.R
. This uses BOOSTER to calculate TBE and FBP for each dataset). -
Run
bootstrap_data.R
. This generates the output filesimulated.csv
.
-
Download the simulated alignments based on the PANDIT database from .
-
Go to
dna
folder and for each sub-folder$i
, run IQ-TREE:iqtree -s data.$i -m `cat model.$i` -b 100
-
In each sub-folder, create a symbolic links:
ln -s data.$i alignment.fasta ln -s tree.$i phylogram.phy
-
Run
calculate_tbes.R
. -
Run
bootstrap_data.R
. This generates the output filepandit.csv
.
- Run
bootstrap_plots.R
. This combines the output from PANDIT analysis (pandit.csv
) with the output from birth-death analysis (simulated.csv
) and makes the plot.