To use all functions of this page, please activate cookies in your browser.
my.bionity.com
With an accout for my.bionity.com you can always see everything at a glance – and you can configure your own website and individual newsletter.
- My watch list
- My saved searches
- My saved topics
- My newsletter
Bayesian inference in phylogenyBayesian inference in phylogeny generates a posterior distribution for a parameter, composed of a phylogenetic tree and a model of evolution, based on the prior for that parameter and the likelihood of the data, generated by a multiple alignment. The Bayesian approach has become more popular due to advances in computational machinery, especially, Markov Chain Monte Carlo algorithms. Bayesian inference has a number of applications in molecular phylogenetics, for example, estimation of species phylogeny and species divergence times. Additional recommended knowledge
Basic Bayesian TheoryRecall that for Bayesian inference:
The denominator
where In the original Metropolis algorithm, given a current The LOCAL algorithm of Larget and SimonThe LOCAL algorithm begins by selecting an internal branch of the tree at random. The nodes at the ends of this branch are each connected to two other branches. One of each pair is chosen at random. Imagine taking these three selected edges and stringing them like a clothesline from left to right, where the direction (left/right) is also selected at random. The two endpoints of the first branch selected will have a sub-tree hanging like a piece of clothing strung to the line. The algorithm proceeds by multiplying the three selected branches by a common random amount, akin to stretching or shrinking the clothesline. Finally the leftmost of the two hanging sub-trees is disconnected and reattached to the clothesline at a location selected uniformly at random. This is the candidate tree. Suppose we began by selecting the internal branch with length $$ Assessing ConvergenceSuppose we want to estimate a branch length of a 2-taxon tree under JC, in which n1 sites are unvaried and n2 are variable. Assume exponential prior distribution with rate for unvaried sites, and Thus the unnormalized posterior distribution is: or, alternately, Update branch length by choosing new value uniformly at random from a window of half-width where Example: Metropolis-coupled MCMC (Geyer)If the target distribution has multiple peaks, separated by low valleys, the Markov chain may have difficulty in moving from one peak to another. As a result, the chain may get stuck on one peak and the resulting samples will not approximate the posterior density correctly. This is a serious practical concern for phylogeny reconstruction, as multiple local peaks are known to exist in the tree space during heuristic tree search under maximum parsimony (MP), maximum likelihood (ML), and minimum evolution (ME) criteria, and the same can be expected for stochastic tree search using MCMC. Many strategies have been proposed to improve mixing of Markov chains in presence of multiple local peaks in the posterior density. One of the most successful algorithms is the Metropolis-coupled MCMC (or In this algorithm, $m$ chains are run in parallel, with different stationary distributions so that the first chain is the cold chain with the correct target density, while chains At the end of the run, output from only the cold chain is used, while those from the hot chains are discarded. Heuristically, the hot chains will visit the local peaks rather easily, and swapping states between chains will let the cold chain occasionally jump valleys, leading to better mixing. However, if An obvious disadvantage of the algorithm is that
References
Categories: Bioinformatics | Phylogenetics | Computational phylogenetics |
|||||||||||||
This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article "Bayesian_inference_in_phylogeny". A list of authors is available in Wikipedia. |