Mid Sweden University

miun.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Likelihood analysis of phylogenetic networks using directed graphical models
Mid Sweden University, Faculty of Science, Technology and Media, Department of Engineering, Physics and Mathematics.
2000 (English)In: Molecular biology and evolution, ISSN 0737-4038, E-ISSN 1537-1719, Vol. 17, no 6, p. 875-881Article in journal (Refereed) Published
Abstract [en]

A method for computing the likelihood of a set of sequences assuming a phylogenetic network as an evolutionary hypothesis is presented. The approach applies directed graphical models to sequence evolution on networks and is a natural generalization of earlier work by Felsenstein on evolutionary trees, including it as a special case. The likelihood computation involves several steps. First, the phylogenetic network is rooted to form a directed acyclic graph (DAG). Then, applying standard models for nucleotide/amino acid substitution, the DAG is converted into a Bayesian network from which the joint probability distribution involving all nodes of the network can be directly read. The joint probability is explicitly dependent on branch lengths and on recombination parameters (prior probability of a parent sequence). The likelihood of the data assuming no knowledge of hidden nodes is obtained by marginalization, i.e., by summing over all combinations of unknown states. As the number of terms increases exponentially with the number of hidden nodes, a Markov chain Monte Carlo procedure (Gibbs sampling) is used to accurately approximate the likelihood by summing over the most important states only. Investigating a human T-cell lymphotropic virus (HTLV) data set and optimizing both branch lengths and recombination parameters, we find that the likelihood of a corresponding phylogenetic network outperforms a set of competing evolutionary trees. In general, except for the case of a tree, the likelihood of a network will be dependent on the choice of the root, even if a reversible model of substitution is applied. Thus, the method also provides a way in which to root a phylogenetic network by choosing a node that produces a most likely network.

Place, publisher, year, edition, pages
2000. Vol. 17, no 6, p. 875-881
Keywords [en]
maximum likelihood, phylogenetic network, graphical model, Bayesian, network, evolutionary tree, Markov chain Monte Carlo sampling, learning probabilistic networks, tree topologies, evolution, dna, nucleotide, sequences, recombination, splitstree, inference, humans
National Category
Mathematics
Identifiers
URN: urn:nbn:se:miun:diva-13680ISI: 000087331600004PubMedID: 10833193Scopus ID: 2-s2.0-0034088330OAI: oai:DiVA.org:miun-13680DiVA, id: diva2:412202
Available from: 2011-04-21 Created: 2011-04-21 Last updated: 2025-09-25Bibliographically approved

Open Access in DiVA

No full text in DiVA

PubMedScopus

Authority records

Moulton, Vincent

Search in DiVA

By author/editor
Moulton, Vincent
By organisation
Department of Engineering, Physics and Mathematics
In the same journal
Molecular biology and evolution
Mathematics

Search outside of DiVA

GoogleGoogle Scholar

pubmed
urn-nbn

Altmetric score

pubmed
urn-nbn
Total: 49 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf