dc.date.accessioned |
2015-06-29T10:26:03Z |
und |
dc.date.accessioned |
2017-11-06T12:03:30Z |
|
dc.date.available |
2015-05-25 |
fi |
dc.date.available |
2015-06-29T10:26:03Z |
und |
dc.date.available |
2017-11-06T12:03:30Z |
|
dc.date.issued |
2015-05-21 |
|
dc.identifier.uri |
http://hdl.handle.net/10138/155608 |
|
dc.publisher |
Helsingin yliopisto |
fi |
dc.publisher |
Helsingfors universitet |
sv |
dc.publisher |
University of Helsinki |
en |
dc.title |
Likelihood-based Phylogenetic Network Inference by Approximate Structural Expectation Maximization |
en |
ethesis.department.URI |
http://data.hulib.helsinki.fi/id/225405e8-3362-4197-a7fd-6e7b79e52d14 |
|
ethesis.department |
Institutionen för datavetenskap |
sv |
ethesis.department |
Department of Computer Science |
en |
ethesis.department |
Tietojenkäsittelytieteen laitos |
fi |
ethesis.faculty |
Matematisk-naturvetenskapliga fakulteten |
sv |
ethesis.faculty |
Matemaattis-luonnontieteellinen tiedekunta |
fi |
ethesis.faculty |
Faculty of Science |
en |
ethesis.faculty.URI |
http://data.hulib.helsinki.fi/id/8d59209f-6614-4edd-9744-1ebdaf1d13ca |
|
ethesis.university.URI |
http://data.hulib.helsinki.fi/id/50ae46d8-7ba9-4821-877c-c994c78b0d97 |
|
ethesis.university |
Helsingfors universitet |
sv |
ethesis.university |
University of Helsinki |
en |
ethesis.university |
Helsingin yliopisto |
fi |
dct.creator |
Nguyen, Quan |
|
dct.issued |
2015 |
|
dct.language.ISO639-2 |
eng |
|
dct.abstract |
Probabilistic phylogenetic trees are widely considered as the most powerful and reliable method for phylogenetic analysis. However, in reality, processes like hybridization, horizontal gene transfer, and recombination result in reticulation, which means that the evolutionary process can no longer be accurately described by a tree-like graph. A phylogenetic network, which is a general version of a phylogenetic tree is more appropriate in this situation. Unfortunately computational challenges arise when handling likelihood-based phylogenetic networks. Earlier methods often require the hypotheses to be in the neighborhood of the underlying true phylogeny and to be specified as a backbone tree or the number of possible reticulation events. Nevertheless their running time is still often too slow to be really helpful in many realistic scenarios. We propose a method called PhyloDAG, which is significantly faster than earlier methods, and thus restrictions on the network search can be removed. As a consequence the inference is more likely to be accurate. The key idea to speed up phylogenetic network inference by the proposed method, Stochastic Structural Expectation Maximization, which is an EM like algorithm, where in the E step it samples missing data while in the M step it optimizes both the parameters and the structure of the phylogenetic network on pseudo-complete data. Experiments on simulated data as well as real biological and textual data demonstrate that the proposed method, PhyloDAG, can efficiently infer accurate phylogenetic networks. |
en |
dct.language |
en |
|
ethesis.language.URI |
http://data.hulib.helsinki.fi/id/languages/eng |
|
ethesis.language |
English |
en |
ethesis.language |
englanti |
fi |
ethesis.language |
engelska |
sv |
ethesis.supervisor |
Roos, Teemu |
|
ethesis.thesistype |
pro gradu-avhandlingar |
sv |
ethesis.thesistype |
pro gradu -tutkielmat |
fi |
ethesis.thesistype |
master's thesis |
en |
ethesis.thesistype.URI |
http://data.hulib.helsinki.fi/id/thesistypes/mastersthesis |
|
dct.identifier.urn |
URN:NBN:fi-fe2015062910525 |
|
dc.type.dcmitype |
Text |
|
dct.rights |
This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited. |
en |
dct.rights |
Publikationen är skyddad av upphovsrätten. Den får läsas och skrivas ut för personligt bruk. Användning i kommersiellt syfte är förbjuden. |
sv |
dct.rights |
Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty. |
fi |