Cognate Discovery and Alignment in Computational Etymology

dc.date.accessioned 2014-03-18T12:22:25Z und
dc.date.accessioned 2017-10-24T12:23:41Z
dc.date.available 2014-03-18T12:22:25Z und
dc.date.available 2017-10-24T12:23:41Z
dc.date.issued 2014-03-18T12:22:25Z
dc.identifier.uri http://radr.hulib.helsinki.fi/handle/10138.1/3552 und
dc.identifier.uri http://hdl.handle.net/10138.1/3552
dc.title Cognate Discovery and Alignment in Computational Etymology en
ethesis.department.URI http://data.hulib.helsinki.fi/id/225405e8-3362-4197-a7fd-6e7b79e52d14
ethesis.department Institutionen för datavetenskap sv
ethesis.department Department of Computer Science en
ethesis.department Tietojenkäsittelytieteen laitos fi
ethesis.faculty Matematisk-naturvetenskapliga fakulteten sv
ethesis.faculty Matemaattis-luonnontieteellinen tiedekunta fi
ethesis.faculty Faculty of Science en
ethesis.faculty.URI http://data.hulib.helsinki.fi/id/8d59209f-6614-4edd-9744-1ebdaf1d13ca
ethesis.university.URI http://data.hulib.helsinki.fi/id/50ae46d8-7ba9-4821-877c-c994c78b0d97
ethesis.university Helsingfors universitet sv
ethesis.university University of Helsinki en
ethesis.university Helsingin yliopisto fi
dct.creator Lv, Guowei
dct.issued 2014
dct.language.ISO639-2 eng
dct.abstract This master's thesis discusses two main tasks of computational etymology: finding cognates in multilingual text, and finding the underlying correspondence rules by aligning cognates. For the first task, I briefly describe two categories of methods for identifying cognates: symbol-based and phonetic-based. For the second task, I describe the Etymon project, on which I have been working. The Etymon project uses a probabilistic method and the Minimum Description Length principle to align cognate sets. The objective of the project is to build a model that can automatically find as much information in the cognates as possible without linguistic knowledge, and that can also find the genetic relationships between the languages. I also discuss the experiment I carried out to explore the uncertainty in the data source. en
dct.language en
ethesis.language.URI http://data.hulib.helsinki.fi/id/languages/eng
ethesis.language English en
ethesis.language englanti fi
ethesis.language engelska sv
ethesis.thesistype pro gradu-avhandlingar sv
ethesis.thesistype pro gradu -tutkielmat fi
ethesis.thesistype master's thesis en
ethesis.thesistype.URI http://data.hulib.helsinki.fi/id/thesistypes/mastersthesis
ethesis.degreeprogram Algorithms and Machine Learning en
dct.identifier.urn URN:NBN:fi-fe2017112251391
dc.type.dcmitype Text

Files in this item

Files Size Format View
guowei-lv-thesis.pdf 1.386Mb PDF
