dc.date.accessioned |
2014-03-18T12:22:25Z |
und |
dc.date.accessioned |
2017-10-24T12:23:41Z |
|
dc.date.available |
2014-03-18T12:22:25Z |
und |
dc.date.available |
2017-10-24T12:23:41Z |
|
dc.date.issued |
2014-03-18T12:22:25Z |
|
dc.identifier.uri |
http://radr.hulib.helsinki.fi/handle/10138.1/3552 |
und |
dc.identifier.uri |
http://hdl.handle.net/10138.1/3552 |
|
dc.title |
Cognate Discovery and Alignment in Computational Etymology |
en |
ethesis.department.URI |
http://data.hulib.helsinki.fi/id/225405e8-3362-4197-a7fd-6e7b79e52d14 |
|
ethesis.department |
Institutionen för datavetenskap |
sv |
ethesis.department |
Department of Computer Science |
en |
ethesis.department |
Tietojenkäsittelytieteen laitos |
fi |
ethesis.faculty |
Matematisk-naturvetenskapliga fakulteten |
sv |
ethesis.faculty |
Matemaattis-luonnontieteellinen tiedekunta |
fi |
ethesis.faculty |
Faculty of Science |
en |
ethesis.faculty.URI |
http://data.hulib.helsinki.fi/id/8d59209f-6614-4edd-9744-1ebdaf1d13ca |
|
ethesis.university.URI |
http://data.hulib.helsinki.fi/id/50ae46d8-7ba9-4821-877c-c994c78b0d97 |
|
ethesis.university |
Helsingfors universitet |
sv |
ethesis.university |
University of Helsinki |
en |
ethesis.university |
Helsingin yliopisto |
fi |
dct.creator |
Lv, Guowei |
|
dct.issued |
2014 |
|
dct.language.ISO639-2 |
eng |
|
dct.abstract |
This master's thesis discusses two main tasks of computational etymology: first, finding cognates in multilingual text; second, finding the underlying correspondence rules by aligning cognates.
For the first task, I briefly describe two categories of methods for identifying cognates: symbol-based and phonetic-based. For the second task, I describe the Etymon project, on which I have been working. The Etymon project uses a probabilistic method and the Minimum Description Length principle to align cognate sets. The objective of the project is to build a model that can automatically find as much information in the cognates as possible without linguistic knowledge, and that can also find the genetic relationships between the languages. I also discuss the experiment I carried out to explore the uncertainty in the data source. |
en |
dct.language |
en |
|
ethesis.language.URI |
http://data.hulib.helsinki.fi/id/languages/eng |
|
ethesis.language |
English |
en |
ethesis.language |
englanti |
fi |
ethesis.language |
engelska |
sv |
ethesis.thesistype |
pro gradu-avhandlingar |
sv |
ethesis.thesistype |
pro gradu -tutkielmat |
fi |
ethesis.thesistype |
master's thesis |
en |
ethesis.thesistype.URI |
http://data.hulib.helsinki.fi/id/thesistypes/mastersthesis |
|
ethesis.degreeprogram |
Algorithms and Machine Learning |
en |
dct.identifier.urn |
URN:NBN:fi-fe2017112251391 |
|
dc.type.dcmitype |
Text |
|