dc.date.accessioned |
2016-08-17T10:00:06Z |
und |
dc.date.accessioned |
2017-10-24T12:24:10Z |
|
dc.date.available |
2016-08-17T10:00:06Z |
und |
dc.date.available |
2017-10-24T12:24:10Z |
|
dc.date.issued |
2016-08-17T10:00:06Z |
|
dc.identifier.uri |
http://radr.hulib.helsinki.fi/handle/10138.1/5701 |
und |
dc.identifier.uri |
http://hdl.handle.net/10138.1/5701 |
|
dc.title |
Bandits in Information Retrieval |
en |
ethesis.discipline |
Computer science |
en |
ethesis.discipline |
Tietojenkäsittelytiede |
fi |
ethesis.discipline |
Datavetenskap |
sv |
ethesis.discipline.URI |
http://data.hulib.helsinki.fi/id/1dcabbeb-f422-4eec-aaff-bb11d7501348 |
|
ethesis.department.URI |
http://data.hulib.helsinki.fi/id/225405e8-3362-4197-a7fd-6e7b79e52d14 |
|
ethesis.department |
Institutionen för datavetenskap |
sv |
ethesis.department |
Department of Computer Science |
en |
ethesis.department |
Tietojenkäsittelytieteen laitos |
fi |
ethesis.faculty |
Matematisk-naturvetenskapliga fakulteten |
sv |
ethesis.faculty |
Matemaattis-luonnontieteellinen tiedekunta |
fi |
ethesis.faculty |
Faculty of Science |
en |
ethesis.faculty.URI |
http://data.hulib.helsinki.fi/id/8d59209f-6614-4edd-9744-1ebdaf1d13ca |
|
ethesis.university.URI |
http://data.hulib.helsinki.fi/id/50ae46d8-7ba9-4821-877c-c994c78b0d97 |
|
ethesis.university |
Helsingfors universitet |
sv |
ethesis.university |
University of Helsinki |
en |
ethesis.university |
Helsingin yliopisto |
fi |
dct.creator |
Koivisto, Timo |
|
dct.issued |
2016 |
|
dct.language.ISO639-2 |
eng |
|
dct.abstract |
This thesis is a review of bandit algorithms in information retrieval. In information retrieval a result list should include the most relevant documents and the results should also be non-redundant and diverse. To achieve this, some form of feedback is required. This document describes implicit feedback collected from user interactions by using interleaving methods that allow alternative rankings of documents to be presented in result lists. Bandit algorithms can then be used to learn from user interactions in a principled way. The reviewed algorithms include dueling bandits, contextual bandits, and contextual dueling bandits. Additionally coactive learning and preference learning are described. Finally algorithms are summarized by using regret as a performance measure. |
en |
dct.language |
en |
|
ethesis.language.URI |
http://data.hulib.helsinki.fi/id/languages/eng |
|
ethesis.language |
English |
en |
ethesis.language |
englanti |
fi |
ethesis.language |
engelska |
sv |
ethesis.thesistype |
pro gradu-avhandlingar |
sv |
ethesis.thesistype |
pro gradu -tutkielmat |
fi |
ethesis.thesistype |
master's thesis |
en |
ethesis.thesistype.URI |
http://data.hulib.helsinki.fi/id/thesistypes/mastersthesis |
|
dct.identifier.urn |
URN:NBN:fi-fe2017112251762 |
|
dc.type.dcmitype |
Text |
|