Skip to main content
Login | Suomeksi | På svenska | In English

Comparison of popular methods for prediction of film ratings

Show simple item record

dc.date.accessioned 2015-09-28T12:25:28Z und
dc.date.accessioned 2017-10-24T12:24:03Z
dc.date.available 2015-09-28T12:25:28Z und
dc.date.available 2017-10-24T12:24:03Z
dc.date.issued 2015-09-28T12:25:28Z
dc.identifier.uri http://radr.hulib.helsinki.fi/handle/10138.1/5023 und
dc.identifier.uri http://hdl.handle.net/10138.1/5023
dc.title Comparison of popular methods for prediction of film ratings en
ethesis.department.URI http://data.hulib.helsinki.fi/id/225405e8-3362-4197-a7fd-6e7b79e52d14
ethesis.department Institutionen för datavetenskap sv
ethesis.department Department of Computer Science en
ethesis.department Tietojenkäsittelytieteen laitos fi
ethesis.faculty Matematisk-naturvetenskapliga fakulteten sv
ethesis.faculty Matemaattis-luonnontieteellinen tiedekunta fi
ethesis.faculty Faculty of Science en
ethesis.faculty.URI http://data.hulib.helsinki.fi/id/8d59209f-6614-4edd-9744-1ebdaf1d13ca
ethesis.university.URI http://data.hulib.helsinki.fi/id/50ae46d8-7ba9-4821-877c-c994c78b0d97
ethesis.university Helsingfors universitet sv
ethesis.university University of Helsinki en
ethesis.university Helsingin yliopisto fi
dct.creator Palon, Preston
dct.issued 2015
dct.language.ISO639-2 eng
dct.abstract A recommender system suggests items that the user of the system is likely to find valuable. Together with the explosion of e-commerce, recommender systems have become a focus of academic research. Within this field prediction of film ratings is a popular research area and the topic of this thesis. Many websites that sell, rent or stream films allow users to give ratings to films they have seen. The goal is to accurately predict ratings the user has not given yet. It would then be possible to recommend films the user may want to see. Different ways of predicting film ratings in recommender systems were compared at this thesis using MovieLens 100K as the dataset. The algorithms were implemented in MATLAB, tested using 5-fold cross-validation, and ranked using mean absolute error as the accuracy metric. In total nine different recommender system designs were tested, including four hybrid systems designed and created for this thesis. Techniques used include user and item-based collaborative filtering, singular value decomposition, content-based recommendation and demographic method. Separate tuning data was used to optimise parameters including similarity measure used and the best nearest neighbourhood size. Of the basic methods item-based collaborative filtering gave the best results, followed by singular value decomposition. User-based collaborative filtering, content-based recommendation and demographic method performed slightly worse. The overall best results were achieved with a hybrid design that combines baseline predictors with user-based and item-based collaborative filtering. Choosing the best similarity measure and finding ideal values for parameters like nearest neighbourhood size had a significant impact on the results. en
dct.language en
ethesis.language.URI http://data.hulib.helsinki.fi/id/languages/eng
ethesis.language English en
ethesis.language englanti fi
ethesis.language engelska sv
ethesis.thesistype pro gradu-avhandlingar sv
ethesis.thesistype pro gradu -tutkielmat fi
ethesis.thesistype master's thesis en
ethesis.thesistype.URI http://data.hulib.helsinki.fi/id/thesistypes/mastersthesis
ethesis.degreeprogram Algorithms and Machine Learning en
dct.identifier.urn URN:NBN:fi-fe2017112252517
dc.type.dcmitype Text

Files in this item

Files Size Format View
Thesis.pdf 2.654Mb PDF

This item appears in the following Collection(s)

Show simple item record