Skip to main content
Login | Suomeksi | På svenska | In English

Kernel Methods for Protein-Protein Interaction Prediction

Show simple item record

dc.date.accessioned 2016-01-25T13:38:44Z und
dc.date.accessioned 2017-10-24T12:24:07Z
dc.date.available 2016-01-25T13:38:44Z und
dc.date.available 2017-10-24T12:24:07Z
dc.date.issued 2016-01-25T13:38:44Z
dc.identifier.uri http://radr.hulib.helsinki.fi/handle/10138.1/5254 und
dc.identifier.uri http://hdl.handle.net/10138.1/5254
dc.title Kernel Methods for Protein-Protein Interaction Prediction en
ethesis.department.URI http://data.hulib.helsinki.fi/id/225405e8-3362-4197-a7fd-6e7b79e52d14
ethesis.department Institutionen för datavetenskap sv
ethesis.department Department of Computer Science en
ethesis.department Tietojenkäsittelytieteen laitos fi
ethesis.faculty Matematisk-naturvetenskapliga fakulteten sv
ethesis.faculty Matemaattis-luonnontieteellinen tiedekunta fi
ethesis.faculty Faculty of Science en
ethesis.faculty.URI http://data.hulib.helsinki.fi/id/8d59209f-6614-4edd-9744-1ebdaf1d13ca
ethesis.university.URI http://data.hulib.helsinki.fi/id/50ae46d8-7ba9-4821-877c-c994c78b0d97
ethesis.university Helsingfors universitet sv
ethesis.university University of Helsinki en
ethesis.university Helsingin yliopisto fi
dct.creator Lei, Jinmin
dct.issued 2016
dct.language.ISO639-2 eng
dct.abstract Despite of the efficiency brought by the high-throughput technology in detecting protein-protein interactions, different wet-lab methods still pose different pitfalls. As a complementary strategy, dry-lab methods are less expensive and have an advantage of data fusion that overcomes the biases of individual data sources. This thesis explores the indicative features and the effect of a graph model in the protein-protein interaction prediction task as well as the capability of the multiple kernel learning algorithms in improving the prediction performance.Different kernels are applied in accordance with different features. We integrate 14 global and 10 graph features respectively in the SVM framework via different kernel methods, and then compare the prediction performances of different features. When applying the graph features, we represent individual proteins as labeled graphs and then apply three different graph kernels to explore which one can best capture the relationships between proteins. For merging heterogeneous data, we apply different multiple kernel learning algorithms and explore their capabilities in improving the prediction accuracy. We formulate the prediction of protein-protein interactions as a binary classification problem and in the SVM framework, we need to reconstruct the kernel which measures the similarity between protein pairs from the kernel which measures the similarity between proteins. For this goal, we employ three different pairwise kernels in the SVM framework and explore their effects in capturing the relationships between protein pairs. We perform experiments on 896 Saccharomyces Cerevisiae (baker's yeast) proteins and report the prediction performances of the three pairwise kernels on 10 graph and 14 global features, as well as the prediction results of different multiple kernel learning algorithms. Our experimental results reveal that the overall prediction performance achieved by the 10 graph features applied to the proposed graph model is better than that achieved by the 14 protein global features, and that among all multiple kernel learning methods, the align wins over the others in the protein-protein interaction prediction task. Our methods detect the interacting proteins at a high level. Based on this work, low-level models can be devised to detect the exact interacting spots between proteins. en
dct.language en
ethesis.language.URI http://data.hulib.helsinki.fi/id/languages/eng
ethesis.language English en
ethesis.language englanti fi
ethesis.language engelska sv
ethesis.thesistype pro gradu-avhandlingar sv
ethesis.thesistype pro gradu -tutkielmat fi
ethesis.thesistype master's thesis en
ethesis.thesistype.URI http://data.hulib.helsinki.fi/id/thesistypes/mastersthesis
ethesis.degreeprogram Bioinformatics en
dct.identifier.urn URN:NBN:fi-fe2017112252501
dc.type.dcmitype Text

Files in this item

Files Size Format View
leijinmin_thesis.pdf 670.1Kb PDF

This item appears in the following Collection(s)

Show simple item record