Skip to main content
Login | Suomeksi | På svenska | In English

Investigating Neural-Based Learning Algorithms for Control

Show simple item record

dc.date.accessioned 2016-05-12T15:13:05Z und
dc.date.accessioned 2017-10-24T12:24:12Z
dc.date.available 2016-05-12T15:13:05Z und
dc.date.available 2017-10-24T12:24:12Z
dc.date.issued 2016-05-12T15:13:05Z
dc.identifier.uri http://radr.hulib.helsinki.fi/handle/10138.1/5445 und
dc.identifier.uri http://hdl.handle.net/10138.1/5445
dc.title Investigating Neural-Based Learning Algorithms for Control en
ethesis.discipline Computer science en
ethesis.discipline Tietojenkäsittelytiede fi
ethesis.discipline Datavetenskap sv
ethesis.discipline.URI http://data.hulib.helsinki.fi/id/1dcabbeb-f422-4eec-aaff-bb11d7501348
ethesis.department.URI http://data.hulib.helsinki.fi/id/225405e8-3362-4197-a7fd-6e7b79e52d14
ethesis.department Institutionen för datavetenskap sv
ethesis.department Department of Computer Science en
ethesis.department Tietojenkäsittelytieteen laitos fi
ethesis.faculty Matematisk-naturvetenskapliga fakulteten sv
ethesis.faculty Matemaattis-luonnontieteellinen tiedekunta fi
ethesis.faculty Faculty of Science en
ethesis.faculty.URI http://data.hulib.helsinki.fi/id/8d59209f-6614-4edd-9744-1ebdaf1d13ca
ethesis.university.URI http://data.hulib.helsinki.fi/id/50ae46d8-7ba9-4821-877c-c994c78b0d97
ethesis.university Helsingfors universitet sv
ethesis.university University of Helsinki en
ethesis.university Helsingin yliopisto fi
dct.creator Gao, Yuan
dct.issued 2016
dct.language.ISO639-2 eng
dct.abstract Controlling a complicated mechanical system to perform a certain task, for example, making robot to dance, is a traditional problem studied in the area of control theory. However, evidence shows that incorporating machine learning techniques in robotics can enable researchers to get rid of tedious engineering works of adjusting environmental parameters. Many researchers like Jan Peters, Sethu Vijayakumar, Stefan Schaal, Andrew Ng and Sebastian Thrun are the early explorers in this field. Based on the Partial Observable Markov Decision Process (POMDP) reinforcement learning, they contributed theory and practical implementation of several benchmarks in this field. Recently, one sub-field of machine learning called deep learning gained a lot of attention as a method attempting to model high-level abstractions by using model architectures composed of multiple non-linear layers (for example [Krizhevsky2012]). Several architectures of deep learning networks like deep belief network [Hinton2006], deep Boltzman machine [Salakhutdinov2009], convolutional neural network [Krizhevsky2012] and deep de-noising auto-encoder [Vincent2010] have shown their advantages in specific areas. The main contribution of deep learning is more related to perception which deals with problems like Sensor Fusion [OConnor2013], Nature Language Processing(NLP)[Cho2014] and Object Recognition [Lenz2013][Hoffman2014]. Although considered briefly in Jürgen Schmidhuber's team[Mayer2006], the other area of robotics, namely control, remains more-or-less unexplored in the realm of deep learning. The main focus of this thesis is to introduce general learning methods for robot control problem with an exploration on deep learning method. As a consequence, this thesis tries to describe the transitional learning methods as well as the emerging deep learning methods including new findings in the investigation. en
dct.language en
ethesis.language.URI http://data.hulib.helsinki.fi/id/languages/eng
ethesis.language English en
ethesis.language englanti fi
ethesis.language engelska sv
ethesis.thesistype pro gradu-avhandlingar sv
ethesis.thesistype pro gradu -tutkielmat fi
ethesis.thesistype master's thesis en
ethesis.thesistype.URI http://data.hulib.helsinki.fi/id/thesistypes/mastersthesis
dct.identifier.urn URN:NBN:fi-fe2017112251809
dc.type.dcmitype Text

Files in this item

Files Size Format View
master_thesis.pdf 9.071Mb PDF

This item appears in the following Collection(s)

Show simple item record