Investigating Neural-Based Learning Algorithms for Control

Investigating Neural-Based Learning Algorithms for Control

dc.date.accessioned	2016-05-12T15:13:05Z	und
dc.date.accessioned	2017-10-24T12:24:12Z
dc.date.available	2016-05-12T15:13:05Z	und
dc.date.available	2017-10-24T12:24:12Z
dc.date.issued	2016-05-12T15:13:05Z
dc.identifier.uri	http://radr.hulib.helsinki.fi/handle/10138.1/5445	und
dc.identifier.uri	http://hdl.handle.net/10138.1/5445
dc.title	Investigating Neural-Based Learning Algorithms for Control	en
ethesis.discipline	Computer science	en
ethesis.discipline	Tietojenkäsittelytiede	fi
ethesis.discipline	Datavetenskap	sv
ethesis.discipline.URI	http://data.hulib.helsinki.fi/id/1dcabbeb-f422-4eec-aaff-bb11d7501348
ethesis.department.URI	http://data.hulib.helsinki.fi/id/225405e8-3362-4197-a7fd-6e7b79e52d14
ethesis.department	Institutionen för datavetenskap	sv
ethesis.department	Department of Computer Science	en
ethesis.department	Tietojenkäsittelytieteen laitos	fi
ethesis.faculty	Matematisk-naturvetenskapliga fakulteten	sv
ethesis.faculty	Matemaattis-luonnontieteellinen tiedekunta	fi
ethesis.faculty	Faculty of Science	en
ethesis.faculty.URI	http://data.hulib.helsinki.fi/id/8d59209f-6614-4edd-9744-1ebdaf1d13ca
ethesis.university.URI	http://data.hulib.helsinki.fi/id/50ae46d8-7ba9-4821-877c-c994c78b0d97
ethesis.university	Helsingfors universitet	sv
ethesis.university	University of Helsinki	en
ethesis.university	Helsingin yliopisto	fi
dct.creator	Gao, Yuan
dct.issued	2016
dct.language.ISO639-2	eng
dct.abstract	Controlling a complicated mechanical system to perform a certain task, for example, making robot to dance, is a traditional problem studied in the area of control theory. However, evidence shows that incorporating machine learning techniques in robotics can enable researchers to get rid of tedious engineering works of adjusting environmental parameters. Many researchers like Jan Peters, Sethu Vijayakumar, Stefan Schaal, Andrew Ng and Sebastian Thrun are the early explorers in this field. Based on the Partial Observable Markov Decision Process (POMDP) reinforcement learning, they contributed theory and practical implementation of several benchmarks in this field. Recently, one sub-field of machine learning called deep learning gained a lot of attention as a method attempting to model high-level abstractions by using model architectures composed of multiple non-linear layers (for example [Krizhevsky2012]). Several architectures of deep learning networks like deep belief network [Hinton2006], deep Boltzman machine [Salakhutdinov2009], convolutional neural network [Krizhevsky2012] and deep de-noising auto-encoder [Vincent2010] have shown their advantages in specific areas. The main contribution of deep learning is more related to perception which deals with problems like Sensor Fusion [OConnor2013], Nature Language Processing(NLP)[Cho2014] and Object Recognition [Lenz2013][Hoffman2014]. Although considered briefly in Jürgen Schmidhuber's team[Mayer2006], the other area of robotics, namely control, remains more-or-less unexplored in the realm of deep learning. The main focus of this thesis is to introduce general learning methods for robot control problem with an exploration on deep learning method. As a consequence, this thesis tries to describe the transitional learning methods as well as the emerging deep learning methods including new findings in the investigation.	en
dct.language	en
ethesis.language.URI	http://data.hulib.helsinki.fi/id/languages/eng
ethesis.language	English	en
ethesis.language	englanti	fi
ethesis.language	engelska	sv
ethesis.thesistype	pro gradu-avhandlingar	sv
ethesis.thesistype	pro gradu -tutkielmat	fi
ethesis.thesistype	master's thesis	en
ethesis.thesistype.URI	http://data.hulib.helsinki.fi/id/thesistypes/mastersthesis
dct.identifier.urn	URN:NBN:fi-fe2017112251809
dc.type.dcmitype	Text

Files in this item

Files	Size	Format	View
master_thesis.pdf	9.071Mb	PDF

This item appears in the following Collection(s)

Faculty of Science [4203]

Show simple item record

Investigating Neural-Based Learning Algorithms for Control

Files in this item

This item appears in the following Collection(s)

Yhteystiedot

HELSINGIN YLIOPISTO