Hierarchical reinforcement learning using path clustering

Paulo Gil; Luís Nunes

Publication in conference proceedings

Paulo Gil (Gil, P.); Luís Nunes (Nunes, L.);

2013 8th Iberian Conference on Information Systems and Technologies (CISTI)

Year (definitive publication)

2013

Language

English

Country

United States of America

Web of Science®

Times Cited: 2

(Last checked: 2024-05-18 16:21)

Scopus

Times Cited: 6

(Last checked: 2024-05-15 09:22)

Article Impact Index: 1.9

Google Scholar

Times Cited: 9

(Last checked: 2024-05-13 09:48)

Abstract

In this paper we study whether the performance of the Q-Learning algorithm can be improved by automatically finding subgoals and making better use of acquired knowledge. We explore a method that allows an agent to gather information about sequences of states that lead to a goal, detect classes of common sequences, and introduce the states at the end of these sequences as subgoals. We use the taxi problem (a standard benchmark in the Hierarchical Reinforcement Learning literature) and conclude that, even though the scale of this problem is relatively small, in most cases subgoals do improve learning speed, achieving relatively good results faster than standard Q-Learning. We propose a specific iteration interval as the most appropriate point at which to insert subgoals into the learning process. We also found that early adoption of subgoals may lead to suboptimal learning. Extending the method to more challenging problems is an interesting subject for future work.
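The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how paths are clustered or how subgoals are fed back into learning, so the clustering step is replaced here by a simple stand-in that counts contiguous state windows shared by many successful paths. The names `q_update`, `candidate_subgoals`, `length`, and `min_support` are hypothetical and chosen only for this illustration.

```python
from collections import Counter, defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Standard one-step tabular Q-learning update on a dict-like table."""
    best_next = max(q[(next_state, a)] for a in actions)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])


def candidate_subgoals(paths, length=3, min_support=0.5):
    """Return the end states of state sequences shared by many successful paths.

    Assumption: this window-counting heuristic stands in for the path
    clustering named in the abstract; it is not the paper's method.
    Each path is a list of states that ended at the goal. The goal state
    itself is dropped so that only intermediate states are proposed.
    """
    counts = Counter()
    for path in paths:
        inner = path[:-1]  # exclude the final goal state
        # Use a set so each window is counted at most once per path.
        windows = {tuple(inner[i:i + length]) for i in range(len(inner) - length + 1)}
        counts.update(windows)
    threshold = min_support * len(paths)
    return {window[-1] for window, count in counts.items() if count >= threshold}


# Usage: three successful paths that share the sequence b -> c -> d.
paths = [
    ["a", "b", "c", "d", "goal"],
    ["x", "b", "c", "d", "goal"],
    ["y", "z", "c", "d", "goal"],
]
subgoals = candidate_subgoals(paths, length=3, min_support=0.6)

q = defaultdict(float)
q_update(q, state=0, action="north", reward=1.0, next_state=1,
         actions=["north", "south"], alpha=0.5, gamma=0.9)
```

In this toy data only the window `b, c, d` appears in at least 60% of the paths, so `d` is the single proposed subgoal. When to introduce such subgoals during training is the question the abstract addresses with its iteration interval; the sketch stops at identifying candidates.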

Keywords

Hierarchical reinforcement learning, Q-learning, Performance, Subgoals

Fields of Science and Technology Classification

Computer and Information Sciences - Natural Sciences

Publication Identifiers

Ciência-IUL ID	ci-pub-42667
Scopus (source: ORCID)	2-s2.0-84887948781
Scopus (source: Ciência-IUL)	2-s2.0-84887948781
Other ID (source: External)	CV:666077966945
WoS (source: Ciência-IUL)	WOS:000345737600070
Scopus (source: External)	2-s2.0-84887948781
WoS (source: ORCID)	000345737600070
Other ID (source: ORCID)	cv-prod-id-642944
Handle (source: ORCID)	10071/5353
WoS (source: author)	000345737600070
WoS (source: External)	WOS:000345737600070
Handle (source: other)	10071/5353
Scopus (source: author)	2-s2.0-84887948781
Handle (source: Ciência-IUL)	http://hdl.handle.net/10071/27772

Other Publication Details

Online Publication Year	2013
Publisher	IEEE
Indexes	Web of Science®; Scopus; IEEE Xplore
ISSN	2166-0727 (print)
ISBN	978-989-98434-0-0 (online)
Article Number	6615769
Total Pages	6
Peer Reviewed	Yes
Dissemination Mean	Both (printed and digital)
Event Title	8th Iberian Conference on Information Systems and Technologies, CISTI 2013
Event Organizer	IEEE
City	Lisboa
Event Type	Conference
Event Classification	European
Event Year	2013
Event Publication Type	Full Paper
Publication Date (online)	2013-01-01
Publication Date (print)	2013-01-01
