Hierarchical reinforcement learning using path clustering

Paulo Gil; Luís Nunes

Publication in conference proceedings

Paulo Gil (Gil, P.); Luís Nunes (Nunes, L.);

2013 8th Iberian Conference on Information Systems and Technologies (CISTI)

Year (final publication)

2013

Language

English

Country

United States of America

Web of Science®

Citations: 2

(Last checked: 2024-05-05 09:36)

Scopus

Citations: 6

(Last checked: 2024-04-30 16:48)

Article Impact Index: 1.9

Google Scholar

Citations: 9

(Last checked: 2024-05-04 17:24)

Abstract

In this paper we study whether the performance of the Q-Learning algorithm can be improved by automatically finding subgoals and making better use of acquired knowledge. We explore a method that allows an agent to gather information about sequences of states that lead to a goal, detect classes of common sequences, and introduce the states at the end of these sequences as subgoals. We use the taxi problem (a standard benchmark in the Hierarchical Reinforcement Learning literature) and conclude that, even though the scale of this problem is relatively small, in most cases subgoals do improve learning speed, achieving relatively good results faster than standard Q-Learning. We propose a specific iteration interval as the most appropriate point at which to insert subgoals into the learning process. We also found that early adoption of subgoals may lead to suboptimal learning. Extending the method to more challenging problems is an interesting subject for future work.
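
The abstract does not spell out the clustering procedure, so the following Python sketch is only an illustration of the general idea, not the authors' method. It shows a standard tabular Q-Learning update alongside a simplified, frequency-based stand-in for path clustering: states that appear in many successful paths are proposed as subgoal candidates. The function names `q_update` and `propose_subgoals`, the parameters `alpha`, `gamma`, and `k`, and the toy state labels are all assumptions made for this example.

```python
from collections import Counter, defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """One tabular Q-Learning step:
    Q(s, a) += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))."""
    best_next = max(q[(next_state, a)] for a in actions)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
    return q[(state, action)]


def propose_subgoals(paths, goal, k=1):
    """Hypothetical stand-in for path clustering (an assumption, not the
    paper's algorithm): rank states by the number of successful paths that
    pass through them and return the k most common ones."""
    counts = Counter()
    for path in paths:
        # Count each state at most once per path and ignore the goal itself.
        counts.update(set(path) - {goal})
    return [state for state, _ in counts.most_common(k)]


# Toy usage: three successful paths from different starts share state "c".
paths = [["a", "c", "g"], ["b", "c", "g"], ["d", "c", "g"]]
subgoals = propose_subgoals(paths, goal="g")

# A single Q-Learning update on an initially empty (all-zero) table.
q = defaultdict(float)
new_value = q_update(q, "c", "right", 1.0, "g", ["left", "right"])
```

In this toy setting the shared state acts as a bottleneck on the way to the goal, which is the kind of state a subgoal-discovery method would be expected to surface. When to start using such subgoals during training is a separate question; the abstract reports that adopting them too early may lead to suboptimal learning.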

Keywords

Hierarchical reinforcement learning, Q-learning, Performance, Subgoals

Fields of Science and Technology Classification

Computer and Information Sciences - Natural Sciences

Publication Identifiers

Other ID (source: ORCID)	cv-prod-id-642944
Scopus (source: Ciência-IUL)	2-s2.0-84887948781
Handle (source: other)	10071/5353
WoS (source: External)	WOS:000345737600070
Scopus (source: External)	2-s2.0-84887948781
Handle (source: ORCID)	10071/5353
Ciência-IUL ID	ci-pub-42667
WoS (source: Ciência-IUL)	WOS:000345737600070
Scopus (source: author)	2-s2.0-84887948781
Scopus (source: ORCID)	2-s2.0-84887948781
Handle (source: Ciência-IUL)	http://hdl.handle.net/10071/27772
WoS (source: ORCID)	000345737600070
Other ID (source: External)	CV:666077966945
WoS (source: author)	000345737600070

Other Publication Details

Online Publication Year	2013
Publisher	IEEE
Indexing	Web of Science©; Scopus; IEEE Xplore
ISSN	2166-0727 (print)
ISBN	978-989-98434-0-0 (online)
Volume
Article Number	6615769
Pages	--	Total Pages	6
Peer Reviewed	Yes
Distribution Medium	Both (print and digital)
Editors
Event Title	8th Iberian Conference on Information Systems and Technologies, CISTI 2013
Event Organizer	IEEE
City	Lisbon
Event Type	Conference
Event Classification	European
Event Year	2013
Publication Type at Event	Full Paper
ISCTE-IUL Repository	Link to the repository
Publication Date (online)	2013-01-01
Publication Date (print)	2013-01-01
