Artigo em revista científica Q1
Learning by observation of agent software images
Paulo Costa (Costa, P.); Luís Botelho (Botelho, L.);
Título Revista
Journal of Artificial Intelligence Research
Ano (publicação definitiva)
2013
Língua
Inglês
País
Estados Unidos da América
Mais Informação
Web of Science®

N.º de citações: 2

(Última verificação: 2024-04-25 11:02)

Ver o registo na Web of Science®


: 0.1
Scopus

N.º de citações: 2

(Última verificação: 2024-04-22 22:56)

Ver o registo na Scopus


: 0.0
Google Scholar

N.º de citações: 4

(Última verificação: 2024-04-26 06:11)

Ver o registo no Google Scholar

Abstract/Resumo
Learning by observation can be of key importance whenever agents sharing similar features want to learn from each other. This paper presents an agent architecture that enables software agents to learn by direct observation of the actions executed by expert agents while they are performing a task. This is possible because the proposed architecture displays information that is essential for observation, making it possible for software agents to observe each other. The agent architecture supports a learning process that covers all aspects of learning by observation, such as discovering and observing experts, learning from the observed data, applying the acquired knowledge and evaluating the agent's progress. The evaluation provides control over the decision to obtain new knowledge or apply the acquired knowledge to new problems. We combine two methods for learning from the observed information. The first one, the recall method, uses the sequence on which the actions were observed to solve new problems. The second one, the classification method, categorizes the information in the observed data and determines to which set of categories the new problems belong. Results show that agents are able to learn in conditions where common supervised learning algorithms fail, such as when agents do not know the results of their actions a priori or when not all the effects of the actions are visible. The results also show that our approach provides better results than other learning methods since it requires shorter learning periods.
Agradecimentos/Acknowledgements
--
Palavras-chave
Learning through observation,Software Image
  • Ciências da Computação e da Informação - Ciências Naturais
Registos de financiamentos
Referência de financiamento Entidade Financiadora
SFRH/BD/44779/2008 Fundação para a Ciência e a Tecnologia
PEst-OE/EEI/LA0008/2013 Fundação para a Ciência e a Tecnologia