Publicação em atas de evento científico
Exploring APIs with N-gram language models
Gonçalo Prendi (Prendi, G.); Hugo Sousa (Sousa, H.); André Santos (Santos, A. L.); Ricardo Ribeiro (Ribeiro, R.);
INFORUM 2015: Atas do 7.o Simpósio Nacional de Informática
Ano (publicação definitiva)
2015
Língua
Inglês
País
Portugal
Mais Informação
Web of Science®

N.º de citações: 0

(Última verificação: 2026-04-08 22:35)

Ver o registo na Web of Science®

Scopus

Esta publicação não está indexada na Scopus

Google Scholar

N.º de citações: 1

(Última verificação: 2026-04-06 22:20)

Ver o registo no Google Scholar

Esta publicação não está indexada no Overton

Abstract/Resumo
Software development requires the use of external Application Programming Interfaces (APIs) in order to reuse libraries and frameworks. Programmers often struggle with unfamiliar APIs. Such difficulties often lead to an incorrect sequence of API calls that may not produce the desired outcome. Language models have shown the ability to capture regularities in text as well as in code. In this paper we explore the use of n-gram language models and their ability to capture regularities in APIs. We explored some of the most widely used APIs with the Java programming language, training several language models over hundreds of GitHub Java projects that use these APIs. The evaluation shows perplexity values for these language models that hint the possibility of using them to produce a tool to assist developers with code completion when using an unfamiliar API. On the one hand, such a tool may help developers to write correct API call sequences more efficiently; and, on the other hand, allows them to explore the features offered by the API.
Agradecimentos/Acknowledgements
--
Palavras-chave
APIs,Java,Perplexity,Source code mining,Code completion,N-gram language models