Ciência-IUL
Publications
Publication Detailed Description
10th Symposium on Languages, Applications and Technologies (SLATE 2021)
Year (definitive publication)
2021
Language
English
Country
--
More Information
Web of Science®
This publication is not indexed in Web of Science®
Scopus
Google Scholar
Abstract
This paper proposes a set of approaches for the semantic search of mobile applications, based on their name and on the unstructured textual information contained in their description. The proposed approaches make use of word-level, character-level, and contextual word-embeddings that have been trained or fine-tuned using a dataset of about 500 thousand mobile apps, collected in the scope of this work. The proposed approaches have been evaluated using a public dataset that includes information about 43 thousand applications, and 56 manually annotated non-exact queries. Our results show that both character-level embeddings trained on our data, and fine-tuned RoBERTa models surpass the performance of the other existing retrieval strategies reported in the literature.
Acknowledgements
--
Keywords
Semantic search,Word embeddings,Elasticsearch,Mobile applications
Fields of Science and Technology Classification
- Mathematics - Natural Sciences
- Social and Economic Geography - Social Sciences
Funding Records
Funding Reference | Funding Entity |
---|---|
39703 | PT2020 |
UIDB/50021/2020 | Fundação para a Ciência e a Tecnologia |
Related Projects
This publication is an output of the following project(s):