Ciência_Iscte
Publications
Publication Detailed Description
10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management
Year (definitive publication)
2018
Language
English
Country
Portugal
More Information
Web of Science®
This publication is not indexed in Web of Science®
Scopus
Google Scholar
This publication is not indexed in Overton
Abstract
Topic Modeling is a well-known unsupervised learning technique used when dealing with text data. It is used to discover latent patterns, called topics, in a collection of documents (corpus). This technique provides a convenient way to retrieve information from unclassified and unstructured text. Topic Modeling tasks have been performed for tracking events/topics/trends in different domains such as academic, public health, marketing, news, and so on. In this paper, we propose a framework for extracting topics from a large dataset of short messages, for brand interest tracking purposes. The framework consists training LDA topic models for each brand using time intervals, and then applying the model on aggregated documents. Additionally, we present a set of preprocessing tasks that helped to improve the topic models and the corresponding outputs. The experiments demonstrate that topic modeling can successfully track people’s discussions on Social Networks even in massive datasets, and ca pture those topics spiked by real-life events.
Acknowledgements
--
Keywords
Topic modeling,Topics evolution,LDA,Preprocessing,Brand interest
Fields of Science and Technology Classification
- Physical Sciences - Natural Sciences
Funding Records
| Funding Reference | Funding Entity |
|---|---|
| UID/CEC/50021/2013 | Fundação para a Ciência e a Tecnologia |
Português