Publication in conference proceedings
Rating prediction on yelp academic dataset using paragraph vectors
Rui Maia (Maia, R.); Joao C Ferreira or Joao Ferreira (Ferreira, J.); Ana Martins (Martins, A.);
Proceedings of 232nd The IIER International Conference
Year (definitive publication)
2019
Language
English
Country
Malaysia
More Information
Web of Science®

This publication is not indexed in Web of Science®

Scopus

This publication is not indexed in Scopus

Google Scholar

This publication is not indexed in Google Scholar

Abstract
This work studies the application of Paragraph Vectors to the Yelp Academic Dataset reviews in order to predict user ratings for different categories of businesses like auto repair, restaurants or veterinarians. Paragraph Vectors is a word embeddings techniques were each word or piece of text is converted to a continuous low dimensional space. Then, the opinion mining or sentiment analysis is observed as a classification task, where each user review is associated with a label the rating - and a probabilistic model is built with a logistic classifier. Following the intuition that the semantic information present in textual user reviews is generally more complex and complete than the numeric rating itself, this work applies Paragraph Vectors successfully toYelp dataset and evaluates its results.
Acknowledgements
--
Keywords
Prediction,Paragraph vectors,learning-to-rank,Dimension reduce
  • Computer and Information Sciences - Natural Sciences
Funding Records
Funding Reference Funding Entity
UID/MULTI/0446/2013 Fundação para a Ciência e a Tecnologia
UID/GES/00315/2013 Fundação para a Ciência e a Tecnologia