OATAO - Open Archive Toulouse Archive Ouverte

A study on LIWC categories for opinion mining in Spanish reviews

Salas-Zárate, María Del Pilar and López-López, Estanislao and Valencia-Garcia, Rafael and Aussenac-Gilles, Nathalie and Almela, Ángela and Alor-Hernández, Giner A study on LIWC categories for opinion mining in Spanish reviews. (2014) Journal of Information Science, 40 (6). 1-13. ISSN 0165-5515

(Document in English)

PDF (Author's version)
354kB

Official URL: http://dx.doi.org/10.1177/0165551514547842

Abstract

With the exponential growth of social media, i.e. blogs and social networks, organizations and individuals are increasingly using the reviews published on these media to make decisions about a product or service. Opinion mining detects whether the emotion of an opinion expressed by a user in natural language on Web platforms is positive or negative. This paper presents extensive experiments to study the effectiveness of classifying Spanish opinions into five categories (highly positive, highly negative, positive, negative and neutral) using combinations of the psychological and linguistic features of LIWC. LIWC is text analysis software that extracts different psychological and linguistic features from natural language text. Two corpora were used for this study, one about movies and one about technological products. Furthermore, we conducted a comparative assessment of the performance of three classification techniques, J48, SMO and BayesNet, using precision, recall and F-measure metrics. Overall, the findings revealed that the positive and negative categories provide better results than the other categories. Finally, experiments on both corpora indicated that SMO produces better results than the BayesNet and J48 algorithms, obtaining F-measures of 90.4% and 87.2% in the two domains.
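
As an illustration of the evaluation metrics named in the abstract, the following minimal sketch computes per-class precision, recall and F-measure for the five opinion categories. It is not the authors' pipeline: it does not run LIWC or any of the classifiers (J48, SMO, BayesNet), and the function name `per_class_scores` and the toy labels are assumptions made up for this example.

```python
from collections import Counter

# The five opinion categories listed in the abstract.
LABELS = ["highly positive", "positive", "neutral", "negative", "highly negative"]


def per_class_scores(gold, predicted, labels=LABELS):
    """Return {label: (precision, recall, f_measure)} for paired label lists."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1   # correct prediction for class g
        else:
            fp[p] += 1   # class p was predicted but is wrong
            fn[g] += 1   # class g was missed
    scores = {}
    for label in labels:
        predicted_total = tp[label] + fp[label]
        gold_total = tp[label] + fn[label]
        precision = tp[label] / predicted_total if predicted_total else 0.0
        recall = tp[label] / gold_total if gold_total else 0.0
        denom = precision + recall
        # F-measure is the harmonic mean of precision and recall.
        f_measure = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f_measure)
    return scores


if __name__ == "__main__":
    # Hypothetical gold and predicted labels, invented for illustration only.
    gold = ["positive", "positive", "negative", "neutral",
            "highly positive", "highly negative"]
    predicted = ["positive", "neutral", "negative", "neutral",
                 "positive", "highly negative"]
    for label, (p, r, f) in per_class_scores(gold, predicted).items():
        print(f"{label:16s} P={p:.3f} R={r:.3f} F={f:.3f}")
```

A class with no predictions or no gold instances is assigned 0.0 here instead of raising a division error; other conventions exist, and the scores reported in the paper may be averaged differently.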

Item Type: Article
Additional Information: Thanks to Journal of Information Science. The definitive version is available at http://jis.sagepub.com/content/40/6/749.
Audience (journal): International peer-reviewed journal
Institution: French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Other partners > Instituto Tecnologico de Orizaba (MEXICO)
Other partners > Universidad Católica San Antonio de Murcia - UCAM (SPAIN)
Other partners > Universidad de Murcia - UM (SPAIN)
Deposited By: IRIT IRIT
Deposited On: 18 Feb 2015 12:09
