TY - JOUR
T1 - Automatic Detection of Satire in Twitter: A psycholinguistic-based approach
AU - Salas-Zárate, María del Pilar
AU - Paredes-Valverde, Mario Andrés
AU - Rodriguez-Garcia, Miguel Angel
AU - Valencia-García, Rafael
AU - Alor-Hernández, Giner
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: This work has been supported by the Spanish Ministry of Economy and Competitiveness and the European Commission (FEDER / ERDF) through project KBS4FIA (TIN2016-76323-R).María del Pilar Salas-Zárate and Mario Andrés Paredes-Valverde are supported by the National Council of Science and Technology (CONACYT), the Secretariat of Public Education (SEP) and the Mexican government.This work was also supported by Tecnológico Nacional de Mexico (TecNM) and Secretariat of Public Education (SEP) through PRODEP (Programa para el Desarrollo Profesional Docente, in Spanish).
PY - 2017/4/24
Y1 - 2017/4/24
N2 - In recent years, a substantial effort has been made to develop sophisticated methods that can be used to detect figurative language, and more specifically, irony and sarcasm. There is, however, an absence of new approaches and research works that analyze satirical texts. The recognition of satire by sentiment analysis and Natural Language Processing (NLP) applications is extremely important because it can influence and change the meaning of a statement in varied and complex ways. We used this understanding as a basis to propose a method that employs a wide variety of psycholinguistic features and which detects satirical and non-satirical text. We then went on to train a set of machine learning algorithms that would enable us to classify unknown data. Finally, we conducted several experiments in order to detect the most relevant features that generate a better pattern as regards detecting satirical texts. We evaluated the effectiveness of our method by obtaining a corpus of satirical and non-satirical news from Mexican and Spanish twitter accounts. Our proposal obtained encouraging results, with an F-measure of 85.5% for Mexico and one of 84.0% for Spain. Moreover, the results of the experiment showed that there is no significant difference between Mexican and Spanish satire.
AB - In recent years, a substantial effort has been made to develop sophisticated methods that can be used to detect figurative language, and more specifically, irony and sarcasm. There is, however, an absence of new approaches and research works that analyze satirical texts. The recognition of satire by sentiment analysis and Natural Language Processing (NLP) applications is extremely important because it can influence and change the meaning of a statement in varied and complex ways. We used this understanding as a basis to propose a method that employs a wide variety of psycholinguistic features and which detects satirical and non-satirical text. We then went on to train a set of machine learning algorithms that would enable us to classify unknown data. Finally, we conducted several experiments in order to detect the most relevant features that generate a better pattern as regards detecting satirical texts. We evaluated the effectiveness of our method by obtaining a corpus of satirical and non-satirical news from Mexican and Spanish twitter accounts. Our proposal obtained encouraging results, with an F-measure of 85.5% for Mexico and one of 84.0% for Spain. Moreover, the results of the experiment showed that there is no significant difference between Mexican and Spanish satire.
UR - http://hdl.handle.net/10754/623288
UR - http://www.sciencedirect.com/science/article/pii/S0950705117301855
UR - http://www.scopus.com/inward/record.url?scp=85018980981&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2017.04.009
DO - 10.1016/j.knosys.2017.04.009
M3 - Article
SN - 0950-7051
VL - 128
SP - 20
EP - 33
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
ER -