A Mood Analysis on Youtube Comments and a Method for Improved Social Spam Detection

Ezpeleta, Enaitz; Iturbe, Mikel; Garitano, Iñaki; Velez de Mendizabal, Iñaki; Zurutuza, Urko

Version

Postprint

Rights

© Springer International Publishing AG, part of Springer Nature 2018. This is a post-peer-review, pre-copyedit version of an article published in Hybrid Artificial Intelligent Systems. HAIS 2018. Lecture Notes in Computer Science, vol 10870. The final authenticated version is available online at: https://doi.org/10.1007/978-3-319-92639-1_43

Access

Embargoed access

Publisher

Springer

Keywords

spam
social spam
mood analysis
online social networks
Youtube

Abstract

As the use of Online Social Networks (OSN) increases, non-legitimate campaigns over these types of web services are growing as well. As a result, a significant number of users are affected by social spam every day, and their privacy is threatened. To deal with this issue, this study focuses on mood analysis among the available content-based analysis techniques. We demonstrate that using this technique improves social spam filtering results. First, the best spam filtering classifiers are identified using a labeled dataset of Youtube comments that includes spam. Then, a new dataset is created by adding the mood feature to each comment, and the best classifiers are applied to it. A comparison of the results obtained with and without mood information shows that this feature can help to improve social spam filtering: the best accuracy is improved in two different datasets, and the number of false positives is reduced by 13.76% and 11.41% on average. Moreover, the results are validated by carrying out the same experiment on a different dataset.
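
The two steps described in the abstract, adding a mood feature to each comment and comparing accuracy and false positives, might be sketched roughly as follows. This is only an illustration under stated assumptions: the toy lexicon `MOOD_LEXICON`, the helper names `mood_of`, `add_mood_feature` and `evaluate`, and the `"ham"`/`"spam"` label strings are all hypothetical and are not taken from the paper, whose actual mood analyzer and classifiers are not described in this record.

```python
from collections import Counter

# Hypothetical toy lexicon mapping words to moods (an assumption for
# illustration; not the mood analyzer used by the authors).
MOOD_LEXICON = {
    "love": "happy", "great": "happy", "awesome": "happy",
    "hate": "angry", "worst": "angry",
    "sad": "sad", "miss": "sad",
}


def mood_of(comment):
    """Return the most frequent lexicon mood in the comment, or 'neutral'."""
    words = comment.lower().split()
    counts = Counter(MOOD_LEXICON[w] for w in words if w in MOOD_LEXICON)
    return counts.most_common(1)[0][0] if counts else "neutral"


def add_mood_feature(dataset):
    """Build a new dataset in which every record carries an extra 'mood' field.

    The input records are left unmodified.
    """
    return [{**row, "mood": mood_of(row["text"])} for row in dataset]


def evaluate(y_true, y_pred):
    """Return (accuracy, false positives).

    A false positive is a legitimate comment ('ham') predicted as 'spam'.
    """
    pairs = list(zip(y_true, y_pred))
    correct = sum(t == p for t, p in pairs)
    false_pos = sum(t == "ham" and p == "spam" for t, p in pairs)
    return correct / len(pairs), false_pos
```

In such a setup, the same classifier would be trained once on the original records and once on the output of `add_mood_feature`, and `evaluate` would be called on both sets of predictions to compare accuracy and false-positive counts.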

Sponsorship

Gobierno de España

Project ID

GE/Programa Estatal de Investigacion, Desarrollo e Innovación orientada a los retos de la sociedad en el marco del Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016, convocatoria del 2017/TIN2017-84658-C2-2-R/Integración de Conocimiento Semántico para el Filtrado de Spam basado en Contenido/SKI4SPAM