Title
A Mood Analysis on Youtube Comments and a Method for Improved Social Spam DetectionVersion
http://purl.org/coar/version/c_ab4af688f83e57aa
Rights
© Springer International Publishing AG, part of Springer Nature 2018. This is a post-peer-review, pre-copyedit version of an article published in Hybrid Artificial Intelligent Systems. HAIS 2018. Lecture Notes in Computer Science, vol 10870. The final authenticated version is available online at: https://doi.org/10.1007/978-3-319-92639-1_43Access
http://purl.org/coar/access_right/c_f1cfPublisher’s version
https://doi.org/10.1007/978-3-319-92639-1_43Published at
Hybrid Artificial Intelligent Systems (HAIS 2018). Pp. 514-525. Lecture Notes in Computer Science Vol.10870. Springer,Publisher
SpringerKeywords
spam
social spam
mood analysis
online social networks ... [+]
social spam
mood analysis
online social networks ... [+]
spam
social spam
mood analysis
online social networks
Youtube [-]
social spam
mood analysis
online social networks
Youtube [-]
Abstract
In the same manner that Online Social Networks (OSN) usage increases, non-legitimate campaigns over these types of web services are growing. This is the reason why signi cant number of users are affec ... [+]
In the same manner that Online Social Networks (OSN) usage increases, non-legitimate campaigns over these types of web services are growing. This is the reason why signi cant number of users are affected by social spam every day and therefore, their privacy is threatened. To deal with this issue in this study we focus on mood analysis, among all content-based analysis techniques. We demonstrate that using this technique social spam filtering results are improved. First, the best spam filtering classifiers are identified using a labeled dataset consisting of Youtube comments, including spam. Then, a new dataset is created adding the mood feature to each comment, and the best classifiers are applied to it. A comparison between obtained results with and without mood information shows that this feature can help to improve social spam filtering results: the best accuracy is improved in two different datasets, and the number of false positives is reduced 13.76% and 11.41% on average. Moreover, the results are validated carrying out the same experiment but using a different dataset. [-]