Título
Deobfuscating leetspeak with deep learning to improve spam filteringOtras instituciones
Instituto Universitário de Lisboa (Iscte)Universidade de Vigo
Instituto de Investigación Sanitaria Galicia Sur (IISGS)
Versión
Version publicada
Derechos
© 2023 UNIRAcceso
Acceso abiertoVersión del editor
https://doi.org/10.9781/ijimai.2023.07.003Publicado en
International Journal of Interactive Multimedia and Artificial Intelligence Vol. 8. N. 4. Pp. 46-55Editor
UNIR - Universidad Internacional de La RiojaPalabras clave
Convolutional Neural Networks
Deep Learning
Leetspeak
ODS 9 Industria, innovación e infraestructura ... [+]
Deep Learning
Leetspeak
ODS 9 Industria, innovación e infraestructura ... [+]
Convolutional Neural Networks
Deep Learning
Leetspeak
ODS 9 Industria, innovación e infraestructura
Spam Filtering
Text Deobfuscation [-]
Deep Learning
Leetspeak
ODS 9 Industria, innovación e infraestructura
Spam Filtering
Text Deobfuscation [-]
Resumen
The evolution of anti-spam filters has forced spammers to make greater efforts to bypass filters in order to distribute content over networks. The distribution of content encoded in images or the use ... [+]
The evolution of anti-spam filters has forced spammers to make greater efforts to bypass filters in order to distribute content over networks. The distribution of content encoded in images or the use of Leetspeak are concrete and clear examples of techniques currently used to bypass filters. Despite the importance of dealing with these problems, the number of studies to solve them is quite small, and the reported performance is very limited. This study reviews the work done so far (very rudimentary) for Leetspeak deobfuscation and proposes a new technique based on using neural networks for decoding purposes. In addition, we distribute an image database specifically created for training Leetspeak decoding models. We have also created and made available four different corpora to analyse the performance of Leetspeak decoding schemes. Using these corpora, we have experimentally evaluated our neural network approach for decoding Leetspeak. The results obtained have shown the usefulness of the proposed model for addressing the deobfuscation of Leetspeak character sequences. [-]
Financiador
Eusko Jaurlaritza = Gobierno VascoGobierno de España
Programa
Ikertalde Convocatoria 2022-2025Programa Estatal de Investigación, Desarrollo e Innovación orientada a los retos de la sociedad en el marco del Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016, convocatoria del 2017
Número
IT1676-22TIN2017-84658-C2-2-R
URI de la ayuda
Sin informaciónSin información
Proyecto
Grupo de sistemas inteligentes para sistemas industrialesIntegración de Conocimiento Semántico para el Filtrado de Spam basado en Contenido (SKI4SPAM)
Colecciones
- Artículos - Ingeniería [684]
El ítem tiene asociados los siguientes ficheros de licencia: