dc.contributor.author | Angelopoulou, Olga | |
dc.contributor.author | Y, Shao | |
dc.contributor.author | Marcello, Trovati | |
dc.contributor.author | Q, Shi | |
dc.contributor.author | E, Asimakopoulou | |
dc.contributor.author | Nik, Bessis | |
dc.date.accessioned | 2017-06-12T10:53:02Z | |
dc.date.available | 2017-06-12T10:53:02Z | |
dc.date.issued | 2017-01-01 | |
dc.identifier.citation | Angelopoulou , O , Y , S , Marcello , T , Q , S , E , A & Nik , B 2017 , ' A Hybrid Spam Detection Method Based on Unstructured Datasets ' , Soft Computing , vol. 21 , no. 1 , pp. 233-243 . https://doi.org/10.1007/s00500-015-1959-z | |
dc.identifier.issn | 1432-7643 | |
dc.identifier.uri | http://hdl.handle.net/2299/18317 | |
dc.description | This document is the accepted manuscript version of the following article: Shao, Y., Trovati, M., Shi, Q. et al. Soft Comput (2017) 21: 233. The final publication is available at Springer via http://dx.doi.org/10.1007/s00500-015-1959-z. © Springer-Verlag Berlin Heidelberg 2015. | |
dc.description.abstract | The identification of non-genuine or malicious messages poses a variety of challenges due to the continuous changes in the techniques utilised by cyber-criminals. In this article, we propose a hybrid detection method based on a combination of image and text spam recognition techniques. In particular, the former is based on sparse representation-based classification, which focuses on the global and local image features, and a dictionary learning technique to achieve a spam and a ham sub-dictionary. On the other hand, the textual analysis is based on semantic properties of documents to assess the level of maliciousness. More specifically, we are able to distinguish between meta-spam and real spam. Experimental results show the accuracy and potential of our approach. | en |
dc.format.extent | 11 | |
dc.format.extent | 1060415 | |
dc.language.iso | eng | |
dc.relation.ispartof | Soft Computing | |
dc.subject | Image spam, Text spam, Semantic networks, Classication, Subclass Discriminant Analysis, Feature Selection, Sparse Representation | |
dc.title | A Hybrid Spam Detection Method Based on Unstructured Datasets | en |
dc.contributor.institution | School of Computer Science | |
dc.contributor.institution | Centre for Computer Science and Informatics Research | |
dc.description.status | Peer reviewed | |
dc.date.embargoedUntil | 2018-01-01 | |
rioxxterms.versionofrecord | 10.1007/s00500-015-1959-z | |
rioxxterms.type | Journal Article/Review | |
herts.preservation.rarelyaccessed | true | |