Show simple item record

dc.contributor.authorFazlali, Mahmood
dc.contributor.authorMirhosseini, Mina
dc.contributor.authorShahsavari, Mahyar
dc.contributor.authorShafarenko, Alex
dc.contributor.authorMashinchi, Mashaallah
dc.date.accessioned2024-03-28T17:15:01Z
dc.date.available2024-03-28T17:15:01Z
dc.date.issued2024-03-04
dc.identifier.citationFazlali , M , Mirhosseini , M , Shahsavari , M , Shafarenko , A & Mashinchi , M 2024 , GPU-based Parallel Technique for Solving the N-Similarity Problem in Textual Data Mining . in 2024 Third International Conference on Distributed Computing and High Performance Computing (DCHPC) . Institute of Electrical and Electronics Engineers (IEEE) , pp. 1-6 , 2024 Third International Conference on Distributed Computing and High Performance Computing (DCHPC) , Tehran , Iran, Islamic Republic of , 14/04/24 . https://doi.org/10.1109/DCHPC60845.2024.10454074
dc.identifier.citationconference
dc.identifier.isbn979-8-3503-8158-0
dc.identifier.otherORCID: /0000-0002-1701-5562/work/156578386
dc.identifier.urihttp://hdl.handle.net/2299/27685
dc.description© 2024 IEEE. This is the accepted manuscript version of an article which has been published in final form at https://doi.org/10.1109/DCHPC60845.2024.10454074
dc.description.abstractAn important issue in data mining and information retrieval is the problem of multiple similarity or n-similarity. This problem entails finding a group of n data points with the highest similarity within a large dataset. Exact methods to solve this problem exist but come with high time and space complexities. Additionally, various metaheuristic algorithms have been proposed, including genetic algorithms, gravitational search algorithms, particle swarm optimization, imperialist competitive algorithms, and fuzzy imperialist competitive algorithms. These metaheuristics are capable of finding near-optimal solutions within a reasonable timeframe, although there is no guarantee of achieving exact results. In this paper, we employ a parallelization technique using CUDA to expedite the exact method. We conduct experiments on textual datasets to identify a group of n textual documents with the highest similarity to each other. The experimental results demonstrate that the proposed parallel exact method significantly reduces execution time compared to the best sequential approach and CPU multi-core implementation. Furthermore, it is evident that the proposed method requires less memory space than the exact method.en
dc.format.extent6
dc.format.extent1115797
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.relation.ispartof2024 Third International Conference on Distributed Computing and High Performance Computing (DCHPC)
dc.subjectmultiple similarity
dc.subjectn-similarity
dc.subjectparallel programming
dc.subjecttext document similarity
dc.subjectArtificial Intelligence
dc.subjectDecision Sciences (miscellaneous)
dc.subjectControl and Optimization
dc.subjectSafety, Risk, Reliability and Quality
dc.subjectComputer Networks and Communications
dc.subjectModelling and Simulation
dc.titleGPU-based Parallel Technique for Solving the N-Similarity Problem in Textual Data Miningen
dc.contributor.institutionDepartment of Computer Science
dc.contributor.institutionSchool of Physics, Engineering & Computer Science
dc.contributor.institutionCentre for AI and Robotics Research
dc.contributor.institutionCybersecurity and Computing Systems
dc.contributor.institutionNetworks and Security Research Centre
dc.date.embargoedUntil2026-02-04
dc.identifier.urlhttp://www.scopus.com/inward/record.url?scp=85187778299&partnerID=8YFLogxK
rioxxterms.versionofrecord10.1109/DCHPC60845.2024.10454074
rioxxterms.typeOther
herts.preservation.rarelyaccessedtrue


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record