Szczegóły publikacji

Opis bibliograficzny

A method for fast selection of machine-learning classifiers for spam filtering / Sylwia Rapacz, Piotr CHOŁDA, Marek NATKANIEC // Electronics [Dokument elektroniczny]. — Czasopismo elektroniczne ; ISSN 2079-9292. — 2021 — vol. 10 iss. 17 art. no. 2083, s. 1–23. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 22–23, Abstr. — Publikacja dostępna online od: 2021-08-27

Autorzy (3)

Słowa kluczowe

ham classifiers e-mail SPAM machine learning

Dane bibliometryczne

ID BaDAP	136097
Data dodania do BaDAP	2021-09-21
Tekst źródłowy	URL
DOI	10.3390/electronics10172083
Rok publikacji	2021
Typ publikacji	artykuł w czasopiśmie
Otwarty dostęp
Creative Commons
Czasopismo/seria	Electronics

Abstract

The paper elaborates on how text analysis influences classification—a key part of the spam-filtering process. The authors propose a multistage meta-algorithm for checking classifier performance. As a result, the algorithm allows for the fast selection of the best-performing classifiers as well as for the analysis of higher-dimensionality data. The last aspect is especially important when analyzing large datasets. The approach of cross-validation between different datasets for supervised learning is applied in the meta-algorithm. Three machine-learning methods allowing a user to classify e-mails as desirable (ham) or potentially harmful (spam) messages were compared in the paper to illustrate the operation of the meta-algorithm. The used methods are simple, but as the results showed, they are powerful enough. We use the following classifiers: k-nearest neighbours (k-NNs), support vector machines (SVM), and the naïve Bayes classifier (NB). The conducted research gave us the conclusion that multinomial naïve Bayes classifier can be an excellent weapon in the fight against the constantly increasing amount of spam messages. It was also confirmed that the proposed solution gives very accurate results.

Publikacje, które mogą Cię zainteresować

artykuł

#129520Data dodania: 21.7.2020

Comparison of machine learning classifiers for accurate prediction of real-time stuck pipe incidents / Javed Akbar Khan, Muhammad Irfan, Sonny Irawan, Fong Kam Yao, Md Shokor Abdul Rahaman, Ahmad Radzi Shahari, Adam GŁOWACZ, Nazia Zeb // Energies [Dokument elektroniczny]. — Czasopismo elektroniczne ; ISSN 1996-1073. — 2020 — vol. 13 iss. 14 art. no. 3683, s. 1-26. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 24-26, Abstr. — Publikacja dostępna online od: 2020-07-17

Szczegóły

artykuł

#132142Data dodania: 16.1.2021

Empirical assessment of machine learning techniques for software requirements risk prediction / Rashid Naseem, Zain Shaukat, Muhammad Irfan, Muhammad Arif Shah, Arshad Ahmad, Fazal Muhammad, Adam GŁOWACZ, Larisa Dunai, Jose Antonino-Daviu, Adel Sulaiman // Electronics [Dokument elektroniczny]. — Czasopismo elektroniczne ; ISSN 2079-9292. — 2021 — vol. 10 iss. 2 art. no. 168, s. 1–19. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 17–19, Abstr. — Publikacja dostępna online od: 2021-01-14

Szczegóły