Szczegóły publikacji

Opis bibliograficzny

Text annotation automation for hate speech detection using SVM-classifier based on feature extraction / Shoffan SAIFULLAH, Nur Heri Cahyana, Yuli Fauziah, Agus Sasmito Aribowo, Felix Andika DWIYANTO, Rafał DREŻEWSKI // W: ICSSET 2022 [Dokument elektroniczny] : 2nd International Conference Series on Science, Engineering, and Technology : 22 June 2022, Sidoarjo, Indonesia. — Wersja do Windows. — Dane tekstowe. — [Indonesia] : AIP Publishing, [2024]. — (AIP Conference Proceedings ; ISSN 0094-243X ; vol. 3167). — e-ISBN: 978-0-7354-5005-9. — S. 040003-1–040003-7. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 040003-5–040003-7, Abstr. — S. Saifullah - dod. afiliacja: University of Pembangunan Nasional Veteran Yogyakarta, Yogyakarta, Indonesia

Autorzy (6)

AGHSaifullah Shoffan
Cahyana Nur Heri
Fauziah Yuli
Aribowo Agus Sasmito
AGHDwiyanto Felix Andika
AGHDreżewski Rafał

Słowa kluczowe

text annotation text mining machine learning hate speech detection

Dane bibliometryczne

ID BaDAP	154726
Data dodania do BaDAP	2024-08-06
Tekst źródłowy	URL
DOI	10.1063/5.0218034
Rok publikacji	2024
Typ publikacji	materiały konferencyjne (aut.)
Otwarty dostęp
Wydawca	American Institute of Physics
Czasopismo/seria	AIP Conference Proceedings

Abstract

This article aims to develop a semi-supervised method for automatically annotating hate speech in social media using natural language processing (NLP) techniques. The approach is based on a Support Vector Machine (SVM) classifier that combines feature extraction algorithms, including ensemble meta-learners and meta-vectorizers. The system was trained on a dataset of 13,169 elements, and the results show that the accuracy of the model is highly dependent on the feature extraction method used. The optimal automatic annotation was achieved using TF-IDF feature extraction, resulting in an accuracy of 92.5%. The implications of this study are that automated hate speech annotation using NLP techniques can significantly improve the accuracy, reliability, and inclusiveness of identifying hate speech online. The results of this study suggest that SVM and TF-IDF are the most suitable methods for this task.

Publikacje, które mogą Cię zainteresować

artykuł

#151683Data dodania: 30.1.2024

Automated text annotation using a semi-supervised approach with meta vectorizer and machine learning algorithms for hate speech detection / Shoffan SAIFULLAH, Rafał DREŻEWSKI, Felix Andika DWIYANTO, Agus Sasmito Aribowo, Yuli Fauziah, Nur Heri Cahyana // Applied Sciences (Basel) [Dokument elektroniczny]. — Czasopismo elektroniczne ; ISSN 2076-3417. — 2024 — vol. 14 iss. 3 art. no. 1078, s. 1–19. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 17–19, Abstr. — Publikacja dostępna online od: 2024-01-26. — S. Saifullah - dod. afiliacja: Department of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta, Indonesia. — R. Dreżewski - dod. afiliacja: Artificial Intelligence Research Group (AIRG), Informatics Department, Faculty of Industrial Technology, Universitas Ahmad Dahlan, Indonesia. — F. A. Dwiyanto - dod. afiliacja: Department of Electrical Engineering, Universitas Negeri Malang, Malang, Indonesia

Szczegóły

artykuł

#143653Data dodania: 21.11.2022

Semi-supervised text annotation for hate speech detection using K-nearest neighbors and term frequency-inverse document frequency / Nur Heri Cahyana, Shoffan SAIFULLAH, Yuli Fauziah, Agus Sasmito Aribowo, Rafał DREŻEWSKI // International Journal of Advanced Computer Science and Applications (IJACSA) ; ISSN 2158-107X . — 2022 — vol. 13 no. 10, s. 147-151. — Bibliogr. s. 150-151, Abstr. — S. Saifullah - dod. afiliacja: Department of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta Yogyakarta, Indonesia

Szczegóły