Szczegóły publikacji
Opis bibliograficzny
Feature selection for regression tasks base on explainable artificial intelligence procedures / Piotr A. KOWALSKI, Maciej Walczak // W: IJCNN 2023 [Dokument elektroniczny] : International Joint Conference on Neural Networks : 18–23 June 2023, Queensland, Australia : conference proceedings. — Wersja do Windows. — Dane tekstowe. — Piscataway : IEEE, cop. 2023. — (Proceedings of ... International Joint Conference on Neural Networks ; ISSN 2161-4393). — e-ISBN: 978-1-6654-8867-9. — S. [1–8]. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. [8], Abstr. — P. A. Kowalski - dod. afiliacja: Systems Research Institute Polish Academy of Sciences, Warsaw, Poland
Autorzy (2)
Słowa kluczowe
Dane bibliometryczne
| ID BaDAP | 150998 |
|---|---|
| Data dodania do BaDAP | 2024-01-16 |
| Tekst źródłowy | URL |
| DOI | 10.1109/IJCNN54540.2023.10191064 |
| Rok publikacji | 2023 |
| Typ publikacji | materiały konferencyjne (aut.) |
| Otwarty dostęp | |
| Wydawca | Institute of Electrical and Electronics Engineers (IEEE) |
| Konferencja | IEEE International Joint Conference on Neural Networks 2023 |
| Czasopismo/seria | Proceedings of ... International Joint Conference on Neural Networks |
Abstract
In modern data analysis and data mining, there are many applicable regression procedures, however, the most frequently used ones include classical linear regression, decision trees, random forest model, support vector method and artificial neural networks. Most of these algorithms are powerful approximation tools that can model the complex relationships and patterns in data. One of the reasons for redundant structure within such procedures is the use of a too-large feature vector for a given task. This paper aimed to investigate and compare the selected methods of significance analysis and then reduce the size of the feature vector through supervised learning models. Based on the obtained results, it seems that the Shap analysis and the Sobol global sensitivity analysis method are the most reliable methods for determining the significance of feature vector elements and thus for reducing the feature vector dimension in regression problems.