Szczegóły publikacji

Opis bibliograficzny

Comparison of SVM and ontology-based text classification methods / Krzysztof WRÓBEL, Maciej WIELGOSZ, Aleksander SMYWIŃSKI-POHL, Marcin PIETROŃ // W: Artificial intelligence and soft computing : 15th international conference, ICAISC 2016 : Zakopane, Poland, June 12–16, 2016 : proceedings, Pt. 1 / eds. Leszek Rutkowski, [et al.]. — Switzerland : Springer International Publishing, cop. 2016. — (Lecture Notes in Artificial Intelligence ; ISSN 0302-9743 ; 9692). — ISBN: 978-3-319-39377-3; e-ISBN: 978-3-319-39378-0. — S. 667–680. — Bibliogr. s. 679–680, Abstr. — Toż na Dysku Flash. — K. Wróbel - afiliacja: Uniwersytet Jagiellońki, A. Smywiński-Pohl – dod. afiliacja: Jagiellonian University


Autorzy (4)


Słowa kluczowe

Vector Space ModelWikipediatext classificationsupport vector machineontology-based methods

Dane bibliometryczne

ID BaDAP98359
Data dodania do BaDAP2016-06-20
DOI10.1007/978-3-319-39378-0_57
Rok publikacji2016
Typ publikacjimateriały konferencyjne (aut.)
Otwarty dostęptak
Konferencja15th International Conference on Artificial Intelligence and Soft Computing
Czasopismo/seriaLecture Notes in Computer Science

Abstract

This work addresses the challenging task of text categorization. The main goal is the comparison of two different approaches, i.e. Vector Space Model and ontology-based solutions. The authors compare and contrast them with respect to accuracy and processing flow, which affect the classification results. The ontology-based method outperforms its counter-part when it comes to category resolution, i.e. the number of categories which can be processed. On the other hand, the SVM-based module is much faster and performs well when trained on an appropriately-structured learning set. The authors performed a series of tests to compare the methods and, as expected, the ontology-based solution outperformed the SVM classifier. It reached a micro averaged F1-score of 0.90 with 2.8 million Wikipedia articles, whereas the SVM-based module did not exceed 0.86 with the same data set. The macro averaged F1-score of both solutions was inferior to the micro one and reached values of 0.75 and 0.57, for ontology and SVM-based solutions respectively. © Springer International Publishing Switzerland 2016.

Publikacje, które mogą Cię zainteresować

fragment książki
Authorship attribution of Polish newspaper articles / Marcin KUTA, Bartłomiej Puto, Jacek KITOWSKI // W: Artificial intelligence and soft computing : 15th international conference, ICAISC 2016 : Zakopane, Poland, June 12–16, 2016 : proceedings, Pt. 2 / eds. Leszek Rutkowski, [et al.]. — Switzerland : Springer International Publishing, cop. 2016. — (Lecture Notes in Artificial Intelligence ; ISSN 0302-9743 ; 9693). — ISBN: 978-3-319-39383-4; e-ISBN: 978-3-319-39384-1. — S. 474–483. — Bibliogr. s. 483, Abstr. — Toż na Dysku Flash
fragment książki
Text mining with hybrid biclustering algorithms / Patryk ORZECHOWSKI, Krzysztof BORYCZKO // W: Artificial intelligence and soft computing : 15th international conference, ICAISC 2016 : Zakopane, Poland, June 12–16, 2016 : proceedings, Pt. 2 / eds. Leszek Rutkowski, [et al.]. — Switzerland : Springer International Publishing, cop. 2016. — (Lecture Notes in Artificial Intelligence ; ISSN 0302-9743 ; 9693). — ISBN: 978-3-319-39383-4; e-ISBN: 978-3-319-39384-1. — S. 102–113. — Bibliogr. s. 111–113, Abstr. — Toż na Dysku Flash