Szczegóły publikacji

Opis bibliograficzny

Towards textual data augmentation for neural networks: synonyms and maximum loss / Michał JUNGIEWICZ, Aleksander SMYWIŃSKI-POHL // Computer Science ; ISSN 1508-2806. — 2019 — vol. 20 no. 1, s. 57–83. — Bibliogr. s. 79–83, Abstr.

Autorzy (2)

Słowa kluczowe

sentence classification deep learning natural language processing neural networks data augmentation

Dane bibliometryczne

ID BaDAP	122464
Data dodania do BaDAP	2019-06-28
Tekst źródłowy	URL
DOI	10.7494/csci.2019.20.1.3023
Rok publikacji	2019
Typ publikacji	artykuł w czasopiśmie
Otwarty dostęp
Creative Commons
Czasopismo/seria	Computer Science

Abstract

Data augmentation is one of the ways to deal with labeled data scarcity and overfitting. Both of these problems are crucial for modern deep-learning algo- rithms, which require massive amounts of data. The problem is better explored in the context of image analysis than for text; this work is a step forward to help close this gap. We propose a method for augmenting textual data when training convolutional neural networks for sentence classification. The aug- mentation is based on the substitution of words using a thesaurus as well as Princeton University's WordNet. Our method improves upon the baseline in most of the cases. In terms of accuracy, the best of the variants is 1.2% (pp.) better than the baseline. © 2019 AGH University of Science and Technology Press.

Publikacje, które mogą Cię zainteresować

fragment książki

#130729Data dodania: 20.10.2020

Data augmentation for sentiment analysis in English – the online approach / Michał JUNGIEWICZ, Aleksander SMYWIŃSKI-POHL // W: Artificial neural networks and machine learning - ICANN 2020 : 29th International Conference on Artificial Neural Networks : Bratislava, Slovakia, September 15–18, 2020 : proceedings, Pt. 2 / eds. Igor Farkaš, Paolo Masulli, Stefan Wermter. — Cham : Springer Nature Switzerland, cop. 2020. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 12397. Theoretical Computer Science and General Issues ; ISSN 0302-9743). — ISBN: 978-3-030-61615-1; e-ISBN: 978-3-030-61616-8. — S. 584–595. — Bibliogr., Abstr. — Publikacja dostępna online od: 2020-10-14

Szczegóły

fragment książki

#163080Data dodania: 30.9.2025

AttentionMix: a guided text data augmentation method relying on attention / Dominik Lewy, Jacek MAŃDZIUK // W: Neural Information Processing : 31st International Conference, ICONIP 2024 : Auckland, New Zealand, December 2–6, 2024 : proceedings , Pt. 9 / eds. Mufti Mahmud, [et al.]. — Singapore: Springer Nature, cop. 2025. — ( Lecture Notes in Computer Science ; ISSN 0302-9743 ; vol. 15294 ). — ISBN: 978-981-96-6601-0; e-ISBN: 978-981-96-6599-0. — S. 409–423. — Bibliogr., Abstr. — Publikacja dostępna online od: 2025-07-02. — J. Mańdziuk - dod. afiliacja: Warsaw University of Technology, Warsaw

Szczegóły