Szczegóły publikacji

Opis bibliograficzny

GPU-embedding of kNN-graph representing large and high-dimensional data / Bartosz MINCH, Mateusz Nowak, Rafał WCISŁO, Witold DZWINEL // W: Computational Science - ICCS 2020 : 20th International Conference : Amsterdam, The Netherlands, June 3–5, 2020 : proceedings, Pt. 2 / eds. Valeria V. Krzhizhanovskaya, [et al.]. — Cham : Springer Nature Switzerland, cop. 2020. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 12138. Theoretical Computer Science and General Issues ; ISSN 0302-9743). — ISBN: 978-3-030-50416-8; e-ISBN:  978-3-030-50417-5. — S. 322–336. — Bibliogr. s. 335–336, Abstr. — Publikacja dostępna online od: 2020-06-15

Autorzy (4)

Słowa kluczowe

kNN graph visualizationhigh dimensional datadata embeddingGPU implementation

Dane bibliometryczne

ID BaDAP129150
Data dodania do BaDAP2020-06-25
Tekst źródłowyURL
DOI10.1007/978-3-030-50417-5_24
Rok publikacji2020
Typ publikacjimateriały konferencyjne (aut.)
Otwarty dostęptak
WydawcaSpringer
KonferencjaInternational Conference on Computational Science 2020
Czasopisma/serieLecture Notes in Computer Science, Theoretical Computer Science and General Issues

Abstract

Interactive visual exploration of large and multidimensional data still needs more efficient ND→2D data embedding (DE) algorithms. We claim that the visualization of very high-dimensional data is equivalent to the problem of 2D embedding of undirected kNN-graphs. We demonstrate that high quality embeddings can be produced with minimal time&memory complexity. A very efficient GPU version of IVHD (interactive visualization of high-dimensional data) algorithm is presented, and we compare it to the state-of-the-art GPU-implemented DE methods: BH-SNE-CUDA and AtSNE-CUDA. We show that memory and time requirements for IVHD-CUDA are radically lower than those for the baseline codes. For example, IVHD-CUDA is almost 30 times faster in embedding (without the procedure of kNN graph generation, which is the same for all the methods) of the largest ( M=1.4⋅106 ) YAHOO dataset than AtSNE-CUDA. We conclude that in the expense of minor deterioration of embedding quality, compared to the baseline algorithms, IVHD well preserves the main structural properties of ND data in 2D for radically lower computational budget. Thus, our method can be a good candidate for a truly big data ( M=108+ ) interactive visualization.

Publikacje, które mogą Cię zainteresować

fragment książki
#147733Data dodania: 20.7.2023
What will happen when we radically simplify t-SNE and UMAP visualization algorithms? : is it worth doing so? / Bartosz MINCH, Radosław ŁAZARZ, Witold DZWINEL // W: Computational Science – ICCS 2023 : 23rd international conference : Prague, Czech Republic, July 3–5, 2023 : proceedings, Pt. 2 / eds. Jiří Mikyška [et al.]. — Cham, Switzerland : Springer, cop. 2023. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 14074). — ISBN: 978-3-031-36020-6; e-ISBN: 978-3-031-36021-3. — S. 238–246. — Bibliogr., Abstr. — Publikacja dostępna online od: 2023-06-26
fragment książki
#129176Data dodania: 25.6.2020
Supermodeling: the next level of abstraction in the use of data assimilation / Marcin Sendera, Gregory S. Duane, Witold DZIWNEL // W: Computational Science - ICCS 2020 : 20th International Conference : Amsterdam, The Netherlands, June 3–5, 2020 : proceedings, Pt. 6 / eds. Valeria V. Krzhizhanovskaya, [et al.]. — Cham : Springer Nature Switzeland, cop. 2020. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 12142. Theoretical Computer Science and General Issues ; ISSN 0302-9743). — ISBN: 978-3-030-50432-8; e-ISBN: 978-3-030-50433-5 . — S. 133–147. — Bibliogr. s. 146–147, Abstr. — Publikacja dostępna online od: 2020-06-15