Szczegóły publikacji

Opis bibliograficzny

Interactive data mining by using multidimensional scaling / Piotr PAWLICZEK, Witold DZWINEL // Procedia Computer Science [Dokument elektroniczny]. — Czasopismo elektroniczne ; ISSN 1877-0509. — 2013 — vol. 18, s. 40-49. — Bibliogr. s. 49, Abstr. — Dodatkowa afiliacja P. Pawliczek: University of Texas, Departament of Biochemistry and Molecular Biology. — 13th Annual International Conference on Computational Science (ICCS) : Barcelona, Spain, 05–07 June, 2013


Autorzy (2)


Słowa kluczowe

incomplete distance matrixGPUmethod of particlesmulti-core CPUmulti-dimensional scaling

Dane bibliometryczne

ID BaDAP77359
Data dodania do BaDAP2013-11-14
DOI10.1016/j.procs.2013.05.167
Rok publikacji2013
Typ publikacjireferat w czasopiśmie
Otwarty dostęptak
Czasopismo/seriaProcedia Computer Science

Abstract

Blind choice and parameterization of data mining tools often yield vague or completely misleading results. Interactive visualization enables not only extensive exploration of data but also better matching of clustering/classification schemes to the type of data being analyzed. The multidimensional scaling (MDS), which employs particle dynamics to the error function minimization, is a good candidate to be a computational engine for interactive data mining. However, the main disadvantage of MDS is both its memory and time complexity. We developed novel SUBSET algorithm of a lower complexity, which is competitive to the best, currently used, MDS algorithms in terms of efficiency and accuracy. SUBSET employs reduced dissimilarity matrix, which structure allows for efficient usage of both multi-core CPU and SIMD GPU processor architectures. Consequently, SUBSET enables visualization of datasets consisting of an order of 105 data items on a standard personal computer or laptop. We compare a few strategies of dissimilarity matrix reduction and we present typical timings obtained by respective MDS algorithms on selected multithread CPU and GPU architectures. (C) 2013 The Authors. Published by Elsevier B.V.

Publikacje, które mogą Cię zainteresować

artykuł
Visual exploration of data by using multidimensional scaling on multicolore CPU, GPU, and MPI cluster / Piotr PAWLICZEK, Witold DZWINEL, David A. Yuen // Concurrency and Computation : Practice and Experience ; ISSN 1532-0626. — 2014 — vol. 26 iss. 3, s. 662–682. — Bibliogr. s. 680–682, Abstr. — Piotr Pawliczek – dod. afiliacja: University of Texas
artykuł
Very fast interactive visualization of large sets of high-dimensional data / Witold DZWINEL, Rafał WCISŁO // Procedia Computer Science [Dokument elektroniczny]. — Czasopismo elektroniczne ; ISSN 1877-0509. — 2015 — vol. 51, s. 572–581. — Bibliogr. s. 581, Abstr. — ICCS 2015 : International Conference On Computational Science : Computational Science at the Gates of Nature : June 1–3, 2015 in Reykjavík, Iceland