Szczegóły publikacji

Opis bibliograficzny

Convolutive NTF for ambisonic source separation under reverberant conditions / Mateusz GUZIK, Konrad KOWALCZYK // W: ICASSP 2023 [Dokument elektroniczny] : 2023 IEEE International Conference on Acoustics, Speech and Signal Processing : 4–10 June, Rhodes Island, Greece : conference proceedings. — Wersja do Windows. — Dane tekstowe. — Piscataway : IEEE, cop. 2023. — e-ISBN: 978-1-7281-6327-7. — S. [1–5]. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 5, Abstr. — Publikacja dostępna online od: 2023-05-05


Autorzy (2)


Słowa kluczowe

spherical harmonicsconvolutive spatial covariance modelsource separationnon negative matrix factorization

Dane bibliometryczne

ID BaDAP146975
Data dodania do BaDAP2023-06-02
Tekst źródłowyURL
DOI10.1109/ICASSP49357.2023.10094601
Rok publikacji2023
Typ publikacjimateriały konferencyjne (aut.)
Otwarty dostęptak
KonferencjaInternational Conference on Acoustics, Speech and Signal Processing

Abstract

This paper presents a Non-negative Tensor Factorization (NTF) based sound source separation method with a novel convolutive Spatial Covariance Matrix (SCM) model, that is suitable for use with reverberant Ambisonic signals. The presented solution builds upon a previous work on SHD SCM-based NTF, but unlike the original, non-convolutive approach, it avoids the problem encountered when the analysis window is too short to capture the dominant part of the reverberant signal. Here we introduce a novel convolutive SCM model that accounts for reverberation which spans over multiple time frames and then we derive the corresponding parameter update equations. In particular, this work considers several variants of these updates, describes the underlying motivation for each algorithm design choice and indicates the update rules, which offer the highest gain in Signal-to-Distortion Ratio (SDR). The proposed solution is evaluated against the original approach for various reverberation time values, number of sources and types of source signals, using simulated first-order Ambisonic recordings. The results of this preliminary study clearly indicate that the proposed method enables higher quality of separation compared with the reference, non-convolutive algorithm.

Publikacje, które mogą Cię zainteresować

fragment książki
Wishart localization prior on spatial covariance matrix in ambisonic source separation using non-negative tensor factorization / Mateusz GUZIK, Konrad KOWALCZYK // W: ICASSP 2022 [Dokument elektroniczny] : 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing : 7–13 May 2022, virtual, 22–27 May 2022, Singapore, satellite venue: Shenzhen, China : proceedings. — Wersja do Windows. — Dane tekstowe. — Piscataway : The Institute of Electrical and Electronics Engineers, cop. 2022. — (Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing ; ISSN 1520-6149). — e-ISBN: 978-1-6654-0540-9. — S. 446–450. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 450, Abstr. — Publikacja dostępna online od: 2022-04-27
fragment książki
Maximum a posteriori estimator for convolutive sound source separation with sub-source based NTF model and the localization probabilistic prior on the mixing matrix / Mieszko FRAŚ, Konrad KOWALCZYK // W: ICASSP 2021 [Dokument elektroniczny] : 2021 IEEE International Conference on Acoustics, Speech and Signal Processing : June 6–11, 2021 virtual conference, Toronto, Ontario, Canada : proceedings. — Wersja do Windows. — Dane tekstowe. — Piscataway : The Institute of Electrical and Electronics Engineers, cop. 2021. — (Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing ; ISSN 1520-6149). — e-ISBN:  978-1-7281-7605-5. — S. 526–530. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 530, Abstr. — Publikacja dostępna online od: 2021-05-13