Szczegóły publikacji

Opis bibliograficzny

Wishart localization prior on spatial covariance matrix in ambisonic source separation using non-negative tensor factorization / Mateusz GUZIK, Konrad KOWALCZYK // W: ICASSP 2022 [Dokument elektroniczny] : 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing : 7–13 May 2022, virtual, 22–27 May 2022, Singapore, satellite venue: Shenzhen, China : proceedings. — Wersja do Windows. — Dane tekstowe. — Piscataway : The Institute of Electrical and Electronics Engineers, cop. 2022. — (Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing ; ISSN 1520-6149). — e-ISBN: 978-1-6654-0540-9. — S. 446–450. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 450, Abstr. — Publikacja dostępna online od: 2022-04-27


Autorzy (2)


Słowa kluczowe

ambisonicsarray signal processingsource separationnon negative tensor factorizationspherical harmonics

Dane bibliometryczne

ID BaDAP140050
Data dodania do BaDAP2022-05-06
Tekst źródłowyURL
DOI10.1109/ICASSP43922.2022.9746222
Rok publikacji2022
Typ publikacjimateriały konferencyjne (aut.)
Otwarty dostęptak
WydawcaInstitute of Electrical and Electronics Engineers (IEEE)
Konferencja2022 IEEE International Conference on Acoustics, Speech and Signal Processing
Czasopismo/seriaProceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing

Abstract

This paper presents an extension of the existing Non-negative Tensor Factorization (NTF) based method for sound source separation under reverberant conditions, formulated for Ambisonic microphone mixture signals. In particular, we address the problem of optimal exploitation of the prior knowledge concerning the source localization, through the formulation of a suitable Maximum a Posteriori (MAP) framework. Within the presented approach, the magnitude spectrograms are modelled by the NTF and the individual source Spatial Covariance Matrices (SCM) are approximated as a sum of anechoic Spherical Harmonic (SH) components, weighted with the so-called spatial selector. We constrain the SCM using the Wishart distribution, which leads to a new posterior probability and in turn to the derivation of the extended update rules. The proposed solution avoids the issues encountered in the original method, related to the empirical binary initialization strategy for the spatial selector weights, which due to multiplicative update rules may result in sound coming from certain directions not being taken into account. The proposed method is evaluated against the original algorithm and another recently proposed Expectation Maximization (EM) algorithm that also incorporates a spatial localization prior, showing improved separation performance in experiments with first-order Ambisonic recordings of musical instruments and speech utterances.

Publikacje, które mogą Cię zainteresować

fragment książki
Convolutive NTF for ambisonic source separation under reverberant conditions / Mateusz GUZIK, Konrad KOWALCZYK // W: ICASSP 2023 [Dokument elektroniczny] : 2023 IEEE International Conference on Acoustics, Speech and Signal Processing : 4–10 June, Rhodes Island, Greece : conference proceedings. — Wersja do Windows. — Dane tekstowe. — Piscataway : IEEE, cop. 2023. — e-ISBN: 978-1-7281-6327-7. — S. [1–5]. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 5, Abstr. — Publikacja dostępna online od: 2023-05-05
fragment książki
Maximum a posteriori estimator for convolutive sound source separation with sub-source based NTF model and the localization probabilistic prior on the mixing matrix / Mieszko FRAŚ, Konrad KOWALCZYK // W: ICASSP 2021 [Dokument elektroniczny] : 2021 IEEE International Conference on Acoustics, Speech and Signal Processing : June 6–11, 2021 virtual conference, Toronto, Ontario, Canada : proceedings. — Wersja do Windows. — Dane tekstowe. — Piscataway : The Institute of Electrical and Electronics Engineers, cop. 2021. — (Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing ; ISSN 1520-6149). — e-ISBN:  978-1-7281-7605-5. — S. 526–530. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 530, Abstr. — Publikacja dostępna online od: 2021-05-13