Szczegóły publikacji

Opis bibliograficzny

Maximum a posteriori estimator for convolutive sound source separation with sub-source based NTF model and the localization probabilistic prior on the mixing matrix / Mieszko FRAŚ, Konrad KOWALCZYK // W: ICASSP 2021 [Dokument elektroniczny] : 2021 IEEE International Conference on Acoustics, Speech and Signal Processing : June 6–11, 2021 virtual conference, Toronto, Ontario, Canada : proceedings. — Wersja do Windows. — Dane tekstowe. — Piscataway : The Institute of Electrical and Electronics Engineers, cop. 2021. — (Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing ; ISSN 1520-6149). — e-ISBN:  978-1-7281-7605-5. — S. 526–530. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 530, Abstr. — Publikacja dostępna online od: 2021-05-13


Autorzy (2)


Słowa kluczowe

sound source separationprobabilistic localization priorexpectation maximizationnon negative tensor factorization

Dane bibliometryczne

ID BaDAP136320
Data dodania do BaDAP2021-09-22
Tekst źródłowyURL
DOI10.1109/ICASSP39728.2021.9413863
Rok publikacji2021
Typ publikacjimateriały konferencyjne (aut.)
Otwarty dostęptak
WydawcaInstitute of Electrical and Electronics Engineers (IEEE)
Konferencja2021 IEEE International Conference on Acoustics, Speech and Signal Processing
Czasopismo/seriaProceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing

Abstract

In this paper we present a method for the separation of sound source signals recorded using multiple microphones in a reverberant room. In particular, we propose a maximum a posteriori (MAP) estimator based on the multichannel nonnegative tensor factorization (NTF) model with the localization prior distribution on the mixing matrix, in which the latent data consists of the so-called sub-sources for an improved performance in a reverberant environment. For the proposed MAP estimator, we derive the sub-source based expectation maximization (EM) algorithm with the multiplicative update rules (MU) and the localization prior distribution (LP) on the mixing matrix (SSEM-MU-LP). We then perform several experiments for speech and instrumental sound sources recorded using two microphones, in determined and under-determined scenarios, and with different types of initialization of the model parameters. The results of these experiments clearly indicate a significant improvement of the proposed algorithm with the localization prior over the state-of-the-art NTF-based source separation algorithms, which can reach up to 50% in the signal-to-distortion ratio.

Publikacje, które mogą Cię zainteresować

fragment książki
Incorporation of localization information for sound source separation in spherical harmonic domain / Mateusz GUZIK, Mieszko FRAŚ, Konrad KOWALCZYK // W: IEEE MMSP 2021 [Dokument elektroniczny] : IEEE 23rd International Workshop on Multimedia Signal Processing : October 06–08, 2021, Tampere, Finland. — Wersja do Windows. — Dane tekstowe. — [Piscataway : IEEE], [2021]. — e-ISBN: 978-1-6654-3288-7. — S. [1–6]. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. [6], Abstr.
fragment książki
Wishart localization prior on spatial covariance matrix in ambisonic source separation using non-negative tensor factorization / Mateusz GUZIK, Konrad KOWALCZYK // W: ICASSP 2022 [Dokument elektroniczny] : 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing : 7–13 May 2022, virtual, 22–27 May 2022, Singapore, satellite venue: Shenzhen, China : proceedings. — Wersja do Windows. — Dane tekstowe. — Piscataway : The Institute of Electrical and Electronics Engineers, cop. 2022. — (Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing ; ISSN 1520-6149). — e-ISBN: 978-1-6654-0540-9. — S. 446–450. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 450, Abstr. — Publikacja dostępna online od: 2022-04-27