Szczegóły publikacji
Opis bibliograficzny
Enhancing Gaussian Mixture Model fitting via equiprobable binning and adaptive differential evolution / Wojciech ACHTELIK, Maciej SMOŁKA // W: Computational Science – ICCS 2025 Workshops : 25th international conference : Singapore, Singapore, July 7–9, 2025 : proceedings, Pt. 2 / eds. Maciej Paszyński, Amanda S. Barnard, Yongjie Jessica Zhang. — Cham : Springer Nature Switzerland, cop. 2025. — (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 15908). — ISBN: 978-3-031-97556-1; e-ISBN: 978-3-031-97557-8. — S. 301–315. — Bibliogr., Abstr. — Publikacja dostępna online od: 2025-07-06
Autorzy (2)
Dane bibliometryczne
| ID BaDAP | 161056 |
|---|---|
| Data dodania do BaDAP | 2025-07-18 |
| DOI | 10.1007/978-3-031-97557-8_22 |
| Rok publikacji | 2025 |
| Typ publikacji | materiały konferencyjne (aut.) |
| Otwarty dostęp | |
| Wydawca | Springer |
| Konferencja | International Conference on Computational Science 2025 |
| Czasopismo/seria | Lecture Notes in Computer Science |
Abstract
Fitting Gaussian Mixture Models (GMMs) to one-dimensional data is a fundamental task in machine learning, traditionally addressed using the Expectation-Maximization (EM) algorithm. However, EM lacks inherent mechanisms to enforce separation between mixture components, a critical requirement in domains like medical research where distinct subgroups must be identified. Recently, the Distribution Optimization (DO) framework addressed this limitation by reformulating GMM estimation as a chi-squared goodness-of-fit minimization problem with an overlap penalty to enhance separation. However, its reliance on equiwidth binning and genetic algorithms can limit accuracy and scalability. In this paper, we refine the DO framework in two key ways: (1) replacing equiwidth binning with Mann–Wald’s equiprobable cells to improve estimation accuracy, and (2) adopting advanced Differential Evolution (DE) for more robust optimization of the high-dimensional parameter space. Through extensive experiments on synthetic and real-world datasets, we demonstrate that our refined approach significantly enhances accuracy, stability, and scalability compared to the original DO method.