Szczegóły publikacji

Opis bibliograficzny

Spatio-temporal PM2.5 forecasting using machine learning and low-cost sensors: an urban perspective / Mateusz ZARĘBA, Szymon Cogiel, Tomasz DANEK // Engineering Proceedings [Dokument elektroniczny]. — Czasopismo elektroniczne ; ISSN  2673-4591 . — 2025 — vol. 101 iss. 1 art. no. 6, s. 1–11. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 10–11, Abstr. — Publikacja dostępna online od: 2025-07-25. — 11th international conference on time series and forecasting : Canaria, Spain, 16–18 July 2025

Autorzy (3)

Słowa kluczowe

machine learningtime seriesair pollutionpollution forecasting

Dane bibliometryczne

ID BaDAP161567
Data dodania do BaDAP2025-08-25
Tekst źródłowyURL
DOI10.3390/engproc2025101006
Rok publikacji2025
Typ publikacjireferat w czasopiśmie
Otwarty dostęptak
Creative Commons
Czasopismo/seriaEngineering Proceedings

Abstract

This study analyzes air pollution time-series big data to assess stationarity, seasonal patterns, and the performance of machine learning models in forecasting PM2.5 concentrations. Fifty-two low-cost sensors (LCS) were deployed across Krakow city and its surroundings (Poland), collecting hourly air quality data and generating nearly 20,000 observations per month. The network captured both spatial and temporal variability. The Kwiatkowski–Phillips–Schmidt–Shin (KPSS) test confirmed trend-based non-stationarity, which was addressed through differencing, revealing distinct daily and 12 h cycles linked to traffic and temperature variations. Additive seasonal decomposition exhibited time-inconsistent residuals, leading to the adoption of multiplicative decomposition, which better captured pollution outliers associated with agricultural burning. Machine learning models—Ridge Regression, XGBoost, and LSTM (Long Short-Term Memory) neural networks—were evaluated under high spatial and temporal variability (winter) and low variability (summer) conditions. Ridge Regression showed the best performance, achieving the highest 𝑅2 (0.97 in winter, 0.93 in summer) and the lowest mean squared errors. XGBoost showed strong predictive capabilities but tended to overestimate moderate pollution events, while LSTM systematically underestimated PM2.5 levels in December. The residual analysis confirmed that Ridge Regression provided the most stable predictions, capturing extreme pollution episodes effectively, whereas XGBoost exhibited larger outliers. The study proved the potential of low-cost sensor networks and machine learning in urban air quality forecasting focused on rare smog episodes (RSEs).

Publikacje, które mogą Cię zainteresować

fragment książki
#162420Data dodania: 15.9.2025
Spatio-temporal PM2.5 forecasting using machine learning and low-cost sensors: an urban perspective / Mateusz ZARĘBA, Szymon Cogiel, Tomasz DANEK // W: ITISE-2025 : [11th International conference on Time Series and Forecasting] : July 16th-18th, 2025, Gran Canaria, Spain : program & abstracts / [eds.] Ignacio Rojas, [et al.]. — [Spain : Universidad de Granada], [2025]. — ISBN: 979-13-87522-16-2. — S. 44-45, [ID] 6937
artykuł
#153652Data dodania: 17.6.2024
Machine learning techniques for spatio-temporal air pollution prediction to drive sustainable urban development in the era of energy and data transformation / Mateusz ZARĘBA, Szymon Cogiel, Tomasz DANEK, Elżbieta WĘGLIŃSKA // Energies [Dokument elektroniczny]. — Czasopismo elektroniczne ; ISSN 1996-1073. — 2024 — vol. 17 iss. 11 art. no. 2738, s. 1–13. — Wymagania systemowe: Adobe Reader. — Bibliogr. s. 12–13, Abstr. — Publikacja dostępna online od: 2024-06-04