Szczegóły publikacji

Opis bibliograficzny

Efficient object detection in fused visual and infrared spectra for edge platforms / Piotr Janyst, Bogusław CYGANEK, Łukasz Przebinda // W: Data Analytics in System Engineering : proceedings of 7th Computational Methods in Systems and Software 2023, Vol. 4. — Cham : Springer, cop. 2024. — (Lecture Notes in Networks and Systems ; ISSN 2367-3370 ; vol. 935). — ISBN: 978-3-031-54819-2; e-ISBN: 978-3-031-54820-8. — eds. Radek Silhavy, Petr Silhavy. — S. 243–253. — Bibliogr., Abstr. — B. Cyganek - dod. afiliacja: MyLED Inc., Kraków

Autorzy (3)

Słowa kluczowe

image fusionmulti-modal embeddingsYOLO architecturethermal object detection

Dane bibliometryczne

ID BaDAP156181
Data dodania do BaDAP2024-11-20
DOI10.1007/978-3-031-54820-8_19
Rok publikacji2024
Typ publikacjimateriały konferencyjne (aut.)
Otwarty dostęptak
WydawcaSpringer
Czasopismo/seriaLecture Notes in Networks and Systems

Abstract

Image fusion is an important task in computer vision, aiming to combine information from multiple modalities to enhance overall perception and understanding. This paper presents an innovative approach to image fusion, specifically focusing on the fusion of RGB and thermal image embeddings to enhance object recognition and detection performance. To achieve this, we introduce two key components, FusionConv and SepThermalConv layers to the YOLO object detection network, as well as modified FusionC3 layer. The FusionConv layer effectively integrates RGB and thermal image features by leveraging multimodal embeddings with use of fusion parameter. Similarly, the SepThermal Conv layer optimizes the processing of thermal information by incorporating separate branches for enhanced representation. Extensive experiments conducted on a custom dataset demonstrate significant performance gains achieved by our fusion method compared to using individual modalities in isolation. Our results highlight the potential of multimodal fusion techniques to improve object detection and perception of complex scenes by effectively combining RGB and thermal images.