Publication details
Bibliographic description
Optimisation of a Siamese neural network for real-time energy efficient object tracking / Dominika PRZEWŁOCKA, Mateusz WĄSALA, Hubert SZOLC, Krzysztof BŁACHUT, Tomasz KRYJAK // In: Computer Vision and Graphics : International Conference : ICCVG 2020 : Warsaw, Poland, September 14–16, 2020 : proceedings / eds. Leszek J. Chmielewski, Ryszard Kozera, Arkadiusz Orłowski. - [Cham] : Springer International Publishing, cop. 2020. - (Lecture Notes in Computer Science ; ISSN 0302-9743 ; LNCS 12334. Image Processing, Computer Vision, Pattern Recognition, and Graphics). - ISBN: 978-3-030-59005-5; e-ISBN: 978-3-030-59006-2. - pp. 151–163. - Bibliogr., Abstr. - Publication available online since: 2020-08-09
Authors (5)
Keywords
Bibliometric data
| BaDAP ID | 130002 |
|---|---|
| Date added to BaDAP | 2020-09-11 |
| DOI | 10.1007/978-3-030-59006-2_14 |
| Publication year | 2020 |
| Publication type | conference proceedings (author) |
| Open access | |
| Publisher | Springer |
| Journal/series | Lecture Notes in Computer Science |
Abstract
This paper presents research on optimising visual object tracking with a Siamese neural network for embedded vision systems. The solution was required to operate in real time, preferably on a high-resolution video stream, with the lowest possible energy consumption. To meet these requirements, techniques such as reduced computational precision and pruning were considered. Brevitas, a tool dedicated to the optimisation and quantisation of neural networks for FPGA implementation, was used. A number of training scenarios were tested with varying levels of optimisation, from 16-bit integer uniform quantisation to ternary and binary networks. Next, the influence of these optimisations on tracking performance was evaluated. It was possible to reduce the size of the convolutional filters by up to 10 times relative to the original network. The results indicate that quantisation can significantly reduce the memory and computational complexity of the proposed network while still enabling precise tracking, thus allowing its use in embedded vision systems. Moreover, quantisation of weights positively affects network training by decreasing overfitting.
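
The three weight representations named in the abstract (integer uniform quantisation, ternary, and binary) can be illustrated with a minimal NumPy sketch. This is not the authors' code and does not use the Brevitas API. The function names, the symmetric per-tensor scaling, and the `threshold_ratio` heuristic for ternarisation are all assumptions made for illustration only.

```python
import numpy as np


def quantize_uniform(w, bits):
    """Symmetric uniform quantisation to signed integers (assumed scheme).

    Returns the integer codes and the scale needed to reconstruct
    approximate weights as codes * scale.
    """
    qmax = 2 ** (bits - 1) - 1
    max_abs = float(np.max(np.abs(w)))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    codes = np.clip(np.round(w / scale), -qmax, qmax).astype(np.int32)
    return codes, scale


def ternarize(w, threshold_ratio=0.7):
    """Map weights to {-1, 0, +1}.

    Weights whose magnitude is at most threshold_ratio * mean(|w|)
    become 0. The ratio is a hypothetical heuristic, not a value
    taken from the paper.
    """
    delta = threshold_ratio * np.mean(np.abs(w))
    return (np.sign(w) * (np.abs(w) > delta)).astype(np.int8)


def binarize(w):
    """Map weights to {-1, +1}; zero is mapped to +1 by convention here."""
    return np.where(w >= 0, 1, -1).astype(np.int8)


if __name__ == "__main__":
    weights = np.array([-1.0, -0.2, 0.0, 0.3, 0.5])
    codes, scale = quantize_uniform(weights, bits=16)
    print("16-bit codes:", codes, "scale:", scale)
    print("ternary:", ternarize(weights))
    print("binary:", binarize(weights))
```

In this sketch, lower precision means fewer bits stored per weight, which is the general mechanism behind the reduction in memory use that the abstract reports. The actual training procedure in the paper (quantisation-aware training with Brevitas) is not reproduced here.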