Publication details

Bibliographic description

Needle in a case: scalable search over large-scale image corpora in forensic applications / Kamil FABER, Dominik ŻUREK, Kacper Bujak, Monika Selegrat, Kamil PIĘTAK // Forensic Science International. Digital Investigation; ISSN 2666-2825. 2026, vol. 56 suppl., art. no. 302057, pp. 1–9. Bibliography pp. 8–9, abstract. Available online since 2026-03-24. DFRWS EU 2026: 13th Annual Digital Forensics Research Conference Europe, March 24–27, Linköping, Sweden

Keywords

digital forensics, image search, large scale data analysis, evidence analysis, artificial intelligence

Bibliometric data

BaDAP ID	166868
Date added to BaDAP	2026-04-14
Source text	URL
DOI	10.1016/j.fsidi.2026.302057
Publication year	2026
Publication type	conference paper in a journal
Open access	Creative Commons
Journal/series	Forensic Science International. Digital Investigation

Abstract

The rapid growth of digital content has made images a key form of communication, which also extends into criminal contexts, where visual material often constitutes critical evidence. Forensic analysts face the challenge of locating relevant images within large, heterogeneous datasets, such as those extracted from mobile devices. Manual inspection of such data is time-consuming and inefficient. Although traditional automated classification methods offer partial support, they remain constrained by predefined class sets, limiting their applicability in the dynamic and unpredictable nature of real forensic investigations. Recent advances in artificial intelligence (AI) have introduced models capable of retrieving images using natural-language queries, enabling more universal and adaptive search capabilities. In this work, we conduct a comprehensive evaluation of two modern AI paradigms for large-scale forensic image retrieval: Vision–Language Models (VLMs), which generate searchable textual captions of images, and Contrastive Language–Image Pre-training (CLIP), which performs embedding-based text–image similarity search. To the best of our knowledge, this is the first systematic comparison of these approaches in a forensic context. We evaluate 33 representative queries across three forensic use cases and nine heterogeneous datasets comprising over 80 000 images. Our results offer new insights into the trade-offs between caption-based and embedding-based retrieval methods and their applicability in practical digital forensic workflows.
