Szczegóły publikacji
Opis bibliograficzny
Multi-criteria linguistic optimization for covert communication in secure LLM-based steganography / Kamil WOŹNIAK, Marek R. OGIELA, Lidia OGIELA // Applied Soft Computing ; ISSN 1568-4946 . — 2025 — vol. 185 pt. A art. no. 113960, s. 1–16. — Bibliogr. s. 15–16, Abstr. — Publikacja dostępna online od: 2025-09-20
Autorzy (3)
Słowa kluczowe
Dane bibliometryczne
| ID BaDAP | 163101 |
|---|---|
| Data dodania do BaDAP | 2025-09-30 |
| Tekst źródłowy | URL |
| DOI | 10.1016/j.asoc.2025.113960 |
| Rok publikacji | 2025 |
| Typ publikacji | artykuł w czasopiśmie |
| Otwarty dostęp | |
| Creative Commons | |
| Czasopismo/seria | Applied Soft Computing |
Abstract
This paper presents a novel framework for covert communication through secure steganography using large language models (LLMs). Our approach leverages multi-criteria linguistic optimization to encode secret information directly into stylistic features of auto-regressively generated text. This strategy balances embedding capacity with naturalness and coherence. The secret message is partitioned into fixed-size blocks. Each block is embedded into binary stylistic feature vectors via a surjective linear mapping, which introduces redundancy. This redundancy enables the use of a history-aware cost function that selects stylistic vectors to minimize abrupt transitions and preserve fluency across sentences. Candidate sentences are generated by prompting LLMs with contextual and stylistic constraints. Rejection sampling then ensures exact feature matching and high linguistic quality. Experimental evaluation in multiple LLMs, diverse text contexts, and parameter settings demonstrates effective embedding capacities of up to 0.30 bits per token while maintaining strong linguistic naturalness, validated through perplexity, lexical diversity, readability, and a linguistic acceptability metric. Importantly, decoding recovers the full secret with zero error under ideal conditions. This confirms the reliability of the method. The current work focuses on embedding efficiency and imperceptibility. Robustness against active text alterations and formal undetectability assessments remain open challenges for future research. The proposed multi-criteria linguistic optimization framework offers a promising avenue for advanced covert communication by harmonizing secure information embedding with fluent, human-like language generation.