Publication details

Bibliographic description

Enhancing embedding representations for large language models: a comparative study of the hashing trick and one-hot encoding / Agata Kozina, Michał PIKUS // In: Advances in Computational Collective Intelligence : 17th International Conference, ICCCI 2025 : Ho Chi Minh City, Vietnam, November 12–15, 2025 : proceedings, Pt. 1 / eds. Ngoc Thanh Nguyen, [et al.]. — Switzerland : Springer, cop. 2026. — (Communications in Computer and Information Science ; ISSN 1865-0929 ; CCIS 2747). — ISBN: 978-3-032-10201-0; e-ISBN: 978-3-032-10202-7. — pp. 415–426. — Bibliography, abstract. — Available online since: 2025-11-08

Authors (2)

Kozina Agata
Pikus Michał (AGH)

Keywords

hashing trick, OPT-1.3b, wikitext-2-raw-v1, one-hot encoding, wikitext, large language models, GPT-2, phi4-mini, embeddings

Bibliometric data

BaDAP ID	165497
Date added to BaDAP	2026-02-18
DOI	10.1007/978-3-032-10202-7_28
Publication year	2026
Publication type	conference proceedings (author)
Open access
Publisher	Springer
Conference	International Conference on Computational Collective Intelligence: Semantic Web, Social Networks and Multiagent Systems 2025
Journal/series	Communications in Computer and Information Science

Abstract

This paper investigates modifications to traditional embedding representations in large language models (LLMs) through two approaches: the hashing trick and one-hot encoding. We conducted experiments on three families of models (GPT-2, facebook/opt-1.3b, and Microsoft/phi-4-mini), evaluating three configurations: original embeddings, hash-based embeddings, and one-hot embeddings. Our evaluation uses the Wikitext/wikitext-2-raw-v1 dataset and considers evaluation loss, perplexity, and training time. The results indicate that, while the original embeddings yield the best performance in terms of predictive accuracy and training efficiency, the modified embeddings offer potential scalability benefits. Notably, the hash-based approach outperforms one-hot encoding by a small margin, albeit at a higher computational cost. We conclude by discussing the trade-offs inherent in these methods and propose directions for future work to optimize the balance between efficiency and accuracy.
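
This record does not include the authors' implementation, so the following is only a minimal sketch of the two input representations the abstract compares and of how perplexity relates to evaluation loss. The function names, the MD5-based hash, and the toy sizes (`vocab_size`, `num_buckets`, `embed_dim`) are all assumptions made for illustration and are not taken from the paper.

```python
import hashlib
import math


def hashed_index(token_id: int, num_buckets: int) -> int:
    """Hashing trick: map a token id to one of num_buckets shared rows.

    Different tokens may collide in the same bucket; that is the price
    paid for an embedding table smaller than the vocabulary.
    MD5 is an arbitrary choice here (assumption), used only because it
    gives a deterministic, well-spread hash.
    """
    digest = hashlib.md5(str(token_id).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_buckets


def one_hot(token_id: int, vocab_size: int) -> list:
    """One-hot encoding: a vocab_size-long vector with a single 1.0."""
    vec = [0.0] * vocab_size
    vec[token_id] = 1.0
    return vec


def perplexity(mean_cross_entropy: float) -> float:
    """Perplexity is the exponential of the mean cross-entropy loss (in nats)."""
    return math.exp(mean_cross_entropy)


# Hypothetical toy sizes, not the settings used in the paper.
vocab_size = 1000
num_buckets = 64
embed_dim = 16

# Size of a lookup table in each setting: rows * embedding dimension.
full_table_params = vocab_size * embed_dim
hashed_table_params = num_buckets * embed_dim

bucket = hashed_index(42, num_buckets)   # row shared by token 42 and any colliding tokens
vector = one_hot(42, vocab_size)         # sparse indicator vector for token 42
ppl = perplexity(0.0)                    # zero loss corresponds to perplexity 1.0
```

In this sketch the hashed table has `num_buckets * embed_dim` entries instead of `vocab_size * embed_dim`, which illustrates the kind of scalability benefit the abstract mentions. Collisions between tokens are one plausible reason for a gap in predictive quality relative to the original embeddings.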
