PT-2025-25757 · Llama.Cpp · Llama.Cpp

Guygoldenberg

CVE-2025-49847

Published

2025-06-17

Updated

2025-07-03

CVSS v3.1

8.8

High

Vector

AV:N/AC:L/PR:N/UI:R/S:U/C:H/I:H/A:H

Name of the Vulnerable Software and Affected Versions llama.cpp versions prior to b5662

Description The issue is related to a buffer overflow in the vocabulary-loading code of llama.cpp. An attacker-supplied GGUF model vocabulary can trigger this overflow. Specifically, the helper function token to piece() in vocab.cpp casts a large size t token length into an int32 t, causing the length check to be bypassed. As a result, memcpy is called with an oversized size, allowing a malicious model to overwrite memory beyond the intended buffer. This can lead to arbitrary memory corruption and potential code execution.

Recommendations For versions prior to b5662, update to version b5662 to resolve the issue. As a temporary workaround, consider restricting the use of attacker-supplied GGUF model vocabularies to minimize the risk of exploitation.

Exploit

Fix

Buffer Overflow

Found an issue in the description? Have something to add? Feel free to write us 👾

dbugs@ptsecurity.com

Weakness Enumeration

CWE-195CWE-119

Related Identifiers

CVE-2025-49847

GHSA-8WWF-W4QM-GPQR

OPENSUSE-SU-2025:15245-1

Affected Products

Llama.Cpp

PT-2025-25757 · Llama.Cpp · Llama.Cpp

CVE-2025-49847

Weakness Enumeration

Related Identifiers

Affected Products

References · 17