PT-2026-5654 · Hugging Face · Text-Generation-Inference

Published

2026-02-02

·

Updated

2026-02-26

·

CVE-2026-0599

CVSS v3.1

7.5

High

VectorAV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H
Name of the Vulnerable Software and Affected Versions huggingface/text-generation-inference version 3.3.6 huggingface/text-generation-inference versions prior to 3.3.7
Description A flaw exists in huggingface/text-generation-inference that allows unauthenticated remote attackers to cause a denial-of-service condition through resource exhaustion. The issue occurs during input validation in VLM mode when the system processes Markdown image links by performing HTTP GET requests. The entire response body is read into memory and cloned, potentially leading to network bandwidth saturation, memory inflation, and CPU overutilization. This behavior can crash the host machine, especially in default configurations lacking memory limits and authentication. The issue is triggered even if the request is ultimately rejected due to token limits.
Recommendations Update huggingface/text-generation-inference to version 3.3.7 or later.

Fix

DoS

Resource Exhaustion

Found an issue in the description? Have something to add? Feel free to write us 👾

Weakness Enumeration

Related Identifiers

CVE-2026-0599
GHSA-J7X9-7J54-2V3H

Affected Products

Text-Generation-Inference