PT-2025-12092 · Vllm · Vllm

Publicado

2025-03-20

·

Atualizado

2025-03-21

·

CVE-2024-11040

CVSS v3.1

7.5

Alta

VetorAV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H
Name of the Vulnerable Software and Affected Versions vllm versions 0.5.2.2
Description The issue is related to Denial of Service attacks. It occurs in the "POST /v1/completions" and "POST /v1/embeddings" endpoints. For "POST /v1/completions", enabling use beam search and setting best of to a high value causes the HTTP connection to time out, with vllm ceasing effective work and the request remaining in a 'pending' state, blocking new completion requests. For "POST /v1/embeddings", supplying invalid inputs to the JSON object causes an issue in the background loop, resulting in all further completion requests returning a 500 HTTP error code ('Internal Server Error') until vllm is restarted.
Recommendations For version 0.5.2.2, as a temporary workaround, consider disabling the use beam search function and limiting the best of value in the "POST /v1/completions" endpoint until a patch is available. Restrict access to the "POST /v1/embeddings" endpoint to minimize the risk of exploitation by supplying invalid JSON inputs. Avoid using the best of variable with high values in the affected API endpoint until the issue is resolved. At the moment, there is no information about a newer version that contains a fix for this vulnerability.

DoS

Resource Exhaustion

Encontrou algum problema na descrição? Tem algo a acrescentar? Fique à vontade para nos escrever 👾

Enumeração de Fraquezas

Identificadores relacionados

CVE-2024-11040

Produtos afetados

Vllm