PT-2026-48537 · Pypi · Vllm

Published

2026-06-10

·

Updated

2026-06-10

·

CVE-2026-47155

CVSS v3.1

6.5

Medium

VectorAV:N/AC:H/PR:N/UI:N/S:U/C:L/I:H/A:N

Summary

vLLM's revision pinning controls do not consistently apply to all artifacts loaded for a model. A deployment that supplies --revision or --code-revision can still load dynamic code, GGUF files, image processors, retrieval side weights, or same-repository subfolder weights/config from an unpinned/default revision.
This is a supply-chain integrity issue for pinned vLLM deployments. Operators can believe they are serving a reviewed model revision while vLLM resolves behavior-affecting nested or sibling artifacts outside that reviewed revision.

Details

The expected invariant is:
When a vLLM operator supplies a model or code revision pin, every code, config, processor, weight file, side weight, and same-repository subfolder artifact loaded as part of that model should resolve under that pin unless vLLM exposes and enforces a separate explicit pin for that artifact.
Current main was verified affected at commit 3795d7acf431980e62e738493f437ae2a51549da.
Affected source boundaries:
  • vllm/model executor/models/registry.py:1045-1051 and :1058-1064
  • try resolve transformers() passes revision=model config.revision and trust remote code=model config.trust remote code, but omits code revision=model config.code revision for external auto map dynamic module imports.
  • vllm/model executor/model loader/gguf loader.py:58-60
  • The direct-file GGUF form repo/file.gguf calls hf hub download(repo id=repo id, filename=filename) without passing revision.
  • vllm/model executor/models/roberta.py:203-209
  • BGE-M3 secondary sparse and ColBERT side weights are declared with revision=None.
  • vllm/model executor/models/kimi k25.py:111-114
  • Kimi-K2.5 calls cached get image processor() without passing model config.revision.
  • vllm/model executor/models/kimi audio.py:92-95
  • Kimi-Audio loads Whisper config from the whisper-large-v3 subfolder without a revision argument.
  • vllm/model executor/models/kimi audio.py:425-430
  • Kimi-Audio declares same-repository whisper-large-v3 secondary weights with revision=None.
  • vllm/model executor/model loader/default loader.py:287-301
  • The default loader preserves model config.revision for the primary source, then consumes model-supplied secondary sources as declared.
The strongest example is Kimi-Audio: the primary moonshotai/Kimi-Audio-7B-Instruct weights preserve the configured model revision, but the same-repository whisper-large-v3 audio tower config/weights do not. A pinned Kimi-Audio deployment can therefore load the Whisper subfolder outside the audited revision.
This report does not claim a trust remote code=False bypass, unauthenticated RCE, or real artifact compromise. The issue is improper propagation of explicit artifact pins across supported loader paths.

Impact

Affected users are operators who pin vLLM model deployments to a reviewed Hugging Face revision for safety review, provenance, rollback, or reproducibility. The impact is that the pin does not reliably describe the full set of artifacts vLLM serves. Even when the operator selects an audited revision, vLLM can resolve behavior-affecting secondary artifacts from the repository default branch or another mutable ref.
Depending on the model path, the unpinned artifact can be dynamic model code, a GGUF file, an image processor, retrieval side weights, or the same-repository Kimi-Audio Whisper subfolder weights/config.
This breaks the operational guarantee of a pinned deployment: "serve the exact artifact set I reviewed." A later change to an unpinned secondary artifact can alter model behavior without changing the operator's configured revision, making review, rollback, incident response, and audit records unreliable.

Occurrences

  • vllm/model executor/models/kimi k25.py L111-L114 — Kimi-K2.5 loads its image processor with cached get image processor() but does not pass self.ctx.model config.revision. The processor can therefore resolve from the default repository revision even when the model deployment is pinned.
  • vllm/model executor/models/kimi audio.py L425-L430 — Kimi-Audio declares same-repository whisper-large-v3 secondary weights with revision=None. A pinned Kimi-Audio deployment can therefore load the Whisper audio tower weights from an unpinned/default revision.
  • vllm/model executor/models/kimi audio.py L92-L95 — Kimi-Audio loads Whisper config from the same repository's whisper-large-v3 subfolder without passing the top-level model revision. The config for this behavior-affecting subcomponent can be resolved outside the audited model revision.
  • vllm/model executor/models/registry.py L1058-L1064 — The later dynamic model-class resolution repeats the same pin-decay pattern: it forwards revision and trust remote code, but omits code revision. This means an operator-provided code pin is not enforced at the dynamic module loader boundary.
  • vllm/model executor/model loader/gguf loader.py L58-L60 — The direct GGUF form repo/file.gguf calls hf hub download(repo id=repo id, filename=filename) without passing model config.revision. A deployment that pins the model revision can therefore resolve this GGUF file from the repository default revision.
  • vllm/model executor/models/registry.py L1045-L1051 — try get class from dynamic module() is called for external auto map config/model classes with revision=model config.revision, but without forwarding model config.code revision. When --code-revision is set, this dynamic module resolution can still fall back to the default code revision instead of the audited code revision.
  • vllm/model executor/models/roberta.py L203-L209 — BgeM3EmbeddingModel creates same-repository secondary sparse/ColBERT weight sources with revision=None. The primary model revision is not propagated to these side weights, so they can be downloaded outside the operator-selected model revision.

Fixes

The vLLM maintainer (Russell Bryant) redirected the report to the private GHSA channel. Offline proof bundle (vllm artifact pin decay bundle verify.py + bundle-verification-20260430T143506Z.json) is available upon request.

Fix

Insufficient Verification of Data Authenticity

Weakness Enumeration

Related Identifiers

CVE-2026-47155
GHSA-3WW4-5JV9-J5GM

Affected Products

Vllm