Lucene search
K

11 matches found

NVD
NVD
added 2026/05/12 8:16 p.m.5 views

CVE-2026-44222

vLLM is an inference and serving engine for large language models LLMs. From 0.6.1 to before 0.20.0, there is a a Token Injection vulnerability in vLLM’s multimodal processing. Unauthenticated, text-only prompts that spell special tokens are interpreted as control. Image and video placeholder...

7.5CVSS0.00014EPSS
Exploits1References2
NVD
NVD
added 2026/04/02 8:16 p.m.1 views

CVE-2026-34760

vLLM is an inference and serving engine for large language models LLMs. From version 0.5.5 to before version 0.18.0, Librosa defaults to using numpy.mean for mono downmixing tomono, while the international standard ITU-R BS.775-4 specifies a weighted downmixing algorithm. This discrepancy results...

7.1CVSS0.00075EPSS
Exploits0References4
Vulnrichment
Vulnrichment
added 2026/04/02 6:59 p.m.0 views

CVE-2026-34760 vLLM: Downmix Implementation Differences as Attack Vectors Against Audio AI Models

vLLM is an inference and serving engine for large language models LLMs. From version 0.5.5 to before version 0.18.0, Librosa defaults to using numpy.mean for mono downmixing tomono, while the international standard ITU-R BS.775-4 specifies a weighted downmixing algorithm. This discrepancy results...

5.9CVSS5.8AI score0.00075EPSS
Exploits0References4
Cvelist
Cvelist
added 2025/11/21 1:22 a.m.9 views

CVE-2025-62372 vLLM vulnerable to DoS with incorrect shape of multimodal embedding inputs

vLLM is an inference and serving engine for large language models LLMs. From version 0.5.5 to before 0.11.1, users can crash the vLLM engine serving multimodal models by passing multimodal embedding inputs with correct ndim but incorrect shape e.g. hidden dimension is wrong, regardless of whether...

8.3CVSS0.00089EPSS
Exploits0References4
CVE
CVE
added 2025/11/21 1:18 a.m.23 views

CVE-2025-62164

The CVE affects vLLM (inference/serving engine) before 0.11.1, where the Completions API loads user-supplied prompt embeddings with torch.load() lacking proper validation. A PyTorch 2.8.0 change disables sparse-tensor invariants checks, allowing crafted tensors to bypass bounds checks and trigger...

8.8CVSS7.8AI score0.00191EPSS
Exploits0References3Affected Software1
Positive Technologies
Positive Technologies
added 2025/10/07 12:0 a.m.1 views

PT-2025-41009

Name of the Vulnerable Software and Affected Versions vLLM versions prior to 0.11.0rc2 Description vLLM is an inference and serving engine for large language models LLMs. The API key validation mechanism in versions prior to 0.11.0rc2 is susceptible to a timing attack. The string comparison used...

7.5CVSS6.7AI score0.00298EPSS
Exploits1References8
EUVD
EUVD
added 2025/10/03 8:7 p.m.1 views

EUVD-2025-16512

Malicious code in bioql PyPI...

6.5CVSS6.3AI score0.00345EPSS
Exploits1References4
OSV
OSV
added 2025/08/21 2:41 p.m.1 views

CVE-2025-48956 vLLM API endpoints vulnerable to Denial of Service Attacks

vLLM is an inference and serving engine for large language models LLMs. From 0.1.0 to before 0.10.1.1, a Denial of Service DoS vulnerability can be triggered by sending a single HTTP GET request with an extremely large header to an HTTP endpoint. This results in server memory exhaustion,...

7.5CVSS6.5AI score0.00306EPSS
Exploits0References5
OSV
OSV
added 2025/05/06 4:53 p.m.5 views

CVE-2025-30165 Remote Code Execution Vulnerability in vLLM Multi-Node Cluster Configuration

vLLM is an inference and serving engine for large language models. In a multi-node vLLM deployment using the V0 engine, vLLM uses ZeroMQ for some multi-node communication purposes. The secondary vLLM hosts open a SUB ZeroMQ socket and connect to an XPUB socket on the primary vLLM host. When data ...

8CVSS7.8AI score0.00432EPSS
Exploits0References5
PyPA
PyPA
added 2025/03/19 4:15 p.m.7 views

PYSEC-2025-223

vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. The outlines library is one of the backends used by vLLM to support structured output a.k.a. guided decoding. Outlines provides an optional cache for its compiled grammars on the local filesystem. This cache has...

6.5CVSS6.6AI score0.00658EPSS
Exploits0References3Affected Software1
OSV
OSV
added 2025/02/07 8:15 p.m.1 views

PYSEC-2025-62

vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Maliciously constructed statements can lead to hash collisions, resulting in cache reuse, which can interfere with subsequent responses and cause unintended behavior. Prefix caching makes use of Python's built-i...

2.6CVSS7AI score0.00323EPSS
Exploits0References3
Rows per page
Query Builder