CVE Search Engine - Security Vulnerabilities and Exploits Search Tool

show all

12 matches found

OSV•added 2026/06/29 11:50 a.m.•5 views

PYSEC-2026-567 vLLM Allows Remote Code Execution via PyNcclPipe Communication Service

Impacted Environments This issue ONLY impacts environments using the PyNcclPipe KV cache transfer integration with the V0 engine. No other configurations are affected. Summary vLLM supports the use of the PyNcclPipe class to establish a peer-to-peer communication domain for data transmission...

9.8CVSS7.9AI score0.00959EPSS

Exploits1References8

NVD•added 2026/05/12 8:16 p.m.•9 views

CVE-2026-44222

vLLM is an inference and serving engine for large language models LLMs. From 0.6.1 to before 0.20.0, there is a a Token Injection vulnerability in vLLM’s multimodal processing. Unauthenticated, text-only prompts that spell special tokens are interpreted as control. Image and video placeholder...

7.5CVSS0.00414EPSS

Exploits1References2

NVD•added 2026/04/02 8:16 p.m.•8 views

CVE-2026-34760

vLLM is an inference and serving engine for large language models LLMs. From version 0.5.5 to before version 0.18.0, Librosa defaults to using numpy.mean for mono downmixing tomono, while the international standard ITU-R BS.775-4 specifies a weighted downmixing algorithm. This discrepancy results...

7.1CVSS0.00267EPSS

Exploits0References4

Vulnrichment•added 2026/04/02 6:59 p.m.•1 views

CVE-2026-34760 vLLM: Downmix Implementation Differences as Attack Vectors Against Audio AI Models

5.9CVSS5.8AI score0.00267EPSS

Exploits0References4

Cvelist•added 2025/11/21 1:22 a.m.•14 views

CVE-2025-62372 vLLM vulnerable to DoS with incorrect shape of multimodal embedding inputs

vLLM is an inference and serving engine for large language models LLMs. From version 0.5.5 to before 0.11.1, users can crash the vLLM engine serving multimodal models by passing multimodal embedding inputs with correct ndim but incorrect shape e.g. hidden dimension is wrong, regardless of whether...

8.3CVSS0.00338EPSS

Exploits0References4

CVE•added 2025/11/21 1:18 a.m.•47 views

CVE-2025-62164

The CVE affects vLLM (inference/serving engine) before 0.11.1, where the Completions API loads user-supplied prompt embeddings with torch.load() lacking proper validation. A PyTorch 2.8.0 change disables sparse-tensor invariants checks, allowing crafted tensors to bypass bounds checks and trigger...

8.8CVSS7.8AI score0.00849EPSS

Exploits0References3Affected Software1

EUVD•added 2025/10/03 8:7 p.m.•4 views

EUVD-2025-16512

Malicious code in bioql PyPI...

6.5CVSS6.3AI score0.00426EPSS

Exploits1References4

Positive Technologies•added 2025/09/28 12:0 a.m.•5 views

PT-2025-41009

Name of the Vulnerable Software and Affected Versions vLLM versions prior to 0.11.0rc2 Description vLLM is an inference and serving engine for large language models LLMs. The API key validation mechanism in versions prior to 0.11.0rc2 is susceptible to a timing attack. The string comparison used...

7.8CVSS6.5AI score0.00532EPSS

Exploits1References17

OSV•added 2025/08/21 2:41 p.m.•3 views

CVE-2025-48956 vLLM API endpoints vulnerable to Denial of Service Attacks

vLLM is an inference and serving engine for large language models LLMs. From 0.1.0 to before 0.10.1.1, a Denial of Service DoS vulnerability can be triggered by sending a single HTTP GET request with an extremely large header to an HTTP endpoint. This results in server memory exhaustion,...

7.5CVSS6.5AI score0.00527EPSS

Exploits0References5

OSV•added 2025/05/06 4:53 p.m.•7 views

CVE-2025-30165 Remote Code Execution Vulnerability in vLLM Multi-Node Cluster Configuration

vLLM is an inference and serving engine for large language models. In a multi-node vLLM deployment using the V0 engine, vLLM uses ZeroMQ for some multi-node communication purposes. The secondary vLLM hosts open a SUB ZeroMQ socket and connect to an XPUB socket on the primary vLLM host. When data ...

8CVSS7.8AI score0.00502EPSS

Exploits0References5

PyPA•added 2025/03/19 4:15 p.m.•11 views

PYSEC-2025-223

vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. The outlines library is one of the backends used by vLLM to support structured output a.k.a. guided decoding. Outlines provides an optional cache for its compiled grammars on the local filesystem. This cache has...

6.5CVSS6.6AI score0.00421EPSS

Exploits0References3Affected Software1

OSV•added 2025/02/07 8:15 p.m.•5 views

PYSEC-2025-62

vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Maliciously constructed statements can lead to hash collisions, resulting in cache reuse, which can interfere with subsequent responses and cause unintended behavior. Prefix caching makes use of Python's built-i...

2.6CVSS7AI score0.00184EPSS

Exploits0References3

Rows per page

Query Builder

Family

Bulletin Type

Min CVSS Score

Date

Order by