CVE Search Engine - Security Vulnerabilities and Exploits Search Tool

show all

12 matches found

CVE-2026-54234

A flaw was found in vLLM, a high-throughput and memory-efficient inference and serving engine for Large Language Models LLMs. A remote attacker can exploit this vulnerability by sending a specially crafted multi-request speculative decoding workload through public gRPC Generate and Abort endpoint...

7.5CVSS5.9AI score0.00343EPSS

Exploits1References6

Positive Technologies•added 2026/07/06 12:0 a.m.•10 views

PT-2026-55997

Name of the Vulnerable Software and Affected Versions vLLM versions prior to 0.24.0 Description A flaw in the rejection sampler during multi-request speculative decoding workloads allows the production of a recovered token equal to the model vocabulary size boundary value. This value is converted...

7.5CVSS5.9AI score0.00343EPSS

Exploits1References5

RedhatCVE•added 2026/06/05 7:25 p.m.•11 views

CVE-2026-44223

vLLM is an inference and serving engine for large language models LLMs. From to before 0.20.0, the extracthiddenstates speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the EngineCore process. The crash ...

6.5CVSS5.5AI score0.00367EPSS

Exploits0References1

NVD•added 2026/05/12 8:16 p.m.•15 views

CVE-2026-44223

vLLM is an inference and serving engine for large language models LLMs. From 0.18.0 to before 0.20.0, the extracthiddenstates speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the EngineCore process. The...

6.5CVSS0.00367EPSS

Exploits0References2

CVE•added 2026/05/12 7:58 p.m.•51 views

CVE-2026-44223

vLLM contains a vulnerability (CVE-2026-44223) where the extract_hidden_states speculative decoding pathway can crash the EngineCore process if any request uses penalty parameters (repetition_penalty, frequency_penalty, or presence_penalty). The issue arises from an incorrect tensor shape after t...

6.5CVSS5.9AI score0.00367EPSS

Exploits0References2Affected Software1

Vulnrichment•added 2026/05/12 7:58 p.m.•8 views

CVE-2026-44223 vLLM: extract_hidden_states speculative decoding crashes server on any request with penalty parameters

6.5CVSS5.9AI score0.00367EPSS

Exploits0References2

CNNVD•added 2026/05/12 12:0 a.m.•11 views

vLLM 安全漏洞

vLLM is an open-source LLM-based inference and service engine that features high throughput and efficient memory usage. Versions of vLLM prior to 0.20.0 contained a security vulnerability. This vulnerability stemmed from the extracthiddenstates speculative decoding proposal, which returned tensor...

6.5CVSS5.8AI score0.00367EPSS

Exploits0References1

Snyk•added 2026/05/06 9:45 p.m.•8 views

Incorrect Type Conversion or Cast

Overview vllm is an A high-throughput and memory-efficient inference and serving engine for LLMs Affected versions of this package are vulnerable to Incorrect Type Conversion or Cast through the extracthiddenstates speculative decoding. An attacker can cause the server to crash and disrupt servic...

6.5CVSS5.8AI score0.00367EPSS

Exploits0References3

OSV•added 2026/05/06 9:45 p.m.•7 views

GHSA-83VM-P52W-F9PW vLLM: extract_hidden_states speculative decoding crashes server on any request with penalty parameters

Summary The extracthiddenstates speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the EngineCore process. The crash is triggered when any request in the batch uses sampling penalty parameters...

6.5CVSS5.8AI score0.00367EPSS

Exploits0References5

Github Security Blog•added 2026/05/06 9:45 p.m.•10 views

vLLM: extract_hidden_states speculative decoding crashes server on any request with penalty parameters

6.5CVSS5.8AI score0.00367EPSS

Exploits0References5Affected Software1

Positive Technologies•added 2026/05/06 12:0 a.m.•13 views

PT-2026-38288

Name of the Vulnerable Software and Affected Versions vLLM versions 0.18.0 through 0.19.1 Description The extract hidden states speculative decoding proposer returns a tensor with an incorrect shape after the first decode step, leading to a RuntimeError that crashes the EngineCore process. This...

6.5CVSS5.8AI score0.00367EPSS

Exploits0References9

Schneier on Security•added 2026/02/17 12:1 p.m.•10 views

Side-Channel Attacks Against LLMs

Here are three papers describing different side-channel attacks against LLMs. "Remote Timing Attacks on Efficient Language Model Inference": Abstract: Scaling up language models has significantly increased their capabilities. But larger models are slower models, and so there is now an extensive...

5.7AI score

Exploits0

Rows per page

Query Builder

Family

Bulletin Type

Min CVSS Score

Date

Order by