CVE Search Engine - Security Vulnerabilities and Exploits Search Tool

show all

12 matches found

RedhatCVE•added 2026/06/05 7:25 p.m.•15 views

CVE-2026-44223

vLLM is an inference and serving engine for large language models LLMs. From to before 0.20.0, the extracthiddenstates speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the EngineCore process. The crash ...

6.5CVSS5.5AI score0.00367EPSS

SaveExploits0References1

Packet Storm News•added 2026/05/28 12:0 a.m.•19 views

Minimal Prompt Perturbations Lead to Code Vulnerabilities: Prompt Fragility and Hidden-State Signals in Coding LLMs

LLM-based coding assistants are seeing rapid adoption, offering substantial gains in developer productivity. As organizations increasingly ship code these agents produce, the security of that code becomes critical. Prior work has shown that minor prompt perturbations degrade the functional...

5.8AI score

SaveExploits0

NVD•added 2026/05/12 8:16 p.m.•16 views

CVE-2026-44223

vLLM is an inference and serving engine for large language models LLMs. From 0.18.0 to before 0.20.0, the extracthiddenstates speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the EngineCore process. The...

6.5CVSS0.00367EPSS

SaveExploits0References2

ATTACKERKB•added 2026/05/12 7:58 p.m.•8 views

CVE-2026-44223

6.5CVSS5.9AI score0.00367EPSS

SaveExploits0References3Affected Software1

CVE•added 2026/05/12 7:58 p.m.•52 views

CVE-2026-44223

vLLM contains a vulnerability (CVE-2026-44223) where the extract_hidden_states speculative decoding pathway can crash the EngineCore process if any request uses penalty parameters (repetition_penalty, frequency_penalty, or presence_penalty). The issue arises from an incorrect tensor shape after t...

6.5CVSS5.9AI score0.00367EPSS

SaveExploits0References2Affected Software1

Cvelist•added 2026/05/12 7:58 p.m.•43 views

CVE-2026-44223 vLLM: extract_hidden_states speculative decoding crashes server on any request with penalty parameters

6.5CVSS0.00367EPSS

SaveExploits0References2

Vulnrichment•added 2026/05/12 7:58 p.m.•9 views

CVE-2026-44223 vLLM: extract_hidden_states speculative decoding crashes server on any request with penalty parameters

6.5CVSS5.9AI score0.00367EPSS

SaveExploits0References2

CNNVD•added 2026/05/12 12:0 a.m.•11 views

vLLM 安全漏洞

vLLM is an open-source LLM-based inference and service engine that features high throughput and efficient memory usage. Versions of vLLM prior to 0.20.0 contained a security vulnerability. This vulnerability stemmed from the extracthiddenstates speculative decoding proposal, which returned tensor...

6.5CVSS5.8AI score0.00367EPSS

SaveExploits0References1

Snyk•added 2026/05/06 9:45 p.m.•8 views

Incorrect Type Conversion or Cast

Overview vllm is an A high-throughput and memory-efficient inference and serving engine for LLMs Affected versions of this package are vulnerable to Incorrect Type Conversion or Cast through the extracthiddenstates speculative decoding. An attacker can cause the server to crash and disrupt servic...

6.5CVSS5.8AI score0.00367EPSS

SaveExploits0References3

Github Security Blog•added 2026/05/06 9:45 p.m.•12 views

vLLM: extract_hidden_states speculative decoding crashes server on any request with penalty parameters

Summary The extracthiddenstates speculative decoding proposer in vLLM returns a tensor with an incorrect shape after the first decode step, causing a RuntimeError that crashes the EngineCore process. The crash is triggered when any request in the batch uses sampling penalty parameters...

6.5CVSS5.8AI score0.00367EPSS

SaveExploits0References5Affected Software1

Packet Storm News•added 2025/05/23 12:0 a.m.•18 views

An Attack to Break Permutation-Based Private Third-Party Inference Schemes for LLMs

Recent advances in Large Language Models LLMs have led to the widespread adoption of third-party inference services, raising critical privacy concerns. Existing methods of performing private third-party inference, such as Secure Multiparty Computation SMPC, often rely on cryptographic methods...

7AI score

SaveExploits0

Packet Storm News•added 2025/05/08 12:0 a.m.•7 views

Defending against Indirect Prompt Injection by Instruction Detection

The integration of Large Language Models LLMs with external sources is becoming increasingly common, with Retrieval-Augmented Generation RAG being a prominent example. However, this integration introduces vulnerabilities of Indirect Prompt Injection IPI attacks, where hidden instructions embedded...

7.2AI score

SaveExploits0

Rows per page

Query Builder

Family

Bulletin Type

Min CVSS Score

Date

Order by