CVE-2026-34756 | Mend Vulnerability Database

Vulnerability DatabaseCVE-2026-34756

CVE-2026-34756

Published:April 06, 2026

Updated:July 13, 2026

vLLM is an inference and serving engine for large language models (LLMs). From 0.1.0 to before 0.19.0, a Denial of Service vulnerability exists in the vLLM OpenAI-compatible API server. Due to the lack of an upper bound validation on the n parameter in the ChatCompletionRequest and CompletionRequest Pydantic models, an unauthenticated attacker can send a single HTTP request with an astronomically large n value. This completely blocks the Python asyncio event loop and causes immediate Out-Of-Memory crashes by allocating millions of request object copies in the heap before the request even reaches the scheduling queue. This vulnerability is fixed in 0.19.0.

Affected Packages

vllm (CONDA):

Affected version(s) >=0.8.3 <0.19.0

Fix Suggestion:

Update to version 0.19.0

https://github.com/vllm-project/vllm.git (GITHUB):

Affected version(s) >=v0.1.0 <v0.19.0

Fix Suggestion:

Update to version v0.19.0

vllm (PYTHON):

Affected version(s) >=0.1.0 <0.19.0

Fix Suggestion: