vllm.entrypoints.serve.disagg.protocol ¶
GenerateRequest ¶
Bases: BaseModel
Source code in vllm/entrypoints/serve/disagg/protocol.py
features class-attribute instance-attribute ¶
features: MultiModalFeatures | None = None
Multimodal hashes and placeholder positions (populated for MM inputs).
sampling_params instance-attribute ¶
sampling_params: SamplingParams
The sampling parameters for the model.
MultiModalFeatures ¶
Bases: BaseModel
Lightweight multimodal metadata produced by the render step.
Carries hashes (for cache lookup / identification) and placeholder positions so the downstream /generate service knows where in the token sequence each multimodal item lives.
Source code in vllm/entrypoints/serve/disagg/protocol.py
kwargs_data class-attribute instance-attribute ¶
Per-modality serialized tensor data.
Each value is a list parallel to mm_hashes[modality]. A str entry is a base64-encoded MultiModalKwargsItem; None means the item should be resolved from cache. The entire field is None for metadata-only (cache-hit) responses.
mm_hashes instance-attribute ¶
Per-modality item hashes, e.g. {"image": ["abc", "def"]}.
mm_placeholders instance-attribute ¶
mm_placeholders: dict[str, list[PlaceholderRangeInfo]]
Per-modality placeholder ranges in the token sequence.
PlaceholderRangeInfo ¶
Bases: BaseModel
Serializable placeholder location for a single multi-modal item.