Skip to content

Speculative decoding support #1986

@rahulgurnani

Description

@rahulgurnani

IGW should support speculative decoding since popular model servers like vllm/sglang do support it. But its not documented presently.

Metadata

Metadata

Assignees

Labels

needs-triageIndicates an issue or PR lacks a `triage/foo` label and requires one.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions