DSpark: Speculative decoding accelerates LLM inference [pdf]
(github.com)
662 points | by aurenvale | 9 hours ago
251 comments