The500Feed.Live
Everything going on in AI - updated daily from 500+ sources
📄 ResearchJune 11, 2026
Accelerating Speculative Diffusions via Block Verification
Speculative decoding speeds up LLM inference by using a draft model to generate tokens, with an acceptance-rejection scheme that ensures that the output matches the target distribution. Adapting this to continuous diffusions is difficult because speculative sampling requires drawing from a residual ...
Read Original Article →