🎯 Focusing
Pinned
- dilab-zju/self-speculative-decoding (Public): Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**