I'm from Northeast University of China,Love open source learning
-
NEU China
- China
-
09:56
(UTC -12:00) - https://twitter.com/liuchen49379445
- @liuchen49379445
Stars
AIinfer
7 repositories
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an …
This is a Chinese translation of the CUDA programming guide
Learn CUDA Programming, published by Packt
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"