-
University of Science and Technology in China
- Anhui Hefei, China
-
20:25
(UTC -12:00) - https://en.ustc.edu.cn/
Stars
A tutorial on RDMA based programming using code examples
Efficient and easy multi-instance LLM serving
A small library to modify all page-table levels of all processes from user space for x86_64 and ARMv8.
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
NVIDIA Linux open GPU kernel module source
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Fast OS-level support for GPU checkpoint and restore
High-speed Large Language Model Serving for Local Deployment
Serverless LLM Serving for Everyone.
A high-throughput and memory-efficient inference and serving engine for LLMs
Filesystem overlay for transparent, distributed migration of active data across separate storage systems.
SpotServe: Serving Generative Large Language Models on Preemptible Instances
mhy98 / typora-vue-theme
Forked from blinkfox/typora-vue-themeThis is a typora theme inspired by Vue document style. 一个类似于 Vue 文档风格的 Typora Markdown 编辑器主题。
Solutions to all questions of the book Introduction to the Theory of Computation, 3rd edition by Michael Sipser