Visual Studio Code

TypeScript 167,940 30,738 Updated Mar 1, 2025

A tutorial on RDMA based programming using code examples

C 522 148 Updated Jan 3, 2020

Efficient and easy multi-instance LLM serving

Python 305 24 Updated Feb 28, 2025

Working Set Size tools

C 219 28 Updated Aug 15, 2021

A small library to modify all page-table levels of all processes from user space for x86_64 and ARMv8.

C 249 72 Updated Oct 30, 2024

Linux-based partitioning hypervisor

C 1,782 336 Updated May 18, 2024
Python 45 6 Updated Nov 18, 2024

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 7,415 811 Updated Feb 28, 2025

Self-host LLMs with vLLM and BentoML

Python 89 14 Updated Feb 28, 2025

NVIDIA Linux open GPU kernel module source

C 15,578 1,354 Updated Feb 28, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,386 714 Updated Dec 17, 2024
Python 311 40 Updated Apr 2, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 87,394 23,479 Updated Mar 1, 2025

Fast OS-level support for GPU checkpoint and restore

C++ 156 13 Updated Feb 25, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,128 424 Updated Feb 19, 2025

Serverless LLM Serving for Everyone.

Python 427 37 Updated Feb 28, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 39,813 5,966 Updated Mar 1, 2025
235 23 Updated Feb 11, 2025

OpenZFS on Linux and FreeBSD

C 11,014 1,801 Updated Feb 27, 2025

An open and reliable container runtime

Go 18,026 3,545 Updated Feb 28, 2025

Filesystem overlay for transparent, distributed migration of active data across separate storage systems.

C 40 10 Updated Dec 19, 2019

SpotServe: Serving Generative Large Language Models on Preemptible Instances

112 9 Updated Feb 22, 2024

NVIDIA device plugin for Kubernetes

Go 3,050 663 Updated Feb 28, 2025

A Typora Markdown editor theme inspired by the Vue documentation style.

CSS 2 Updated Apr 1, 2022

CUDA checkpoint and restore utility

C 297 15 Updated Jan 27, 2025

Checkpoint/Restore tool

C 3,111 630 Updated Feb 20, 2025

cricket is a virtualization solution for GPUs

C 182 44 Updated Feb 18, 2025

Solutions to all questions of the book Introduction to the Theory of Computation, 3rd edition by Michael Sipser

1,443 249 Updated Dec 8, 2020

XRP: In-Kernel Storage Functions with eBPF

Shell 213 37 Updated Jul 4, 2023