galiyu

icefire galiyu

0 followers · 18 following

Starred repositories

facebook / zstd

Zstandard - Fast real-time compression algorithm

C 24,381 2,191 Updated Feb 20, 2025

nullplay / Unified-Convolution-Framework

C++ 7 Updated Apr 24, 2023

ParCIS / Magicube

Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.

C++ 86 17 Updated Nov 23, 2022

JonathanSalwan / Triton

Triton is a dynamic binary analysis library. Build your own program analysis tools, automate your reverse engineering, perform software verification or just emulate code.

C++ 3,629 539 Updated Feb 16, 2025

HPMLL / DTC-SpMM_ASPLOS24

C++ 27 4 Updated Jun 19, 2024

open-mmlab / OpenPCDet

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Python 4,843 1,320 Updated Aug 8, 2024

open-mmlab / mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

Python 5,526 1,581 Updated Jul 10, 2024

KindXiaoming / pykan

Kolmogorov Arnold Networks

Jupyter Notebook 15,420 1,449 Updated Jan 19, 2025

NVIDIA / cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 6,958 1,933 Updated Feb 21, 2025

Oneflow-Inc / dfccl

C++ 20 7 Updated Feb 17, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 38,916 5,826 Updated Feb 23, 2025

QSCTech / zju-icicles

浙江大学课程攻略共享计划

HTML 37,853 9,465 Updated Feb 21, 2025

microsoft / NPKit

NCCL Profiling Kit

Python 127 12 Updated Jul 1, 2024

UofT-EcoSystem / Minuet

[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs

Cuda 75 3 Updated Jun 7, 2024

Azure / msccl

Microsoft Collective Communication Library

62 6 Updated Nov 23, 2024

microsoft / taccl

TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches

Python 69 10 Updated Jul 25, 2023

openucx / ucc

Unified Collective Communication Library

C 227 104 Updated Feb 19, 2025

microsoft / msccl

Microsoft Collective Communication Library

C++ 336 31 Updated Sep 20, 2023

microsoft / msccl-tools

Synthesizer for optimal collective communication algorithms

Python 104 25 Updated Apr 8, 2024

ROCm / rccl

ROCm Communication Collectives Library (RCCL)

C++ 298 137 Updated Feb 22, 2025

NVIDIA / nccl

Optimized primitives for collective multi-GPU communication

C++ 3,484 864 Updated Jan 27, 2025

maxiaof / github-hosts

通过修改Hosts解决国内Github经常抽风访问不到,每日更新

Java 1,529 101 Updated Jan 22, 2025

binary-husky / gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 67,642 8,298 Updated Feb 21, 2025