zjnyly

郑佳宁 zjnyly

19 followers · 73 following

sysu
https://zjnyly.github.io/

Achievements

Lists (5)

DSL

HDR

SPARSE-HARDWARE

streaming-benchmark

XILINX

Starred repositories

popjane / free_chatgpt_api

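🔥 Free, public-interest ChatGPT API (Free ChatGPT API, GPT4 API). Connects directly with no proxy required, accesses ChatGPT using the standard OpenAI API key format, and works with projects such as ChatGPT-next-web, ChatGPT-Midjourney, Lobe-chat, Botgem, FastGPT, and Immersive Translate.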

3,750 395 Updated Nov 12, 2024

Xilinx / open-nic

AMD OpenNIC Project Overview

Shell 243 41 Updated Dec 20, 2022

wdlctc / headinfer

Python 24 2 Updated Feb 26, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,450 389 Updated Feb 28, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 21,799 1,931 Updated Mar 1, 2025

RayVentura / ShortGPT

🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation

Python 6,198 794 Updated Feb 10, 2025

xcw-1010 / HLSPilot

C 23 4 Updated Dec 10, 2024

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,853 4,062 Updated Jul 17, 2024

TorchMoE / MoE-Infinity

PyTorch library for cost-effective, fast and easy serving of MoE models.

Python 138 12 Updated Feb 27, 2025

nyu-systems / Grendel-GS

Ongoing research on training Gaussian splatting at scale with a distributed system

Python 469 29 Updated Aug 9, 2024

INT-FlashAttention2024 / INT-FlashAttention

C++ 60 3 Updated Jan 23, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,723 502 Updated Feb 27, 2025

ranggihwang / Pregated_MoE

C++ 41 5 Updated May 4, 2024

deepseek-ai / DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,507 264 Updated Jan 16, 2024

Infini-AI-Lab / MagicPIG

[ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation

Python 192 14 Updated Dec 16, 2024

microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 530 39 Updated Feb 14, 2025

abdelfattah-lab / Kratos-benchmark

Kratos: An FPGA Benchmark for Unrolled Deep Neural Networks with Fine-Grained Sparsity and Mixed Precision

Python 9 2 Updated Jul 25, 2024

AIoT-MLSys-Lab / Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

1,104 95 Updated Feb 27, 2025

BaiTheBest / SparseLLM

Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)

Python 51 8 Updated Feb 8, 2025

FMInference / DejaVu

Python 311 40 Updated Apr 2, 2024

thunlp / SparsingLaw

The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".

Python 18 1 Updated Nov 12, 2024

thunlp / LLMxMapReduce

Python 174 9 Updated Feb 21, 2025

TinyVolt / optimal-brain-compression

An implementation of OBC algorithm packed into a module

Python 8 Updated Dec 28, 2023

SqueezeAILab / SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Python 678 43 Updated Aug 13, 2024

GATECH-EIC / ShiftAddViT

[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer

Python 32 Updated Dec 6, 2023

snu-comparch / InfiniGen

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Python 110 21 Updated Jul 10, 2024

ztxz16 / fastllm

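A pure C++ cross-platform LLM acceleration library, callable from Python. ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU; supports GLM, LLaMA, and MOSS base models; runs smoothly on mobile devices.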

C++ 3,396 348 Updated Feb 27, 2025

Theia-4869 / FasterVLM

Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.

Python 48 1 Updated Dec 14, 2024

daixiangzi / Awesome-Token-Compress

A paper list of recent works on token compression for ViT and VLM

338 17 Updated Feb 9, 2025

mit-han-lab / spatten

[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

Scala 82 8 Updated Aug 27, 2024

Starred topics

Awesome Lists