Skip to content
View ChengZhang-98's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report ChengZhang-98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fully open reproduction of DeepSeek-R1

Python 17,013 1,398 Updated Feb 7, 2025

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,777 528 Updated Dec 14, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,894 1,503 Updated Jan 13, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 8,715 853 Updated Feb 7, 2025

Implementation of Microscaling data formats in SystemVerilog.

SystemVerilog 13 2 Updated Aug 25, 2024

CUDA Learning guide

Cuda 307 31 Updated Jun 20, 2024

tpu-systolic-array-weight-stationary

Verilog 20 3 Updated May 7, 2021

Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.

Python 332 29 Updated Nov 26, 2024

Material for gpu-mode lectures

Jupyter Notebook 3,641 369 Updated Jan 6, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 695 55 Updated Sep 4, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,591 485 Updated Feb 7, 2025

Berkeley's Spatial Array Generator

Scala 865 182 Updated Feb 5, 2025

Kernel Tuner

Python 306 52 Updated Feb 2, 2025

All in one vscode plugin for HDL development

VHDL 455 13 Updated Feb 6, 2025

Verilog AXI components for FPGA implementation

Verilog 1,598 468 Updated Dec 7, 2023

CUDA Templates for Linear Algebra Subroutines

C++ 6,152 1,054 Updated Feb 7, 2025

A survey on Hardware Accelerated LLMs

41 5 Updated Jan 13, 2025

Machine-Learning Accelerator System Exploration Tools

Python 141 66 Updated Feb 5, 2025

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 195 29 Updated Sep 23, 2024

the resources I use to learn computer science in my spare time

3,817 354 Updated Feb 14, 2023

A modern Neovim configuration with full battery for Python, Lua, C++, Markdown, LaTeX, and more...

Lua 3,722 546 Updated Jan 28, 2025

高颜值的第三方网易云播放器,支持 Windows / macOS / Linux :electron:

Vue 30,258 4,452 Updated Nov 8, 2024

在vscode上的数字设计开发插件

Verilog 349 20 Updated Jan 27, 2023

Long Range Arena for Benchmarking Efficient Transformers

Python 5 Updated Sep 28, 2022

MLNLP: This repository is a collection of AI top conferences papers (e.g. ACL, EMNLP, NAACL, COLING, AAAI, IJCAI, ICLR, NeurIPS, and ICML) with open resource code

2,612 603 Updated Oct 18, 2022

This repository contains source code to binarize any real-value word embeddings into binary vectors.

C 47 8 Updated Jan 7, 2021