Skip to content
View JoeyZi1's full-sized avatar

Highlights

  • Pro

Block or report JoeyZi1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A reading list for homomorphic encryption

121 8 Updated Aug 1, 2024

A library for lattice-based multiparty homomorphic encryption in Go

Go 1,293 186 Updated Apr 16, 2025

Everything you want to know about Google Cloud TPU

Python 523 30 Updated Jul 16, 2024

A simple and elegant Jekyll theme for an academic personal homepage

CSS 778 659 Updated Apr 9, 2025

INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"

Python 112 12 Updated Jan 26, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 22,488 3,256 Updated Mar 5, 2025

Torch2Chip (MLSys, 2024)

Python 51 5 Updated Apr 2, 2025

PyTorch native post-training library

Python 5,094 574 Updated Apr 17, 2025

[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

Python 1,905 341 Updated Dec 14, 2023

Vitis In-Depth Tutorials

C 1,358 564 Updated Apr 14, 2025
C++ 35 35 Updated Mar 26, 2025

Create a mobile Balatro app from your Steam version of Balatro

C# 1,115 54 Updated Nov 10, 2024

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,356 1,308 Updated May 21, 2023

fastllm是c++实现,后端无依赖(仅依赖CUDA,无需依赖PyTorch)的高性能大模型推理库。 可实现单4090推理DeepSeek R1 671B INT4模型,单路可达20+tps。

C++ 3,502 357 Updated Apr 17, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 45,127 6,919 Updated Apr 17, 2025

Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.

HTML 13,878 46,747 Updated Apr 3, 2025

light-weight graph framework based on coroutines

C++ 2 Updated Oct 18, 2023

Tutorial on PhD Application

938 104 Updated May 12, 2024

Gitbook about quadcopter and crazepony.

Python 254 60 Updated Nov 26, 2015

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Python 241 34 Updated Jan 29, 2023

This is the source code of the 2021 replication for ReScience of the paper "Speedup Graph Processing by Graph Ordering" by Hao Wei, Jeffrey Xu Yu, Can Lu, and Xuemin Lin, published in Proceedings o…

C++ 10 3 Updated May 31, 2021

Fast and memory-efficient exact attention

Python 16,933 1,612 Updated Apr 13, 2025

Implementation of a Tensor Processing Unit for embedded systems and the IoT.

VHDL 456 65 Updated Jan 5, 2019

An OpenCL-based FPGA Accelerator for Convolutional Neural Networks

C 1,294 370 Updated Feb 14, 2022

用C++实现一个简单的Transformer模型。 Attention Is All You Need。

C++ 50 8 Updated Mar 11, 2021

Vitis_Accel_Examples

Makefile 535 215 Updated Apr 15, 2025

AutoSA: Polyhedral-Based Systolic Array Compiler

C++ 218 33 Updated Dec 8, 2022

【入门项目】这个仓库是用hls来实现手写数字识别CNN硬件(xilinx fpga)加速的代码

Python 74 11 Updated Aug 6, 2022

使用HLS设计一个可分卷积(High Level Synthesis)模块,以在FPGA上对其进行加速。

C 8 1 Updated Mar 6, 2021
Next