Stars
A reading list for homomorphic encryption
A library for lattice-based multiparty homomorphic encryption in Go
Everything you want to know about Google Cloud TPU
A simple and elegant Jekyll theme for an academic personal homepage
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
Create a mobile Balatro app from your Steam version of Balatro
A TensorFlow Implementation of the Transformer: Attention Is All You Need
fastllm is a high-performance large-model inference library implemented in C++ with a dependency-free backend (only CUDA is required; no PyTorch dependency). It can run inference on the DeepSeek R1 671B INT4 model with a single RTX 4090, reaching 20+ tokens/s per stream.
A high-throughput and memory-efficient inference and serving engine for LLMs
GitHub Pages template based on HTML and Markdown for personal, portfolio-based websites.
A GitBook about quadcopters and Crazepony.
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
This is the source code of the 2021 replication for ReScience of the paper "Speedup Graph Processing by Graph Ordering" by Hao Wei, Jeffrey Xu Yu, Can Lu, and Xuemin Lin, published in Proceedings o…
Fast and memory-efficient exact attention
Implementation of a Tensor Processing Unit for embedded systems and the IoT.
An OpenCL-based FPGA Accelerator for Convolutional Neural Networks
A simple Transformer model implemented in C++ (Attention Is All You Need).
[Beginner project] This repository contains HLS code for hardware acceleration of a handwritten-digit-recognition CNN on a Xilinx FPGA.
A separable-convolution module designed with HLS (High-Level Synthesis) for acceleration on an FPGA.
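Several entries above center on the "Attention Is All You Need" Transformer (the TensorFlow implementation, the C++ port, FlashAttention, vLLM). As a minimal sketch of the mechanism they all build on, here is scaled dot-product attention in plain NumPy; the function name and toy shapes are illustrative, not taken from any of the listed repositories:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Compute softmax(Q K^T / sqrt(d_k)) V, the core Transformer operation."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # similarity of each query to each key
    # numerically stable row-wise softmax over the key dimension
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights

# Toy example: 3 tokens, head dimension d_k = 4
rng = np.random.default_rng(0)
Q = rng.standard_normal((3, 4))
K = rng.standard_normal((3, 4))
V = rng.standard_normal((3, 4))
out, weights = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (3, 4): one d_k-dimensional output per query token
```

Libraries like FlashAttention compute exactly this quantity but tile the `scores` matrix so it never fully materializes in memory, which is where the "memory-efficient exact attention" in the description comes from.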