Skip to content
View yangxl17's full-sized avatar
  • Tsinghua University
  • Beijijing China

Highlights

  • Pro

Block or report yangxl17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🏠 MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes

17 Updated Nov 29, 2024

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 7,923 606 Updated Aug 18, 2024

mnist_trained_model with torch

Python 3 Updated Jun 29, 2022

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 431 26 Updated Feb 10, 2025

Verilog Sigmoid and Tanh functions which can be configured and added to your neural network project

Verilog 16 3 Updated Mar 9, 2020
SystemVerilog 162 42 Updated Apr 8, 2024

在FPGA上面实现一个NPU计算单元。能够执行矩阵运算(ADD/ADDi/ADDs/MULT/MULTi/DOT等)、图像处理运算(CONV/POOL等)、非线性映射(RELU/TANH/SIGM等)。

Verilog 217 45 Updated Aug 16, 2018

Implementation of an NPU that can be integrated into a RISC- V core through X-Interface.

SystemVerilog 10 4 Updated Sep 17, 2024

Project is about designing a Trained Neural Network on FPGA to classify an Image Input using CNN.

Verilog 135 26 Updated Dec 13, 2020

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

JavaScript 1,440 175 Updated Feb 25, 2025
Verilog 56 11 Updated Jan 20, 2024

Dataflow compiler for QNN inference on FPGAs

Python 792 251 Updated Mar 6, 2025

Open source machine learning accelerators

Scala 373 30 Updated Mar 24, 2024

xkDLA:XinKai Deep Learning Accelerator (RTL)

Verilog 28 6 Updated Jan 15, 2024

Opensource software/hardware platform to build edge AI solutions deployed on FPGA or custom ASIC hardware.

C++ 247 37 Updated Aug 21, 2024

Run LeNet-5 baremetal

C 8 Updated Apr 24, 2021

An open-source parameterizable NPU generator with full-stack multi-target compilation stack for intelligent workloads.

Python 46 3 Updated Feb 26, 2025

Sparse CNN Accelerator targeting Intel FPGA

C++ 11 1 Updated Aug 26, 2021

General CNN_Accelerator design.卷积神经网络加速器设计。在PYNQ-Z2 FPGA开发板上实现了卷积池化全连接层等硬件加速计算。

VHDL 33 3 Updated Mar 6, 2025

基于FPGA的数字识别-实时视频处理的定点卷积神经网络实现

Verilog 301 67 Updated May 2, 2023

Low level design of a chip built for optimizing/accelerating CNN classifiers over gray scale images.

VHDL 12 3 Updated May 14, 2019

Reconfigurable implementation and evaluation of the Bit Pragmatic Deep Learning Inference engine

VHDL 4 3 Updated Dec 30, 2019

XJTU-Tripler is based on HiPU100, an FPGA-friendly DNN accelerator, developed by CAG, Institute of AI & Robotics, XJTU.

VHDL 175 45 Updated Jan 10, 2024
Verilog 9 3 Updated Apr 16, 2022
Verilog 13 Updated Apr 24, 2023

[TECS'23] A project on the co-design of Accelerators and CNNs.

Verilog 20 3 Updated Dec 10, 2022

FPGA accelerated TinyYOLO v2 object detection neural network

HTML 70 19 Updated Jul 31, 2018

This is a repo for my research on reconfigureable hardware

VHDL 2 1 Updated Oct 12, 2021
Verilog 16 3 Updated Apr 6, 2022
Next