High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS

C++ 82 11 Updated Sep 27, 2024

(Minimalism Style) Powered by Jekyll, based on the Minimal Mistakes theme and Jason Ansel's website

CSS 588 750 Updated Dec 16, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 136,984 27,419 Updated Dec 31, 2024

Shuhai is a memory-benchmarking tool that lets FPGA programmers demystify the underlying details of memories, e.g., HBM and DDR4, on a Xilinx FPGA [FCCM 20]

SystemVerilog 98 20 Updated Sep 15, 2023

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 789 323 Updated Dec 20, 2024

A collection of pre-trained, state-of-the-art models in the ONNX format

Jupyter Notebook 8,094 1,418 Updated Apr 30, 2024

HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing

Python 327 92 Updated Apr 20, 2024

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,581 373 Updated Dec 4, 2024

Dataflow compiler for QNN inference on FPGAs

Python 775 247 Updated Dec 20, 2024

Windows Calculator: A simple yet powerful calculator that ships with Windows

C++ 29,936 5,454 Updated Dec 3, 2024

A Spatial Accelerator Generation Framework for Tensor Algebra.

Verilog 53 8 Updated Dec 3, 2021

CIRCT-based HLS compilation flows, debugging, and cosimulation tools.

C++ 48 7 Updated Jul 17, 2023

A collection of GCC tips. "100" may just mean "many" here.

Go 545 149 Updated Jun 1, 2022

Flexible Intermediate Representation for RTL

Scala 732 177 Updated Aug 20, 2024

A hardware synthesis framework with multi-level paradigm

C++ 37 4 Updated Sep 16, 2023

PyTorch model to RTL flow for low latency inference

Tcl 123 11 Updated Mar 15, 2024

Circuit IR Compilers and Tools

C++ 1,701 305 Updated Dec 31, 2024

A scalable High-Level Synthesis framework on MLIR

MLIR 236 50 Updated May 15, 2024

An out-of-tree MLIR dialect template.

CMake 91 22 Updated Sep 6, 2024

JPEG Encoder Verilog

Verilog 73 33 Updated Oct 31, 2022

PandA-bambu public repository

C++ 248 48 Updated Oct 7, 2024

Deep Reinforcement Learning

3,445 591 Updated Dec 10, 2022

NeuroKit2: The Python Toolbox for Neurophysiological Signal Processing

Python 1,630 429 Updated Dec 8, 2024