Stars
High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS
A minimalist-style personal website powered by Jekyll, based on the Minimal Mistakes theme and Jason Ansel's website
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Shuhai is a memory benchmarking tool that allows FPGA programmers to demystify the underlying details of memories, e.g., HBM and DDR4, on a Xilinx FPGA [FCCM 20]
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
A collection of pre-trained, state-of-the-art models in the ONNX format
HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code, specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Windows Calculator: A simple yet powerful calculator that ships with Windows
pku-liang / TensorLib (forked from kirliavc/tensorlib): A Spatial Accelerator Generation Framework for Tensor Algebra.
CIRCT-based HLS compilation flows, debugging, and cosimulation tools.
A collection of GCC tips. The "100" in the name may simply mean "many" here.
Flexible Intermediate Representation for RTL
A hardware synthesis framework with a multi-level paradigm
A PyTorch model-to-RTL flow for low-latency inference
A scalable High-Level Synthesis framework on MLIR
An out-of-tree MLIR dialect template.
NeuroKit2: The Python Toolbox for Neurophysiological Signal Processing
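As a quick illustration of the last entry, here is a minimal sketch of processing a simulated ECG with NeuroKit2; the duration and sampling rate are illustrative choices, not values prescribed by the project.

```python
# Minimal NeuroKit2 sketch: simulate an ECG trace and extract heart rate.
# Parameter values are illustrative only.
import neurokit2 as nk

# Simulate 10 seconds of ECG sampled at 250 Hz
ecg = nk.ecg_simulate(duration=10, sampling_rate=250)

# Clean the signal, detect R-peaks, and derive per-beat features
signals, info = nk.ecg_process(ecg, sampling_rate=250)

# "ECG_Rate" holds the instantaneous heart rate in beats per minute
print("Mean heart rate: %.1f bpm" % signals["ECG_Rate"].mean())
```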