ROCm Communication Collectives Library (RCCL)
C++ Other Updated Nov 20, 2024
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Nov 15, 2024
ml-engineering Public
Forked from stas00/ml-engineering
Machine Learning Engineering Open Book
Python Creative Commons Attribution Share Alike 4.0 International Updated Nov 12, 2024
RecoNIC Public
Forked from Xilinx/RecoNIC
RecoNIC is a software/hardware shell used to enable network-attached processing within an RDMA-featured SmartNIC for scale-out computing.
SystemVerilog MIT License Updated Oct 23, 2024
gem5-NVDLA Public
Forked from suchandler96/gem5-NVDLA
C++ BSD 3-Clause "New" or "Revised" License Updated Sep 26, 2024
nnfusion Public
Forked from microsoft/nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
C++ MIT License Updated Sep 19, 2024
ventus-gpgpu-verilog Public
Forked from THU-DSP-LAB/ventus-gpgpu-verilog
GPGPU supporting RISCV-V, developed with verilog HDL
Verilog Updated Aug 16, 2024
awesome-tensor-compilers Public
Forked from merrymercy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
Updated Jul 14, 2024
SCP-firmware Public
Forked from ARM-software/SCP-firmware
Read-only mirror of System Control Processor (SCP) firmware
C Other Updated Jul 10, 2024
awesome-artificial-intelligence Public
Forked from owainlewis/awesome-artificial-intelligence
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
Updated Jun 20, 2024
riscv-iommu Public
Forked from zero-day-labs/riscv-iommu
IOMMU IP compliant with the RISC-V IOMMU Specification v1.0
SystemVerilog Apache License 2.0 Updated Jun 13, 2024
riscv-aia Public
Forked from riscv/riscv-aia
Creative Commons Attribution 4.0 International Updated Jun 7, 2024
Chiplet-Gem5-SharedMemory Public
Forked from FCAS-LAB/Chiplet-Gem5-SharedMemory
C++ Updated Jun 4, 2024
mgpusim Public
Forked from sarchlab/mgpusim
A highly-flexible GPU simulator for AMD GPUs.
Go MIT License Updated May 28, 2024
mlc-llm Public
Forked from mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
Python Apache License 2.0 Updated May 27, 2024
awesome-compilers Public
Forked from aalhour/awesome-compilers
😎 Curated list of awesome resources on Compilers, Interpreters and Runtimes
Other Updated May 26, 2024
vulkan-sim Public
Forked from ubc-aamodt-group/vulkan-sim
Vulkan-Sim is a GPU architecture simulator for Vulkan ray tracing based on GPGPU-Sim and Mesa.
C++ Other Updated May 22, 2024
ramulator2 Public
Forked from CMU-SAFARI/ramulator2
Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…
C++ MIT License Updated May 10, 2024
tpu-mlir Public
Forked from sophgo/tpu-mlir
Machine learning compiler based on MLIR for Sophgo TPU.
C++ Other Updated Apr 15, 2024
AMD ROCm™ Software - GitHub Home
Python MIT License Updated Mar 20, 2024
brpc Public
Forked from apache/brpc
brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" mea…
C++ Apache License 2.0 Updated Feb 19, 2024
ventus-gpgpu-doc Public
Forked from THU-DSP-LAB/ventus-gpgpu-doc
documentation for ventus gpgpu
1 Updated Feb 1, 2024
iob-cache Public
Forked from IObundle/iob-cache
Verilog Configurable Cache
Verilog MIT License Updated Jan 31, 2024
FasterTransformer Public
Forked from NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
C++ Apache License 2.0 Updated Jan 15, 2024
esp Public
Forked from sld-columbia/esp
Embedded Scalable Platforms: Heterogeneous SoC architecture and IP integration made easy
C Other Updated Jan 11, 2024
awesome-ai4eda Public
Forked from Thinklab-SJTU/awesome-ai4eda
Awesome Artificial Intelligence for Electronic Design Automation Papers.
Updated Dec 28, 2023
MegCC Public
Forked from MegEngine/MegCC
MegCC is a deep learning model compiler with an ultra-lightweight runtime that is efficient and easy to port
C++ Apache License 2.0 Updated Oct 9, 2023
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
C++ MIT License Updated Sep 14, 2023