Skip to content
View TonyHu2001s's full-sized avatar

Block or report TonyHu2001s

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

BookSim 2.0

C++ 294 166 Updated Jun 24, 2024

STONNE: A Simulation Tool for Neural Networks Engines

C++ 122 30 Updated May 30, 2024

Repository to host and maintain scale-sim-v2 code

Python 253 103 Updated Jan 6, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 57,785 5,895 Updated Aug 24, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 21,384 3,138 Updated Jan 4, 2025

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

JavaScript 1,396 174 Updated Jan 1, 2025

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 330 12 Updated Jan 4, 2025

The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction".

Python 50 Updated Jan 8, 2025

A curated list for Efficient Large Language Models

Python 1,387 103 Updated Dec 30, 2024
Python 48 5 Updated Jun 24, 2024

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 269 62 Updated Dec 11, 2024

Here are some implementations of basic hardware units in RTL language (verilog for now), which can be used for area/power evaluation and support the hardware design tradeoff.

Verilog 10 7 Updated Aug 25, 2023

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,247 2,992 Updated Jan 10, 2025

A primitive library for neural network

C++ 1,305 217 Updated Nov 24, 2024

📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, in…

C++ 35,204 8,012 Updated Mar 19, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,456 855 Updated Jan 6, 2025

SMAUG: Simulating Machine Learning Applications Using Gem5-Aladdin

C++ 102 27 Updated Jan 4, 2023

This is the top-level repository for the Accel-Sim framework.

Python 322 120 Updated Oct 23, 2024

ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference

C++ 82 13 Updated Dec 11, 2024

A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…

C++ 603 211 Updated Aug 29, 2023

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,648 703 Updated Dec 24, 2024

Collect some IC textbooks for learning.

117 51 Updated Aug 11, 2022

Research about dataflow architecture

8 Updated Nov 30, 2023

SpotServe: Serving Generative Large Language Models on Preemptible Instances

109 9 Updated Feb 22, 2024

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 555 38 Updated Jan 10, 2025

Open deep learning compiler stack for Kendryte AI accelerators ✨

C# 758 185 Updated Jan 10, 2025

Open-source high-performance RISC-V processor

Scala 5,813 717 Updated Jan 10, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 4,617 533 Updated Oct 22, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,495 5,120 Updated Jan 10, 2025
Next