Skip to content
View MatthewLuo7's full-sized avatar
  • University of Alberta
  • Edmonton, Canada

Block or report MatthewLuo7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repository to host and maintain scale-sim-v2 code

Python 270 113 Updated Mar 11, 2025
C++ 32 13 Updated Jul 9, 2020

An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model

C++ 433 142 Updated Jun 25, 2024

Reorder-based post-training quantization for large language model

Python 185 11 Updated May 17, 2023
C++ 1 Updated Feb 4, 2025

Using ideas from product quantization for state-of-the-art neural network compression.

Python 146 15 Updated Aug 14, 2021
Shell 27 3 Updated Mar 28, 2024

Fast and accurate DRAM power and energy estimation tool

C++ 149 50 Updated Mar 10, 2025

New generation entropy codecs : Finite State Entropy and Huff0

C 1,368 150 Updated Mar 21, 2024
Python 9 1 Updated Nov 11, 2024

The official implementation of the EMNLP 2023 paper LLM-FP4

Python 188 16 Updated Dec 15, 2023

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 141,015 28,245 Updated Mar 11, 2025

Code accompanying the paper "Massive Activations in Large Language Models"

Python 147 9 Updated Mar 4, 2024

This is a collection of our zero-cost NAS and efficient vision applications.

Python 408 50 Updated Aug 21, 2023

Post-Training Quantization for Vision transformers.

Python 206 27 Updated Jul 19, 2022

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Python 240 34 Updated Jan 29, 2023

[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

Python 97 3 Updated Dec 23, 2024

A paper list of some recent Transformer-based CV works.

1,211 142 Updated Mar 12, 2025

Code for "Atalanta: A Bit is Worth a ``Thousand'' Tensor Values"

C 4 2 Updated Dec 7, 2023
Python 2 Updated Aug 30, 2019

[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer

Python 326 48 Updated Apr 11, 2023

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,452 4,864 Updated Feb 23, 2025

[ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers

Python 123 8 Updated Jan 10, 2024

A script for US visa appointments in Canada

Python 74 42 Updated Mar 9, 2025

Simulator for BitFusion

Python 96 23 Updated Aug 6, 2020

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,825 233 Updated Mar 3, 2025

A bit-level sparsity-awared multiply-accumulate process element.

Verilog 13 1 Updated Jul 9, 2024

A visualization and transformation of pytorch model

Jupyter Notebook 30 6 Updated Jan 8, 2020

PDPU: An Open-Source Posit Dot-Product Unit for Deep Learning Applications

SystemVerilog 38 7 Updated May 5, 2023
Next