  • [ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

    Python MIT License Updated Jan 3, 2025
  • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    C++ Other Updated Oct 9, 2024
  • hoi_llama.cpp Public

    Forked from HoiV/llama.cpp

    LLM inference in C/C++

    C++ MIT License Updated Jul 19, 2024
  • Hackable and optimized Transformers building blocks, supporting a composable construction.

    Python Other Updated Apr 19, 2024
  • cutlass Public

    Forked from NVIDIA/cutlass

    CUDA Templates for Linear Algebra Subroutines

    C++ Other Updated Apr 9, 2024
  • Welder Public

    Forked from nox-410/Welder

    OSDI 2023 Welder, a deep learning compiler

    Python Updated Mar 21, 2024
  • nnfusion Public

    Forked from microsoft/nnfusion

    A flexible and efficient deep neural network (DNN) compiler that generates a high-performance executable from a DNN model description.

    C++ MIT License Updated Jan 16, 2024
  • nni Public

    Forked from J-shang/nni

    An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.

    Python MIT License Updated Jun 9, 2022
  • Repository for TorchSharp examples and tutorials.

    Jupyter Notebook MIT License Updated Apr 8, 2022