Skip to content
View smile2game's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report smile2game

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • vllm-dcu Public

    Python Apache License 2.0 Updated Oct 13, 2024
  • Camp Public

    Forked from PFCCLab/Camp

    飞桨护航计划集训营

    Mermaid Updated Oct 8, 2024
  • Paddle Public

    Forked from PaddlePaddle/Paddle

    PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

    C++ Apache License 2.0 Updated Oct 7, 2024
  • parafuser Public

    Parallel inference and training of diffusion models (UNet or Transformer backbone) using my custom methods alongside other open-source repositories.

    1 Apache License 2.0 Updated Sep 18, 2024
  • LLM-Viewer Public

    Forked from hahnyuan/LLM-Viewer

    Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

    Python MIT License Updated Sep 11, 2024
  • 📖A small curated list of Awesome SD/DiT/ViT/Diffusion Distributed/Caching Inference Paper with codes, such as DistriFusion, PipeFusion, AsyncDiff, DeepCache etc.

    GNU General Public License v3.0 Updated Jul 28, 2024
  • xDiT Public

    Forked from xdit-project/xDiT

    A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

    Python Apache License 2.0 Updated Jul 27, 2024
  • hello-algo Public

    Forked from krahets/hello-algo

    《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

    Java Other Updated Jul 27, 2024
  • AISystem Public

    Forked from chenzomi12/AISystem

    AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

    Jupyter Notebook Apache License 2.0 Updated Jul 14, 2024
  • pytorch Public

    Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python Other Updated Jul 9, 2024
  • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

    C++ 1 Apache License 2.0 Updated Jul 7, 2024
  • Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 1 Other Updated Jul 6, 2024
  • Simple, safe way to store and distribute tensors

    Python Apache License 2.0 Updated Jul 6, 2024
  • AI-System Public

    Forked from microsoft/AI-System

    System for AI Education Resource.

    Python Creative Commons Attribution 4.0 International Updated Jun 21, 2024
  • MeSolver Public

    Forked from FuncJ/MeSolver

    The repository maintains the source code for the article titled "Characterize and Optimize Dense Linear Solver on Multi-core CPUs."

    Updated Jun 18, 2024
  • MeAtten Public

    Forked from FuncJ/MeAtten

    The repository maintains the source code for the article titled "Optimizing Attention by Exploiting Data Reuse on ARM Multi-core CPUs."

    Makefile Updated Jun 13, 2024
  • AICAS Grand Challenge 2024: Software and Hardware Co-optimization for General Large Language Model Inference on CPU

    Python Updated Apr 8, 2024
  • [ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

    Python MIT License Updated Mar 21, 2024
  • sige Public

    Forked from lmxyy/sige

    [NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models

    Python Other Updated Mar 18, 2024
  • algorithm Public

    C++ 1 Updated Dec 8, 2023
  • instant-ngp Public

    Forked from NVlabs/instant-ngp

    Instant neural graphics primitives: lightning fast NeRF and more

    Cuda Other Updated Nov 19, 2023
  • paradigms Public

    Forked from AndyShih12/paradigms

    PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight

    Python MIT License Updated Oct 13, 2023
  • Simple samples for TensorRT programming

    Python Apache License 2.0 Updated Oct 12, 2023
  • learnCPP_CN Public

    Forked from gitgou/learnCPP_CN

    C++ 基础学习代码案例

    C++ Updated Aug 25, 2023
  • nvidia-game Public

    第一次比赛经历,很nice

    Python 1 Apache License 2.0 Updated Aug 13, 2023
  • My clone repository

    MIT License Updated Jun 1, 2023
  • HTML Updated May 5, 2023
  • BCI Public

    BCI

    Jupyter Notebook Updated Feb 24, 2023
  • bishe2017 Public

    bishe

    Updated Feb 24, 2023
  • pc-code Public

    to connect the pc and the laptop

    Jupyter Notebook MIT License Updated Feb 20, 2023