Skip to content
View laze44's full-sized avatar
  • Xi an Jiaotong University

Block or report laze44

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GPGPU microprocessor architecture

C 2,030 356 Updated Nov 8, 2024
229 22 Updated Dec 13, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 46,503 5,530 Updated Dec 18, 2024

FastVideo is an open-source framework for accelerating large video diffusion model.

Python 800 47 Updated Jan 8, 2025

Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.

Python 46 21 Updated Jan 7, 2025

CXL-DMSim: A Full-System CXL Disaggregated Memory Simulator Based on gem5

C++ 45 10 Updated Nov 14, 2024

đź‘» Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 21,804 507 Updated Jan 8, 2025

veRL: Volcano Engine Reinforcement Learning for LLM

Python 612 48 Updated Jan 8, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,138 541 Updated Jan 2, 2025

Parametric floating-point unit with support for standard RISC-V formats and operations as well as transprecision formats.

SystemVerilog 445 117 Updated Oct 23, 2024

32-bit Superscalar RISC-V CPU

Verilog 908 151 Updated Sep 18, 2021

EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).

Python 52 6 Updated Jun 14, 2024
C++ 72 30 Updated Jan 8, 2025

Xiaomi Home Integration for Home Assistant

Python 16,864 788 Updated Jan 7, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,692 2,587 Updated Jan 8, 2025

Intel® NPU Acceleration Library

Python 567 63 Updated Dec 16, 2024

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 4,946 432 Updated Jan 8, 2025

Modeling Architectural Platform

C++ 174 60 Updated Jan 8, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 5,956 1,031 Updated Jan 8, 2025

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 673 74 Updated Jan 7, 2025

A model compilation solution for various hardware

MLIR 394 43 Updated Dec 31, 2024

Efficient LLM Inference over Long Sequences

Python 335 17 Updated Dec 28, 2024

Tile primitives for speedy kernels

Cuda 1,907 92 Updated Jan 4, 2025

Pure C++ implementation of several models for real-time chatting on your computer (CPU)

C++ 477 36 Updated Jan 6, 2025

Tensor library for machine learning

C++ 11,497 1,076 Updated Jan 5, 2025

Model Compression Toolbox for Large Language Models and Diffusion Models

Python 300 23 Updated Dec 23, 2024

Lightweight, general, scalable C++ library for finite element methods

C++ 1,775 507 Updated Jan 8, 2025

A distributed KV store for disaggregated LLM inference

C++ 19 2 Updated Jan 8, 2025
Next