
Starred repositories

Video processing in Python

C++ 53 4 Updated Feb 10, 2025

VideoSys: An easy and efficient system for video generation

Python 1,908 130 Updated Jan 1, 2025

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,796 601 Updated May 31, 2024

A PyTorch Native LLM Training Framework

Python 704 38 Updated Dec 27, 2024

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,347 59 Updated Dec 10, 2024

Development repository for the Triton language and compiler

C++ 14,361 1,776 Updated Feb 12, 2025

This repository mainly collects interview questions for large language model (LLM) algorithm engineers

1,684 117 Updated Dec 26, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,378 1,099 Updated Feb 11, 2025

Fast and memory-efficient exact attention

Python 15,419 1,454 Updated Feb 11, 2025

Scale LLM Engine public repository

Python 790 61 Updated Feb 11, 2025

DeepSeek LLM: Let there be answers

Makefile 5,767 878 Updated Feb 4, 2024

Transformer related optimization, including BERT, GPT

C++ 6,012 898 Updated Mar 27, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,091 2,670 Updated Feb 12, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,139 4,923 Updated Feb 11, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,258 558 Updated Oct 28, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,737 4,615 Updated Feb 11, 2025

Sparsity-aware deep learning inference runtime for CPUs

Python 3,097 181 Updated Jul 19, 2024

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

975 52 Updated Nov 21, 2024

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Python 3,851 416 Updated Dec 20, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,682 4,231 Updated Feb 12, 2025

Large Language Model Text Generation Inference

Python 9,733 1,139 Updated Feb 11, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 37,406 5,625 Updated Feb 12, 2025

Beringei is a high performance, in-memory storage engine for time series data.

C++ 3,170 294 Updated Jul 11, 2018

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 29,942 2,850 Updated Feb 11, 2025

RISC Zero is a zero-knowledge verifiable general computing platform based on zk-STARKs and the RISC-V microarchitecture.

C++ 1,782 501 Updated Feb 12, 2025

A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)

Python 19,167 2,704 Updated Feb 12, 2025

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,187 487 Updated Mar 22, 2024