brucechin

Lianke Qin brucechin

In scaling law we trust.

412 followers · 599 following

Starred repositories

RomanArzumanyan / VALI

Video processing in Python

C++ 53 4 Updated Feb 10, 2025

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,908 130 Updated Jan 1, 2025

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,796 601 Updated May 31, 2024

volcengine / veScale

A PyTorch Native LLM Training Framework

Python 704 38 Updated Dec 27, 2024

facebookresearch / MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,347 59 Updated Dec 10, 2024

triton-lang / triton

Development repository for the Triton language and compiler

C++ 14,361 1,776 Updated Feb 12, 2025

km1994 / LLMs_interview_notes

该仓库主要记录大模型（LLMs）算法工程师相关的面试题

1,684 117 Updated Dec 26, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,378 1,099 Updated Feb 11, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 15,419 1,454 Updated Feb 11, 2025

scaleapi / llm-engine

Scale LLM Engine public repository

Python 790 61 Updated Feb 11, 2025

deepseek-ai / DeepSeek-LLM

DeepSeek LLM: Let there be answers

Makefile 5,767 878 Updated Feb 4, 2024

NVIDIA / FasterTransformer

Transformer related optimization, including BERT, GPT

C++ 6,012 898 Updated Mar 27, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,091 2,670 Updated Feb 12, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,139 4,923 Updated Feb 11, 2025

FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,258 558 Updated Oct 28, 2024

sallenkey-wei / cuda-handbook

pdf

89 40 Updated May 8, 2018

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,737 4,615 Updated Feb 11, 2025

neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Python 3,097 181 Updated Jul 19, 2024

HillZhang1999 / llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

975 52 Updated Nov 21, 2024