- University of California, Berkeley
- Berkeley, CA
- https://andy-yang-1.github.io/
Stars
Interact with your documents using the power of GPT, 100% privately, no data leaks
ChatGLM-6B: An Open Bilingual Dialogue Language Model
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A high-throughput and memory-efficient inference and serving engine for LLMs
Fast and memory-efficient exact attention
Ongoing research training transformer models at scale
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
SGLang is a fast serving framework for large language models and vision language models.
Running large language models on a single GPU for throughput-oriented scenarios.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Sky-T1: Train your own O1 preview model within $450
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
A curated list for Efficient Large Language Models
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Latency and Memory Analysis of Transformer Models for Training and Inference
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
Puzzles for learning Triton. Play them with minimal environment configuration!
Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers
A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders