huangyz0918

Yizheng Huang huangyz0918

ML and System | LLM Agents

387 followers · 95 following

Los Angeles, CA
07:12 (UTC -08:00)
huangyz.name

Achievements

x3 x3

Highlights

Developer Program Member
Pro

Lists (2)

Active Learning

7 repositories

Continual Learning

16 repositories

Stars

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python · 12,978 stars · 1,567 forks · Updated Jan 28, 2025

vllm-project / production-stack

Python · 116 stars · 15 forks · Updated Jan 30, 2025

openai / openai-realtime-agents

This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.

TypeScript · 4,775 stars · 467 forks · Updated Jan 27, 2025

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Cuda · 714 stars · 29 forks · Updated Sep 21, 2024

browser-use / browser-use

Make websites accessible for AI agents

Python · 22,031 stars · 2,143 forks · Updated Jan 31, 2025

pytorch / torchtitan

A PyTorch native library for large model training

Python · 3,222 stars · 258 forks · Updated Jan 31, 2025

karpathy / nn-zero-to-hero

Neural Networks: Zero to Hero

Jupyter Notebook · 13,065 stars · 1,796 forks · Updated Aug 18, 2024

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python · 7,330 stars · 453 forks · Updated Jan 28, 2025

deepseek-ai / DeepSeek-V3

Python · 65,695 stars · 9,494 forks · Updated Jan 26, 2025

pytorch-labs / gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python · 5,772 stars · 526 forks · Updated Dec 14, 2024

SafeAILab / EAGLE

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Python · 930 stars · 100 forks · Updated Jan 2, 2025

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python · 8,144 stars · 494 forks · Updated May 3, 2024

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python · 4,261 stars · 249 forks · Updated Jan 30, 2025

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ · 9,270 stars · 1,086 forks · Updated Jan 31, 2025