GindaChen

Organizations

@UWQuickstep @open-lambda @UWHustle @laztech-vgt

Starred repositories

Universal LLM Deployment Engine with ML Compilation

Python 19,932 1,661 Updated Feb 11, 2025

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 761 56 Updated Feb 7, 2025

Sky-T1: Train your own O1 preview model within $450

Python 2,476 269 Updated Feb 11, 2025

procedural reasoning datasets

Python 356 38 Updated Feb 11, 2025

Convert any PDF into a podcast episode!

Python 2,008 224 Updated Dec 7, 2024

Minimalist LLM Framework in 100 Lines. Enable LLMs to Program Themselves.

Jupyter Notebook 459 29 Updated Feb 6, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,436 464 Updated Feb 11, 2025

A fast and efficient type assistant for Python, including tensor shape inference

Python 284 6 Updated Feb 11, 2025

[NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank

Python 36 7 Updated Nov 4, 2024

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 990 61 Updated Feb 7, 2025

The Multi-Faceted Optimizer for GenAI Workflows

Python 183 18 Updated Feb 6, 2025

Let your Claude be able to think

TypeScript 14,225 1,661 Updated Jan 23, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 956 78 Updated Feb 11, 2025

Teaching materials for the applied machine learning course at Cornell Tech (online edition)

Jupyter Notebook 1,109 268 Updated Oct 12, 2022

calflops is designed to calculate FLOPs, MACs, and parameters in various neural networks, such as Linear, CNN, RNN, GCN, and Transformer models (BERT, LLaMA, and other large language models)

Python 685 27 Updated Jun 27, 2024

Low-bit LLM inference on CPU with lookup table

C++ 667 50 Updated Jan 9, 2025

STREAM benchmark

C 366 142 Updated Apr 12, 2024

A library for advanced large language model reasoning

Python 1,824 160 Updated Feb 6, 2025

Entropy Based Sampling and Parallel CoT Decoding

Python 3,300 320 Updated Nov 13, 2024

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 185 23 Updated Feb 5, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,138 49 Updated Nov 16, 2024

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 422 26 Updated Feb 10, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 18,603 1,971 Updated Oct 15, 2024
Jupyter Notebook 3,371 1,005 Updated Jul 9, 2024

Composable building blocks to build Llama Apps

Python 7,208 851 Updated Feb 11, 2025

A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.

144 3 Updated Jan 31, 2025

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI

Python 937 42 Updated Feb 1, 2025