Skip to content
View cfeng16's full-sized avatar
🐢
🐢

Highlights

  • Pro

Block or report cfeng16

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Material for gpu-mode lectures

Jupyter Notebook 3,988 402 Updated Feb 9, 2025

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,195 55 Updated Mar 12, 2025

Official implementation of Inductive Moment Matching

Python 317 6 Updated Mar 12, 2025

[NeurIPS'24 Spotlight] Observational Scaling Laws

Jupyter Notebook 53 3 Updated Oct 2, 2024

Unified Video Action Model

Python 98 6 Updated Mar 6, 2025

Enjoy the magic of Diffusion models!

Python 7,911 705 Updated Mar 13, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 8,222 855 Updated Mar 7, 2025

[ICLR 2025] Reconstructive Visual Instruction Tuning

Python 68 3 Updated Mar 1, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,649 96 Updated Mar 7, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 1,184 82 Updated Mar 10, 2025

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Python 533 105 Updated May 1, 2023

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Python 386 28 Updated Feb 3, 2025

Official Jax Implementation of MD4 Masked Diffusion Models

Python 64 6 Updated Feb 27, 2025

[CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion model with additional semantic prior.

Python 60 5 Updated Jun 11, 2024

A curated list for awesome discrete diffusion models resources.

259 9 Updated Feb 5, 2025

[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Python 112 8 Updated Feb 19, 2025

Simple and Effective Masked Diffusion Language Model

Python 330 38 Updated Mar 3, 2025

https://coshand.cs.columbia.edu/

Python 15 1 Updated Oct 23, 2024

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 2,187 284 Updated Mar 4, 2025

Data and code for NeurIPS 2021 Paper "IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning".

Python 51 15 Updated Jan 28, 2024

Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"

Python 33 2 Updated Sep 8, 2023

Geometry Question Solver (GeoS)

Python 170 49 Updated Oct 17, 2017

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 783 48 Updated Mar 12, 2024

Fast Diffusion Models with Transformers

Python 801 107 Updated Oct 25, 2024
Python 41 1 Updated Jan 13, 2025

Code for "Differentiable Robot Rendering" (CoRL 2024)

Python 128 9 Updated Oct 22, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,213 759 Updated Mar 12, 2025

[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"

Python 137 11 Updated Dec 14, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 41,326 6,235 Updated Mar 13, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,700 2,194 Updated Feb 1, 2025
Next