Skip to content
View jiasenlu's full-sized avatar

Organizations

@GT-Vision-Lab

Block or report jiasenlu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Unified Tokenizer for Visual Generation and Understanding

Python 187 4 Updated Mar 3, 2025

Official implementation of OneDiffusion paper

Python 609 20 Updated Dec 14, 2024

Triton implementation of FlashAttention2 that adds Custom Masks.

Python 99 10 Updated Aug 14, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,159 91 Updated Feb 16, 2025

Modeling, training, eval, and inference code for OLMo

Python 5,320 568 Updated Mar 10, 2025
Python 599 30 Updated Feb 15, 2024

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,346 263 Updated Mar 10, 2025

Fine-tune Segment-Anything Model with Lightning Fabric.

Python 526 57 Updated Mar 25, 2024

Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.

Python 323 15 Updated Jul 11, 2024

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

Jupyter Notebook 358 40 Updated Jul 12, 2024

Stable diffusion for real-time music generation

Python 3,572 417 Updated Jul 22, 2024

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,231 227 Updated May 21, 2023

Kandinsky 2 — multilingual text2image latent diffusion model

Jupyter Notebook 2,782 312 Updated May 1, 2024

An 16kHz implementation of HiFi-GAN for soft-vc.

Python 96 25 Updated Jul 19, 2023

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,065 523 Updated Jul 27, 2024

A faster pytorch implementation of faster r-cnn

Python 7,753 2,329 Updated May 20, 2022

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 16,376 1,953 Updated Jan 27, 2025
Jupyter Notebook 225 28 Updated Dec 18, 2023

Model parallel transformers in JAX and Haiku

Python 6,323 891 Updated Jan 21, 2023

Unofficial implementation of Pix2SEQ

Python 165 14 Updated Oct 5, 2021

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Python 8,287 961 Updated Feb 25, 2022
Python 64 8 Updated Nov 4, 2021

Open-AI's DALL-E for large scale training in mesh-tensorflow.

Python 433 46 Updated Feb 12, 2022

Task-based datasets, preprocessing, and evaluation for sequence models.

Python 571 55 Updated Feb 27, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,872 2,603 Updated Mar 4, 2025

A TensorFlow implementation of Invertible Residual Networks

Python 19 2 Updated Dec 8, 2022

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Python 5,605 639 Updated Feb 17, 2024

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Python 208 19 Updated Sep 30, 2022

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

Python 680 82 Updated Apr 15, 2022
Next