Skip to content
View lzhangbj's full-sized avatar

Highlights

  • Pro

Block or report lzhangbj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Emu Series: Generative Multimodal Models from BAAI

Python 1,695 86 Updated Sep 27, 2024

Official implementation of OneDiffusion paper (CVPR 2025)

Python 617 20 Updated Dec 14, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,038 450 Updated Mar 22, 2025

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,023 46 Updated Feb 23, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,358 783 Updated Mar 12, 2025

[ECCV 2024 Oral] Audio-Synchronized Visual Animation

Python 47 1 Updated Sep 12, 2024

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,144 135 Updated Mar 22, 2025

A curated list of fellowships for graduate students in Computer Science and related fields.

610 64 Updated Jan 13, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,576 4,311 Updated Mar 23, 2025

WavJourney: Compositional Audio Creation with LLMs

Python 534 44 Updated Sep 28, 2023

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 141,757 28,367 Updated Mar 22, 2025

Bring portraits to life!

Python 14,396 1,553 Updated Feb 28, 2025

A vector-quantized periodic autoencoder (VQ-PAE) for motion alignment across different morphologies with no supervision [SIGGRAPH 2024]

Python 70 10 Updated Nov 4, 2024

PyTorch extensions for high performance and large scale training.

Python 3,278 286 Updated Jan 12, 2025

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,932 1,057 Updated Mar 6, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,636 1,553 Updated Dec 25, 2024

The uncompromising Python code formatter

Python 39,954 2,561 Updated Mar 23, 2025

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,773 364 Updated Jul 10, 2024

[CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation

Python 278 14 Updated Apr 22, 2024

[CVPR 2025] Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

Python 259 22 Updated Mar 11, 2025

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 417 12 Updated Sep 2, 2024

Large-scale text-video dataset. 10 million captioned short videos.

Python 627 39 Updated Aug 14, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

Python 558 55 Updated Jul 26, 2024

Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos

Python 20 4 Updated Oct 1, 2024

[ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning"

Python 26 Updated Mar 5, 2025

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,244 229 Updated May 21, 2023

Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)

Python 53 5 Updated Feb 6, 2025

TrackVerse

Python 3 Updated Sep 18, 2024

Audio Visual Instance Discrimination with Cross-Modal Agreement

Python 128 18 Updated Aug 13, 2021

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,457 143 Updated Dec 8, 2023
Next