Skip to content
View adityapandey9's full-sized avatar
  • Kolkata

Block or report adityapandey9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[Arxiv 2024] Edicho: Consistent Image Editing in the Wild

73 Updated Dec 31, 2024

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 358 13 Updated Jan 2, 2025

Illumination Drawing Tools for Text-to-Image Diffusion Models

456 11 Updated Dec 22, 2024

[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

Python 303 15 Updated Dec 27, 2024

NOVA: Autoregressive Video Generation without Vector Quantization

Python 295 8 Updated Jan 3, 2025

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Python 83 8 Updated Sep 1, 2021

Prompt Depth Anything

Python 433 16 Updated Dec 17, 2024

Official repository for the paper "CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models"

117 9 Updated Dec 15, 2024

Code for "Motion-2-to-3: Leveraging 2D Motion Data to Boost 3D Motion Generation", Arxiv 2024

62 Updated Dec 22, 2024

Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)

Python 343 8 Updated Dec 16, 2024

Official Implementations for Paper - AniDoc: Animation Creation Made Easier

Python 421 24 Updated Dec 31, 2024

This repository is the official implementation of "DisPose: Disentangling Pose Guidance for Controllable Human Image Animation"

Python 291 22 Updated Dec 24, 2024

Clarity: A Minimalist Website Template for AI Research

CSS 77 7 Updated Oct 28, 2024

Official Implementation for "InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention"

87 2 Updated Dec 11, 2024

seaweedfs implemented in pure Rust

Rust 154 19 Updated Dec 4, 2024

Converts text to speech in realtime

Python 2,220 217 Updated Jan 6, 2025

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 293 47 Updated Aug 25, 2021

Official implementation of the TTS model Lina-Speech

Jupyter Notebook 145 12 Updated Nov 11, 2024

The official implementation of EmoSphere++

Python 63 5 Updated Nov 6, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,536 204 Updated Dec 5, 2024

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 333 22 Updated Jan 7, 2025

Official impl. of "MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation"

92 4 Updated Dec 5, 2024

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 6,108 392 Updated Dec 27, 2024

[SIGGRAPH Asia 2024, Best Paper Honorable Mention] This is the official implementation of our SIGGRAPH Asia journal artical: TEXGen: a Generative Diffusion Model for Mesh Textures

Python 221 6 Updated Dec 18, 2024

Official code for "ControlAR: Controllable Image Generation with Autoregressive Models"

Python 170 5 Updated Dec 22, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,111 538 Updated Jan 2, 2025

AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction

305 26 Updated Dec 4, 2024

Official implementation of "AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models"

JavaScript 217 17 Updated Jan 6, 2025
Next