Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 6,852 413 Updated Jan 9, 2025

gpustack / gpustack

Manage GPU clusters for running AI models

Python 1,044 96 Updated Jan 15, 2025

hkchengrex / MMAudio

[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 959 102 Updated Jan 14, 2025

ericciarla / trendFinder

Stay on top of trending topics on social media and the web with AI

TypeScript 2,449 263 Updated Jan 6, 2025

shilinyan99 / AIDE

A Sanity Check for AI-generated Image Detection

Python 48 2 Updated Jan 3, 2025

DCDmllm / AnyEdit

Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"

Jupyter Notebook 44 3 Updated Jan 15, 2025

rkinas / triton-resources

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

157 13 Updated Jan 12, 2025

krystalan / DRT-o1

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

194 8 Updated Dec 31, 2024

virattt / ai-hedge-fund

An AI Hedge Fund Team

Python 6,455 1,201 Updated Jan 15, 2025

alimama-creative / FLUX-Controlnet-Inpainting

Python 560 36 Updated Nov 22, 2024

Yuanshi9815 / OminiControl

A minimal and universal controller for FLUX.1.

Python 1,095 68 Updated Jan 9, 2025

lllyasviel / LuminaBrush

Illumination Drawing Tools for Text-to-Image Diffusion Models

488 15 Updated Dec 22, 2024

sayakpaul / diffusers-torchao

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

Python 304 9 Updated Jan 1, 2025

AaltoML / BayesVLM

Code for Post-hoc Probabilistic Vision-Language Models

2 Updated Dec 10, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 22,770 1,863 Updated Jan 12, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 9,606 930 Updated Jan 15, 2025

timothybrooks / instruct-pix2pix

Python 6,478 545 Updated Mar 3, 2024

qihao067 / CrossFlow

This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolution'

Python 124 3 Updated Dec 31, 2024

TencentARC / BrushEdit

The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"

Python 478 24 Updated Dec 26, 2024

causalfusion / causalfusion

Python 127 1 Updated Dec 17, 2024

NVIDIA / Cosmos-Tokenizer

A suite of image and video neural tokenizers

Python 1,466 58 Updated Jan 12, 2025

diffusion-face-relighting / difareli_code

Official code for DiFaReli

Python 117 5 Updated Nov 10, 2023

logtd / ComfyUI-Fluxtapoz

Nodes for image juxtaposition for Flux in ComfyUI

Python 999 46 Updated Jan 9, 2025

wangjiangshan0725 / RF-Solver-Edit

Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)

Python 360 8 Updated Dec 16, 2024

KwaiVGI / StyleMaster

[ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation

72 Updated Dec 11, 2024

mycfhs / DreamMix

The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting

Python 108 5 Updated Jan 2, 2025

Oğuzhan Ercan Oguzhanercan

Lists (25)

3D Gen

AR Image Generation

datasets

DeepFake Detection

Diffusion Backbone

Diffusion Control

Diffusion Guidance

Diffusion Optimization

Diffusion Quality Enhancement

Face

Image Editting

image enhancement

Image-Text

Inpainting

Low Computation Less Data Train

mmsy

Optimization

others

setup

Super Resolution

Translate

Video - General Tasks

Video Generation

Video Style

Voice

Stars