carpedkm

Follow

Daneul Michael Kim carpedkm

Follow

ex nihilo nihil fit

15 followers · 22 following

SNU
https://carpedkm.github.io

Highlights

Pro

Stars

83 stars written in Python

zylon-ai / private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 54,779 7,364 Updated Nov 13, 2024

XingangPan / DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,820 3,455 Updated May 18, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 23,016 2,264 Updated Dec 27, 2024

unslothai / unsloth

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 20,182 1,431 Updated Jan 9, 2025

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,856 1,040 Updated Dec 31, 2024

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,207 953 Updated Jan 8, 2025

kornia / kornia

🐍 Geometric Computer Vision Library for Spatial AI

Python 10,125 980 Updated Jan 6, 2025

andrewyng / aisuite

Simple, unified interface to multiple Generative AI providers

Python 9,655 869 Updated Jan 5, 2025

microsoft / TRELLIS

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 6,243 408 Updated Dec 27, 2024

Doubiiu / ToonCrafter

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Python 5,492 458 Updated Sep 9, 2024

Lyken17 / pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Python 4,935 529 Updated Jul 8, 2024

AILab-CVC / VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,645 353 Updated Jul 10, 2024

DepthAnything / Depth-Anything-V2

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 4,315 372 Updated Dec 22, 2024

apple / ml-depth-pro

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 3,951 278 Updated Oct 5, 2024

lllyasviel / sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Python 3,934 337 Updated Aug 30, 2024

ali-vilab / VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 3,020 267 Updated Oct 22, 2024

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,921 183 Updated Oct 31, 2024

sovrasov / flops-counter.pytorch

Flops counter for convolutional networks in pytorch framework

Python 2,847 306 Updated Sep 27, 2024

Doubiiu / DynamiCrafter

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,703 218 Updated Sep 8, 2024

baaivision / Emu3

Next-Token Prediction is All You Need

Python 1,957 77 Updated Oct 24, 2024

zengyh1900 / Awesome-Image-Inpainting

A curated list of image inpainting and video inpainting papers and resources

Python 1,956 263 Updated Nov 6, 2024

PixArt-alpha / PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,729 85 Updated Oct 31, 2024

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,724 67 Updated Jan 2, 2025

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,533 94 Updated Dec 11, 2024

IDEA-Research / MaskDINO

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,244 112 Updated Dec 20, 2023

Fantasy-Studio / Paint-by-Example

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Python 1,140 100 Updated Nov 28, 2023

FoundationVision / GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,126 86 Updated Oct 21, 2024

Yuanshi9815 / OminiControl

A minimal and universal controller for FLUX.1.

Python 1,049 65 Updated Jan 9, 2025

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 970 73 Updated Nov 18, 2024

NVlabs / FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Python 809 63 Updated Jun 2, 2024