Skip to content
View hackisland's full-sized avatar

Highlights

  • Pro

Block or report hackisland

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository is the official implementation of "DisPose: Disentangling Pose Guidance for Controllable Human Image Animation"

Python 251 17 Updated Dec 24, 2024

Official repo for "IDArb: Intrinsic Decomposition for arbitrary number of input views and illuminations"

Python 25 Updated Dec 17, 2024

Code release for https://kovenyu.com/WonderWorld/

Python 367 13 Updated Dec 22, 2024

FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction

JavaScript 98 4 Updated Dec 23, 2024

Python APIs for web automation, testing, and bypassing bot-detection.

Python 8,170 1,089 Updated Dec 25, 2024

A framework to enable autonomous android and computer use using any LLM (local or remote)

Python 143 18 Updated Dec 26, 2024

HunyuanVideo GP: Large Video Generation Model - GPU Poor version

Python 28 2 Updated Dec 24, 2024

Open source Claude Artifacts – built with Llama 3.1 405B

TypeScript 4,110 819 Updated Dec 18, 2024

GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion

66 2 Updated Dec 17, 2024

An image viewer and AI-assisted editing tool that helps with curating datasets for generative AI models, finetunes and LoRA.

Python 92 5 Updated Dec 26, 2024

A minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like GPT-4o mini, GPT-4o and Claude 3.5 Sonnet(New)!

TypeScript 1,145 135 Updated Dec 27, 2024

Learning Flow Fields in Attention for Controllable Person Image Generation

Python 742 73 Updated Dec 20, 2024

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 767 75 Updated Dec 29, 2024

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 278 11 Updated Dec 18, 2024

A pipeline parallel training script for diffusion models.

Python 273 23 Updated Dec 28, 2024

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 19,763 1,393 Updated Dec 29, 2024

[NeurIPS 2024] "Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis" official implementation.

Python 21 1 Updated Dec 5, 2024

NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

247 17 Updated Dec 11, 2024

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 283 24 Updated Dec 27, 2024

We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a se…

Python 1,009 44 Updated Dec 20, 2024

Turbo3D: Ultra-fast Text-to-3D Generation

37 3 Updated Dec 7, 2024

Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance

Jupyter Notebook 65 2 Updated Dec 6, 2024

Custom Conditioning Delta (ConDelta) nodes for ComfyUI

Python 155 7 Updated Dec 10, 2024

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 631 34 Updated Dec 8, 2024

ComfyUI wrapper of catvton-flux

Python 43 7 Updated Dec 2, 2024

CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simpl…

Python 1,041 127 Updated Dec 20, 2024

🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web without worrying about infrastructure.

TypeScript 2,678 130 Updated Dec 13, 2024

A minimal and universal controller for FLUX.1.

Python 1,007 59 Updated Dec 27, 2024
Next