Stars
Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents
An open infrastructure to democratize and decentralize the development of superintelligence for humanity.
Large-scale text-video dataset. 10 million captioned short videos.
Official inference repo for FLUX.1 models
Gameboy Emulator written in Rust and WebAssembly. 8-bit microprocessor: Sharp LR35902.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Code release for https://kovenyu.com/WonderWorld/
Focused on fast experimentation and simplicity
This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Latent Program Network (from the "Searching Latent Program Spaces" paper)
We write your reusable computer vision tools. 💜
Unicode-based scientific plotting for working in the terminal
Official implementation of SyncDiffusion.
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
A CLI interface for Marp and Marpit based converters
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Unofficial implementation of RealFill
Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
A Spotify player in the terminal with full feature parity
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…