- Athens
- https://www.turbo-play.com
Highlights
- Pro
Stars
A latent text-to-image diffusion model
🔊 Text-Prompted Generative Audio Model
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
LAVIS - A One-stop Library for Language-Vision Intelligence
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Inpaint anything using Segment Anything and inpainting models.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR
A playing cards AI that detects hand and ground and computes the best move.