Stars
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.
Code for our CVPR'23 paper - "FLEX: Full-Body Grasping Without Full-Body Grasps"
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model (a low-resource Chinese LLaMA + LoRA approach, with a structure modeled on Alpaca)
A full pipeline to fine-tune the Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the Vicuna architecture. Basically Chat…
Robust Speech Recognition via Large-Scale Weak Supervision
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
🔊 Text-Prompted Generative Audio Model
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Official implementation of "Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation"
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
The AI-native open-source embedding database
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Instruct-tune LLaMA on consumer hardware
X-Avatar: Expressive Human Avatars (CVPR2023)
An orchestrator for the VAM Imposter plugin. My Patreon: https://patreon.com/TwinWin