Skip to content
View collant's full-sized avatar
😇
whispering to bots
😇
whispering to bots

Block or report collant

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 11,170 2,361 Updated Nov 26, 2024

Code for our CVPR'23 paper - "FLEX: Full-Body Grasping Without Full-Body Grasps"

Python 107 9 Updated Feb 2, 2024

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

C 4,150 419 Updated Nov 14, 2024

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically Chat…

Python 210 18 Updated May 20, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 74,663 8,922 Updated Jan 4, 2025

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,504 388 Updated Apr 3, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,697 4,317 Updated Aug 19, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,534 2,926 Updated Sep 2, 2024

Official implementation of "Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation"

Python 723 42 Updated Feb 9, 2024

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…

Python 1,702 105 Updated Aug 29, 2023

the AI-native open-source embedding database

Rust 17,051 1,412 Updated Jan 19, 2025

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,603 1,436 Updated Sep 5, 2024

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 7,057 550 Updated Jan 19, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,515 4,592 Updated Jan 18, 2025

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,801 377 Updated Mar 14, 2024

The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).

Python 474 41 Updated May 1, 2024

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Python 4,106 357 Updated May 6, 2023

LLM as a Chatbot Service

Python 3,298 379 Updated Nov 20, 2023

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,765 2,224 Updated Jul 29, 2024

X-Avatar: Expressive Human Avatars (CVPR2023)

Python 386 24 Updated May 27, 2024

An orchestrator for VAM Imposter plugin. My patreon: https://patreon.com/TwinWin

HTML 17 6 Updated Feb 22, 2024