22 results for source starred repositories

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, try Sync Labs.

Python 11,283 2,377 Updated Feb 5, 2025

Code for our CVPR'23 paper - "FLEX: Full-Body Grasping Without Full-Body Grasps"

Python 108 9 Updated Feb 2, 2024

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model (a low-resource Chinese llama+lora approach, with a structure modeled on alpaca)

C 4,151 420 Updated Nov 14, 2024

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically Chat…

Python 210 18 Updated May 20, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,795 9,054 Updated Jan 4, 2025

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,531 390 Updated Apr 3, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,862 4,342 Updated Aug 19, 2024

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,545 2,923 Updated Sep 2, 2024

Official implementation of "Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation"

Python 722 42 Updated Feb 9, 2024

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…

Python 1,706 105 Updated Aug 29, 2023

the AI-native open-source embedding database

Rust 17,467 1,445 Updated Feb 7, 2025

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 15,678 1,443 Updated Sep 5, 2024

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 7,135 559 Updated Feb 7, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,692 4,612 Updated Feb 6, 2025

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,805 378 Updated Mar 14, 2024

The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).

Python 475 41 Updated May 1, 2024

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Python 4,112 357 Updated May 6, 2023

LLM as a Chatbot Service

Python 3,300 379 Updated Nov 20, 2023

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,796 2,223 Updated Jul 29, 2024

X-Avatar: Expressive Human Avatars (CVPR2023)

Python 388 24 Updated May 27, 2024

An orchestrator for the VAM Imposter plugin. My Patreon: https://patreon.com/TwinWin

HTML 17 6 Updated Feb 22, 2024