Stars
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Foundational Models for State-of-the-Art Speech and Text Translation
We write your reusable computer vision tools. 💜
Making large AI models cheaper, faster and more accessible
Segment Anything in High Quality [NeurIPS 2023]
[CVPR2024] DisCo: Referring Human Dance Generation in Real World
Context-aware AI Sales Agent to automate sales outreach.
Official Code for DragGAN (SIGGRAPH 2023)
The first library to let you embed a developer agent in your own app!
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
ImageBind: One Embedding Space to Bind Them All
Code for "OnePose: One-Shot Object Pose Estimation without CAD Models", CVPR 2022
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
📋 A list of open LLMs available for commercial use.
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
Solving the Traveling Salesman Problem using Self-Organizing Maps
Cataclysm - Code generation library for the end game
This repo finds free parking spaces in the parking lot using only image processing
An amazing UI for OpenAI's ChatGPT (Website + Windows + macOS + Linux)