Lists (1)
Sort Name ascending (A-Z)
Stars
Stable Diffusion web UI
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Making large AI models cheaper, faster and more accessible
Instant voice cloning by MIT and MyShell. Audio foundation model.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Rembg is a tool to remove images background
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Inference and training library for high-quality TTS models.
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
Reverse Engineering: Decompiling Binary Code with Large Language Models
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
CUDA accelerated rasterization of gaussian splatting
DeepSeek-VL: Towards Real-World Vision-Language Understanding
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
Atlas: End-to-End 3D Scene Reconstruction from Posed Images
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
SpeechGPT Series: Speech Large Language Models
Bumble's Private Detector - a pretrained model for detecting lewd images
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale