79 stars written in Python

Stable Diffusion web UI

Python 146,131 27,391 Updated Dec 28, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 45,602 5,435 Updated Dec 18, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 40,979 5,238 Updated Jun 27, 2024

Making large AI models cheaper, faster and more accessible

Python 39,021 4,352 Updated Jan 21, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 30,537 3,034 Updated Jan 7, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 21,791 2,151 Updated Jan 20, 2025

SOTA Open Source TTS

Python 18,515 1,398 Updated Jan 18, 2025

Rembg is a tool to remove image backgrounds

Python 17,752 1,925 Updated Jan 19, 2025

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/

Python 7,755 769 Updated Feb 11, 2024

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,600 649 Updated Aug 13, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,599 578 Updated Jan 11, 2025

Inference and training library for high-quality TTS models.

Python 4,922 509 Updated Dec 10, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,617 349 Updated Dec 7, 2024
Python 4,281 384 Updated Sep 27, 2024

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Python 3,553 365 Updated Jun 20, 2024

Reverse Engineering: Decompiling Binary Code with Large Language Models

Python 3,377 245 Updated Oct 28, 2024

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Python 2,867 327 Updated Jan 8, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,801 224 Updated Jan 11, 2025

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Python 2,788 201 Updated Dec 5, 2023

CUDA accelerated rasterization of gaussian splatting

Python 2,509 334 Updated Jan 15, 2025

Text-to-Audio/Music Generation

Python 2,357 184 Updated Sep 29, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 2,316 222 Updated Apr 24, 2024

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Python 2,297 150 Updated Jul 12, 2024

Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs

Python 1,856 108 Updated Jan 12, 2025

Atlas: End-to-End 3D Scene Reconstruction from Posed Images

Python 1,830 221 Updated Apr 6, 2022

[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

Python 1,816 123 Updated Jul 5, 2024

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,754 64 Updated Jan 8, 2025

SpeechGPT Series: Speech Large Language Models

Python 1,326 89 Updated Jul 22, 2024

Bumble's Private Detector - a pretrained model for detecting lewd images

Python 1,318 97 Updated Nov 5, 2023

[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,078 86 Updated Oct 21, 2024