Skip to content
View pluja's full-sized avatar
💭
Staying calm
💭
Staying calm

Organizations

@ytorg

Block or report pluja

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI

Artificial Intelligence related
28 repositories
Python 773 50 Updated Sep 22, 2022

Robust Speech Recognition via Large-Scale Weak Supervision

Python 77,822 9,324 Updated Jan 4, 2025

A Stable Diffusion desktop frontend with inpainting, img2img and more!

Jupyter Notebook 1,266 85 Updated Mar 21, 2023

Stable Diffusion web UI

Python 149,139 27,852 Updated Mar 4, 2025

A simple notebook demonstrating prompt-based music generation via Mubert API

Jupyter Notebook 2,739 240 Updated May 4, 2023

Rembg is a tool to remove images background

Python 18,231 1,958 Updated Mar 7, 2025

Port of OpenAI's Whisper model in C/C++

C++ 38,339 3,999 Updated Mar 8, 2025

OpenAI Whisper ASR Webservice API

Python 2,411 437 Updated Feb 18, 2025

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python 13,839 1,025 Updated Mar 6, 2025

Your personal, fully customizable, Linux Voice Control Assistant.

Python 154 9 Updated Feb 10, 2024

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 24,620 2,501 Updated Mar 8, 2025

Easiest 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, a…

JavaScript 9,831 814 Updated Mar 10, 2025

Stable diffusion for real-time music generation (web app)

TypeScript 2,633 202 Updated Jul 22, 2024

Stable Diffusion built-in to Blender

Python 7,951 434 Updated Aug 26, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,801 1,916 Updated Nov 19, 2024

Real-time face swap for PC streaming or video calls

Python 27,785 332 Updated Nov 8, 2024

The no-code platform for building custom LLM Agents

2,930 427 Updated Jun 17, 2024

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,093 488 Updated Jul 11, 2024

one-click face swap

Python 29,429 6,654 Updated Aug 19, 2024

Handwriting Synthesis with RNNs ✏️

Python 4,453 614 Updated Jan 11, 2024

Segment Anything in High Quality [NeurIPS 2023]

Jupyter Notebook 3,823 231 Updated Dec 7, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,620 2,265 Updated Jan 15, 2025

Self-hosted AI coding assistant

Rust 30,372 1,396 Updated Mar 10, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,378 1,119 Updated Nov 14, 2024

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,165 285 Updated Jan 21, 2025

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,774 313 Updated Jan 8, 2025

Inference and training library for high-quality TTS models.

Python 5,104 538 Updated Dec 10, 2024