Stars
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Instant voice cloning by MIT and MyShell. Audio foundation model.
official repository of aiXcoder-7B Code Large Language Model
🔉 spafe: Simplified Python Audio Features Extraction
Windows system utilities to maximize productivity
.NET 6.0 API for User Management, Authentication and Registration
Plugin.Maui.Audio provides the ability to play audio inside a .NET MAUI application
Integrate cutting-edge LLM technology quickly and easily into your apps
Utilities for clustering of audio samples
Lord of Large Language and Multi modal Systems Web User Interface
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Robust Speech Recognition via Large-Scale Weak Supervision
Barbershop: GAN-based Image Compositing using Segmentation Masks (SIGGRAPH Asia 2021)
Code and model for "Peeking into the Future: Predicting Future Person Activities and Locations in Videos", Liang et al, CVPR 2019
implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks (https://arxiv.org/abs/2105.04206)
The Free Software Media System - Server Backend & API
This repo contains the projects: 'Virtual Normal', 'DiverseDepth', and '3D Scene Shape'. They aim to solve the monocular depth estimation, 3D scene reconstruction from single image problems.
Official PyTorch implementation of "Joint Object Detection and Multi-Object Tracking with Graph Neural Networks"
Deep learning software for colorizing black and white images with a few clicks.
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
[ECCV 2020] In-Domain GAN Inversion for Real Image Editing (PyTorch code)
[AAAI 2019] Spatial Temporal Re-identification