Stars
Unstructured data extraction platform based on LlamaIndex, Pgvector, React and Django.
ESG Insights AI simplifies ESG data analysis with advanced AI models, ensuring compliance with GRI standards. It helps asset managers assess risks, improve reporting, and make informed decisions, r…
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Composable building blocks to build Llama Apps
NVIDIA AI Blueprint for a digital human for customer service.
Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
In-browser Postgres sandbox with AI assistance (formerly postgres.new)
Modern, statically generated personal website built with Nuxt.
Official inference repo for FLUX.1 models
Easy Docker setup for Stable Diffusion with user-friendly UI
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Recipes for shrinking, optimizing, and customizing cutting-edge vision models. 💜
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
Officially recommended ChatTTS resource collection project, gathering related resources from across the web and frequently asked questions
One minute of voice data can be enough to train a good TTS model! (few-shot voice cloning)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A community open-source SaaS for building chatbots with OpenAI's Assistant API that you can add to your website.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
LlamaIndex is the leading framework for building LLM-powered agents over your data.
🔊 Text-Prompted Generative Audio Model
A high-throughput and memory-efficient inference and serving engine for LLMs
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23