Stars
The official C# SDK for Model Context Protocol servers and clients, maintained by Microsoft
Work-in-progress tool to reverse unity's IL2CPP toolchain.
[TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Integrate Git version control with automatic commit-and-sync and other advanced features in Obsidian.md
Sharing early versions of Ada, a personal AI Assistant built on OpenAIs Realtime API
FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
🌊 Images to → 3D Parallax effect video. A free and open source ImmersityAI alternative
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Multilingual Voice Understanding Model
A UI-Focused Agent for Windows OS Interaction.
Collect eye movement signals by Tobii Eye Tracker 5
Installation and testing of tobii eye tracker in Ubuntu 18.10
the framework/ sdk that lets you build browser controlling agents in 3 lines of code. join chat @ https://discord.gg/umgnyQU2K8
ValyrianTech / OpenVoice_server
Forked from myshell-ai/OpenVoiceAPI server for Instant voice cloning by MyShell.
AlwaysReddy is a LLM voice assistant that is always just a hotkey away.
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Instant voice cloning by MIT and MyShell. Audio foundation model.
On-device Speech Recognition for Apple Silicon
An AutoGPT agent that controls Chrome on your desktop
a state-of-the-art-level open visual language model | 多模态预训练模型