Stars
Task-Aware Agent-driven Prompt Optimization Framework
No fortress, purely open ground. OpenManus is Coming.
This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essen…
Fully open reproduction of DeepSeek-R1
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Unofficial PyTorch implementation of Google AI's VoiceFilter system
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenseVoice, the LLM models are Qwen2.5-0.5B/1.5B, and there are three …
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
Zero-shot voice conversion and singing voice conversion, with real-time support
[CVPR 2025] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
TensorFlow 2.x implementation of the Vision Transformer model
Simple TensorFlow implementation of DenseNet using CIFAR-10 and MNIST
PyTorch implementation of the graph attention network
Convert a TensorFlow model to a PyTorch model via [MMdnn](https://github.com/microsoft/MMdnn) for adversarial attacks.
Consistency models trained on CIFAR-10, in JAX.
Official repo for consistency models.
Open source implementation of AlphaFold3
Open source AI coding agent. Designed for large projects and real world tasks.
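"AI R&D Efficiency: Building an AI-Assisted Coding Assistant": a guide to building your own end-to-end AI-assisted programming tool (covering the IDE plugin, model selection, dataset construction, and model fine-tuning), similar to GitHub Copilot, JetBrains AI Assistant, and AutoDev.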
⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other building blocks