Task-Aware Agent-driven Prompt Optimization Framework

Python 3,028 254 Updated Mar 21, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 38,872 6,424 Updated Mar 22, 2025

Spark-TTS Inference Code

Python 5,809 599 Updated Mar 21, 2025

This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essen…

Jupyter Notebook 3,568 410 Updated Mar 5, 2025

Prod Env

Python 409 63 Updated Oct 9, 2023

Fully open reproduction of DeepSeek-R1

Python 23,169 2,109 Updated Mar 22, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,030 1,374 Updated Mar 3, 2025
Jupyter Notebook 11 4 Updated Apr 26, 2023

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Python 1,128 227 Updated Jul 25, 2024

This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenseVoice, the LLMs are Qwen2.5-0.5B/1.5B, and there are three …

Python 554 102 Updated Mar 1, 2025

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

Python 1,221 303 Updated Feb 15, 2025

Zero-shot voice conversion and singing voice conversion, with real-time support

Python 2,036 228 Updated Mar 17, 2025

[CVPR 2025] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 1,235 154 Updated Mar 15, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,442 189 Updated Feb 14, 2025

TensorFlow 2.x implementation of the Vision Transformer model

Python 19 4 Updated Jan 29, 2021

Simple TensorFlow implementation of DenseNet using CIFAR-10 and MNIST

Python 507 195 Updated Mar 4, 2019

PyTorch implementation of a graph attention network

Python 9 3 Updated Feb 23, 2023

Convert a TensorFlow model to a PyTorch model via [MMdnn](https://github.com/microsoft/MMdnn) for adversarial attacks.

Python 84 9 Updated Dec 1, 2022

Consistency models trained on CIFAR-10, in JAX.

Jupyter Notebook 144 17 Updated Aug 22, 2023

Official repo for consistency models.

Python 6,286 425 Updated Mar 22, 2024

Open source implementation of AlphaFold3

Python 969 84 Updated Oct 7, 2024

AlphaFold 3 inference pipeline.

Python 6,267 779 Updated Mar 13, 2025

Open source AI coding agent. Designed for large projects and real world tasks.

Go 11,305 779 Updated Mar 21, 2025

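"Boosting R&D Efficiency with AI: Building an AI-Assisted Coding Assistant": a guide to building your own end-to-end AI-assisted programming tool (from IDE plugin, model selection, and dataset construction to model fine-tuning), similar to GitHub Copilot, JetBrains AI Assistant, AutoDev, and others.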

Kotlin 651 53 Updated Jul 5, 2024

⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other building blocks

TypeScript 24,784 2,464 Updated Mar 23, 2025