Lists (11)
ai-image: ai image
ai-llm: ai LLM
ai-robot: ai robot
ai-util: ai utils
android-arch: android architecture
android-ui: android ui
android-util: android utils
course-tech: technical courses
Stars
Generate high-definition short story videos with one click using large AI models.
Meridian is an MMM framework that enables advertisers to set up and run their own in-house models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Unified Backend Framework for APIs, Events and Agents
[New: PDF and Office file parsing and upload] An all-scenario GPT assistant for Android that can be woken with the volume keys for voice conversation; supports internet access, photo capture, templates, PDF and Office file parsing, and more.
A generative speech model for daily dialogue.
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Supports embedded systems, Android, iOS, HarmonyOS…
Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling
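A complete open-source AI assistant solution with a built-in operations management backend, ready to use out of the box. Integrates large language models from multiple platforms, including ChatGPT, Azure, ChatGLM, iFlytek Spark, and ERNIE Bot. Supports plugin tools such as MJ AI drawing, Stable Diffusion AI drawing, and Weibo hot search. Implemented with Go + Vue3 + element-plus.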
AingDesk is a simple and easy-to-use AI assistant that supports knowledge bases, model APIs, sharing, internet search, and intelligent agents; it is still growing rapidly.…
Suna - Open Source Generalist AI Agent
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
Use any LLM (Large Language Model) for Deep Research. Supports SSE API and MCP server.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
DreamO: A Unified Framework for Image Customization
A browser extension that helps users publish content to multiple social media platforms with one click.
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
Featuring powerful AI capabilities and supporting various e-book formats, it makes reading smarter and more focused.
[SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Have a natural, spoken conversation with AI!