Skip to content
View rocksen's full-sized avatar

Block or report rocksen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

使用AI大模型,一键生成高清故事短视频。Generate high-definition story short videos with one click using AI large models.

Python 1,758 304 Updated Mar 12, 2025

Meridian is an MMM framework that enables advertisers to set up and run their own in-house models.

Python 998 154 Updated May 22, 2025

AI 视频笔记生成工具 让 AI 为你的视频做笔记

Python 1,579 175 Updated May 21, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 95,776 12,364 Updated May 22, 2025

Unified Backend Framework for APIs, Events and Agents

TypeScript 1,930 180 Updated May 22, 2025

【新增PDF和Office文件解析上传】安卓端全场景GPT助手,可用音量键唤起并进行语音交流,支持联网、拍照、模板、PDF和Office文件解析等 | GPT assistant for Android, activated via volume keys for voice interaction, supporting features such as networking, takin…

Java 803 114 Updated May 6, 2025

A generative speech model for daily dialogue.

Python 36,316 3,926 Updated May 6, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…

C++ 6,062 696 Updated May 22, 2025

Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling

Python 25,325 4,416 Updated May 22, 2025

AI 助手全套开源解决方案,自带运营管理后台,开箱即用。集成了 ChatGPT, Azure, ChatGLM,讯飞星火,文心一言等多个平台的大语言模型。支持 MJ AI 绘画,Stable Diffusion AI 绘画,微博热搜等插件工具。采用 Go + Vue3 + element-plus 实现。

Vue 4,255 1,031 Updated May 18, 2025

AingDesk是一款简单好用的AI助手,支持知识库、模型API、分享、联网搜索、智能体,它还在飞快成长中。 AingDesk is a simple and easy-to-use AI assistant that supports knowledge bases, model APIs, sharing, internet search, and intelligent agents.…

TypeScript 1,804 200 Updated May 8, 2025

Suna - Open Source Generalist AI Agent

TypeScript 12,458 1,765 Updated May 22, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 83,464 61,161 Updated Apr 19, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 33,923 2,731 Updated May 22, 2025

Use any LLMs (Large Language Models) for Deep Research. Support SSE API and MCP server.

JavaScript 2,421 641 Updated May 22, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 7,286 592 Updated May 3, 2025

Converts text to speech in realtime

Python 3,078 302 Updated May 14, 2025

best way to save what you love

Svelte 32,150 2,684 Updated May 22, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 1,548 129 Updated Apr 12, 2025

The Desktop AgentOS.

Python 7,279 894 Updated May 13, 2025

AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.

Python 693 63 Updated May 21, 2025

DreamO: A Unified Framework for Image Customization

Python 1,282 84 Updated May 13, 2025

A browser extension that helps users publish content to multiple social media platforms with one click.

TypeScript 1,662 158 Updated May 19, 2025

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

TypeScript 322,272 41,593 Updated May 22, 2025

Featuring powerful AI capabilities and supporting various e-book formats, it makes reading smarter and more focused.

Dart 5,107 286 Updated May 22, 2025

[SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"

Python 265 23 Updated May 12, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 921 70 Updated May 15, 2025

Have a natural, spoken conversation with AI!

Python 2,289 185 Updated May 17, 2025
Next