Stars
Real-time interactive streaming digital human
This repository contains the Open Source Software components of the Iluvatar Corex IxRT. It includes the sources for IxRT plugins and deploy tools, as well as sample applications demonstrating the …
DeepSparkInference has selected 48 inference model examples, covering fields such as computer vision, natural language processing, and speech recognition. Subsequent phases will gradually expand to…
DeepSparkHub selects hundreds of application algorithms and models, covering various fields of AI and general-purpose computing, to support the mainstream intelligent computing scenarios.
The DeepSpark open platform selects hundreds of open source application algorithms and models that are deeply coupled with industrial applications, supports mainstream application frameworks, and p…
Meridian cuts through news noise by scraping hundreds of sources, analyzing stories with AI, and delivering concise, personalized daily briefs.
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
Transform Your Favorite Websites into Seamless Desktop Experiences✨! Collect your frequently used websites into a single desktop app.
[CVPR 2025] Official inference code for the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation". Project page: https://bizgen-msra.github.io/
MoLing is a computer-use and browser-use based MCP server. It is a locally deployed, dependency-free office AI assistant.
Powerful & Easy-to-Use Video Face Swapping and Editing Software
CSGHub is a brand-new open-source platform for managing LLMs, developed by the OpenCSG team. It offers both open-source and on-premise/SaaS solutions, with features comparable to Hugging Face. Gain…
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…
📝 An intelligent text editor with interactions inspired by drawing software
A web-based tool for processing images and converting documents with a simple interface
All-in-One Development Tool based on PaddlePaddle
A powerful tool for creating fine-tuning datasets for LLMs
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…
kaldi-asr/kaldi is the official location of the Kaldi project.
SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks, fine-tune transcription with beam size adjustment, and spec…
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Basic gun detection algorithm built with YOLOv7 and trained on AR-15 gun data
A web application that performs gun detection on images and videos.