Stars
The leader in Next-Generation Customer Data Infrastructure
[ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text
Tool to give AI generated score, to analyze with patterns how much input text is AI generated using AWS Bedrock Llama 3.1 405B
1000UserGuide:对独立开发者和创业者来说,找到前1000个早期用户太关键了。这里精心整理了300多个国内外渠道,适合独立开发者和创业者推广产品的渠道。
AigoTools can help users quickly create and manage website directory, with built-in site auto-crawling features, and also provides internationalization, SEO, image storage, and other functions. It …
AI 导航是一个现代化的人工智能网站导航系统,致力于帮助用户发现、分享和管理优质的 AI 工具与资源。项目采用最新的 Web 技术栈构建,提供流畅的用户体验和强大的管理功能。
基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。
A Conversational Speech Generation Model
A lightweight, powerful framework for multi-agent workflows
Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves
No fortress, purely open ground. OpenManus is Coming.
《高军 AI 日报》: 每天花 1 分钟时间,获取精选的前沿 AI 信息。内容涵盖但不限于 前沿 AI 资讯、AI 工具、AI 绘画、开源项目和学习教程 等等。
心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1
Wan: Open and Advanced Large-Scale Video Generative Models
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
BizyAir: Comfy Nodes that can run in any environment.
Diffusers wrapper to run Kwai-Kolors model
FlashMLA: Efficient MLA decoding kernels
SkyReels V1: The first and most advanced open-source human-centric video foundation model
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
PantoMatrix: Generating Face and Body Animation from Speech
[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation
Full Stack application for retrieving Stock Data and News using LLM, LangChain and LangGraph