Stars
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from comp…
Collection of Composed Image Retrieval (CIR) papers.
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,标注平台,自动化标注,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持R…
一个原创多端IM通信层框架,轻量级、高度提炼,历经10年、久经考验。可能是市面上唯一同时支持UDP+TCP+WebSocket三种协议的同类开源框架,支持 iOS、Android、Java、H5、小程序、Uniapp、鸿蒙Next,服务端基于Netty。
Rocket.Chat mobile clients
The communications platform that puts data protection first.
🕊️ The world's most advanced open source instant messaging engine for 100K~10M concurrent users https://turms-im.github.io/docs
OCR & Document Extraction using vision models
Alluxio, data orchestration for analytics and machine learning in the cloud
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
PulsarRPA Pro Edition: Empower Your Workflows with AI-Driven Web Data Extraction.
Event Study package is an open-source python project created to facilitate the computation of financial event study analysis.
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Awesome-LLM: a curated list of Large Language Model
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
A program that provides LLMs with the ability to complete complex tasks using plugins.
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
An open-source tool-augmented conversational language model from Fudan University