Starred repositories
verl: Volcano Engine Reinforcement Learning for LLMs
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search…
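Intelligent Xianyu customer-service bot system: an AI unattended-operation solution built specifically for the Xianyu platform, providing 7×24 automated coverage with multi-expert collaborative decision-making, intelligent price negotiation, and context-aware dialogue.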
[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
A lightweight, powerful framework for multi-agent workflows
Download market data from Yahoo! Finance's API
No fortress, purely open ground. OpenManus is Coming.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Train transformer language models with reinforcement learning.
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Support repository for the book "つくりながら学ぶ!深層強化学習" (Learn by Building! Deep Reinforcement Learning)
Fully open reproduction of DeepSeek-R1
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
DeepBattler - Your BEST LLM Battlegrounds Coach/Friend!