Starred repositories
verl: Volcano Engine Reinforcement Learning for LLMs
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
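Intelligent Xianyu customer-service bot system: an AI attendant solution built specifically for the Xianyu platform, providing 24/7 automated coverage with multi-expert collaborative decision-making, intelligent price negotiation, and context-aware dialogue.
[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant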
A lightweight, powerful framework for multi-agent workflows
Download market data from Yahoo! Finance's API
No fortress, purely open ground. OpenManus is Coming.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Train transformer language models with reinforcement learning.
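Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models.
Support repository for the book 「つくりながら学ぶ!深層強化学習」 (roughly, "Learn by Building! Deep Reinforcement Learning").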
Fully open reproduction of DeepSeek-R1
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
DeepBattler - Your BEST LLM Battlegrounds Coach/Friend!
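Use GitHub Actions to mirror Docker images hosted abroad into a private Alibaba Cloud registry for use on servers in mainland China. Free and easy to use.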
🥑 Language focused docker images, minus the operating system.