Stars
⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Official Repo for Open-Reasoner-Zero
An implementation of local windowed attention for language modeling
Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
ClickPrompt - Streamline your prompt design, with ClickPrompt, you can easily view, share, and run these prompts with just one click. ClickPrompt 用于一键轻松查看、分享和执行您的 Prompt。
【🔞🔞🔞 内含不适合未成年人阅读的图片】基于我擅长的编程、绘画、写作展开的 AI 探索和总结:StableDiffusion 是一种强大的图像生成模型,能够通过对一张图片进行演化来生成新的图片。ChatGPT 是一个基于 Transformer 的语言生成模型,它能够自动为输入的主题生成合适的文章。而 Github Copilot 是一个智能编程助手,能够加速日常编程活动。
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
save 200 a month and use deep research right in your terminal. - port of https://github.com/dzhng/deep-research but in python
A fork to add multimodal model training to open-r1
Fully open reproduction of DeepSeek-R1
动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/
Fully local web research and report writing assistant
Awesome LLM Books: Curated list of books on Large Language Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
DSPy: The framework for programming—not prompting—language models
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.