Interpretation, extension, and reproduction of the DeepSeek series of work.

Python 583 44 Updated Feb 15, 2025

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 1,202 172 Updated Mar 8, 2025

The Python code to reproduce the illustrations from The Hundred-Page Machine Learning Book.

Python 1,904 569 Updated Jun 27, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 43,605 3,897 Updated Mar 10, 2025

LIMO: Less is More for Reasoning

Python 822 36 Updated Feb 24, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,252 69 Updated Mar 7, 2025

PyTorch distributed tutorials

Jupyter Notebook 115 25 Updated Feb 23, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 70,339 7,585 Updated Mar 9, 2025

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 920 68 Updated Mar 7, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 883 53 Updated Mar 4, 2025

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 80,201 11,730 Updated Mar 10, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,872 6,091 Updated Mar 10, 2025

Official Repo for Open-Reasoner-Zero

Python 1,555 73 Updated Mar 5, 2025

NanoGPT (124M) in 3 minutes

Python 2,364 258 Updated Mar 9, 2025

Exploring Applications of GRPO

Python 104 9 Updated Feb 16, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 82,132 9,882 Updated Mar 10, 2025

Minimal-cost training of a 0.5B R1-Zero

Python 617 80 Updated Feb 26, 2025

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,284 393 Updated Feb 12, 2025

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

Python 331 25 Updated Feb 17, 2025

Democratizing Reinforcement Learning for LLMs

Python 1,959 171 Updated Feb 16, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 34,019 2,466 Updated Mar 10, 2025

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 33,049 3,138 Updated Mar 4, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,530 423 Updated Mar 10, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,057 134 Updated Mar 3, 2025

Witness the aha moment of VLM with less than $3.

Python 3,107 245 Updated Mar 1, 2025

This is a replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,102 229 Updated Feb 19, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,053 1,407 Updated Feb 1, 2025

Fully open reproduction of DeepSeek-R1

Python 22,450 2,015 Updated Mar 9, 2025