cwz427

cwz427

6 followers · 12 following

Stars

datawhalechina / unlock-deepseek

DeepSeek 系列工作解读、扩展和复现。

Python 583 44 Updated Feb 15, 2025

aburkov / theLMbook

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 1,202 172 Updated Mar 8, 2025

aburkov / theMLbook

The Python code to reproduce the illustrations from The Hundred-Page Machine Learning Book.

Python 1,904 569 Updated Jun 27, 2024

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 43,605 3,897 Updated Mar 10, 2025

GAIR-NLP / LIMO

LIMO: Less is More for Reasoning

Python 822 36 Updated Feb 24, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,252 69 Updated Mar 7, 2025

chunhuizhang / pytorch_distribute_tutorials

pytorch distribute tutorials

Jupyter Notebook 115 25 Updated Feb 23, 2025

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 70,339 7,585 Updated Mar 9, 2025

huggingface / picotron_tutorial

Python 152 15 Updated Feb 13, 2025

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 920 68 Updated Mar 7, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 883 53 Updated Mar 4, 2025

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 80,201 11,730 Updated Mar 10, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,872 6,091 Updated Mar 10, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,555 73 Updated Mar 5, 2025

KellerJordan / modded-nanogpt

NanoGPT (124M) in 3 minutes

Python 2,364 258 Updated Mar 9, 2025

brendanhogan / DeepSeekRL-Extended

Exploring Applications of GRPO

Python 104 9 Updated Feb 16, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 82,132 9,882 Updated Mar 10, 2025

dhcode-cpp / X-R1

minimal-cost for training 0.5B R1-Zero

Python 617 80 Updated Feb 26, 2025

yuanzhoulvpi2017 / zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,284 393 Updated Feb 12, 2025

Gen-Verse / ReasonFlux

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

Python 331 25 Updated Feb 17, 2025

agentica-project / deepscaler

Democratizing Reinforcement Learning for LLMs

Python 1,959 171 Updated Feb 16, 2025

unslothai / unsloth

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 34,019 2,466 Updated Mar 10, 2025

philschmid / deep-learning-pytorch-huggingface

Jupyter Notebook 1,067 223 Updated Feb 27, 2025

Bin-Huang / chatbox

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 33,049 3,138 Updated Mar 4, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,530 423 Updated Mar 10, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,057 134 Updated Mar 3, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,107 245 Updated Mar 1, 2025

hkust-nlp / simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,102 229 Updated Feb 19, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,053 1,407 Updated Feb 1, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 22,450 2,015 Updated Mar 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly