Skip to content
View Aorunfa's full-sized avatar

Block or report Aorunfa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fully open reproduction of DeepSeek-R1

Python 23,163 2,109 Updated Mar 22, 2025

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 1,317 197 Updated Mar 8, 2025

一个微调qwenVL系列的仓库

Jupyter Notebook 1 Updated Feb 25, 2025

Efficient Triton Kernels for LLM Training

Python 4,698 284 Updated Mar 22, 2025

一个用于llava 7b模型微调的仓库,主要用于理解算法设计、deepspeed分布训练、模型量化等

Python 1 Updated Feb 25, 2025

一个快速学习deepseekV3模型以及r1强化学习grpo的仓库,侧重于理解技术报告与模型设计细节,理解原理

Jupyter Notebook 3 2 Updated Mar 21, 2025

Train transformer language models with reinforcement learning.

Python 12,714 1,715 Updated Mar 22, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,464 527 Updated Mar 23, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,199 146 Updated Mar 20, 2025

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 1,803 225 Updated Aug 13, 2024

hiera微调

Jupyter Notebook 1 Updated Jan 20, 2025

clip微调

Jupyter Notebook 2 Updated Feb 19, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,926 2,404 Updated Aug 12, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,117 679 Updated Aug 5, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 33,833 3,802 Updated Mar 19, 2025

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,649 1,252 Updated Jul 23, 2024

Hiera: A fast, powerful, and simple hierarchical vision transformer.

Python 958 49 Updated Mar 2, 2024

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 14,499 2,104 Updated Jul 24, 2024

一个用于快速入门transformer的仓库,梳理相关nlp和vit模型结构、原理,训练的基本步骤及微调方法, 配套能快速学习的代码实战项目

Python 16 3 Updated Mar 19, 2025

we want to create a repo to illustrate usage of transformers in chinese

Shell 2,744 464 Updated Aug 18, 2024

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,624 426 Updated Mar 18, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

19,064 1,837 Updated Sep 19, 2024

AllenAI's post-training codebase

Python 2,830 365 Updated Mar 22, 2025

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 146 12 Updated Mar 18, 2024

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。

Python 3,731 547 Updated Mar 20, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,467 203 Updated Aug 11, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,840 1,790 Updated Mar 21, 2025

手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube

Jupyter Notebook 2,629 362 Updated Jul 15, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 16,707 1,859 Updated Feb 23, 2025

Parallel S3 and local filesystem execution tool.

Go 2,936 255 Updated Jan 17, 2025
Next