Skip to content
View RuicXie's full-sized avatar

Block or report RuicXie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 5,014 488 Updated Aug 6, 2024

OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0

Python 262 85 Updated Mar 25, 2023

The related works and background techniques about Openai o1

217 9 Updated Jan 7, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 104,227 16,861 Updated Mar 25, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,601 365 Updated Mar 25, 2025
Jupyter Notebook 125 16 Updated Mar 4, 2025

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

7,350 432 Updated Jul 28, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,542 1,209 Updated May 23, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 45,261 5,533 Updated Mar 25, 2025

LLM101n: Let's build a Storyteller

32,938 1,800 Updated Aug 1, 2024

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,999 311 Updated Mar 25, 2025

比做算法的懂工程落地,比做工程的懂算法模型。

Jupyter Notebook 241 38 Updated Jan 14, 2025

Large Language Model (LLM) Systems Paper List

827 32 Updated Mar 25, 2025

📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism, etc. 🎉🎉

3,713 263 Updated Mar 25, 2025

The official Meta Llama 3 GitHub site

Python 28,550 3,335 Updated Jan 26, 2025

Awesome-LLM: a curated list of Large Language Model

22,328 1,837 Updated Mar 24, 2025

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

1,862 130 Updated Dec 26, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 48,497 5,186 Updated Jan 22, 2025

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 14,498 1,297 Updated Sep 5, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 11,255 876 Updated Mar 11, 2025

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,786 762 Updated May 31, 2024

Robust recipes to align language models with human and AI preferences

Python 5,082 437 Updated Nov 21, 2024

LLM inference in C/C++

C++ 77,168 11,199 Updated Mar 25, 2025

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,156 574 Updated Sep 23, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,130 299 Updated Nov 8, 2024

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,979 237 Updated Sep 6, 2023

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。

Python 3,745 549 Updated Mar 20, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,766 1,890 Updated Apr 30, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,092 765 Updated Oct 16, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 59,437 6,019 Updated Aug 24, 2024
Next
Showing results