- ZTE
- China
code practice
pyinfra turns Python code into shell commands and runs them on your servers. Execute ad-hoc commands and write declarative operations. Target SSH servers, local machine and Docker containers. Fast …
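An implementation of "Attention Is All You Need"
Python for "Deep Learning": mathematical derivations, analysis of the underlying principles, and source-level code implementations for the book Deep Learning (the "Flower Book")
Simple Text-Generator with OpenAI GPT-2 PyTorch Implementation
Llama 3 implementation, one matrix multiplication at a time
This project is a tutorial on large language model application development for beginner developers. Read it online at: https://datawhalechina.github.io/llm-universe/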
MPI programming lessons in C and executable code examples
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
An unnecessarily tiny implementation of GPT-2 in NumPy.
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Video+code lecture on building nanoGPT from scratch
📖 A curated list of Awesome LLM/VLM Inference Papers with code: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉
Unbearably fast near-real-time hybrid runtime-static type-checking in pure Python.
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
ncnn is a high-performance neural network inference framework optimized for the mobile platform
FlashInfer: Kernel Library for LLM Serving
Quick, visual, principled introduction to PyTorch code through five Colab notebooks.
Examples of how to use PyTorch's TensorIterator in C++
🚀🚀 "Large model": train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
Accessible large language models via k-bit quantization for PyTorch.
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Open Thoughts: Fully Open Data Curation for Thinking Models
Applied AI experiments and examples for PyTorch