Skip to content
View 2niuhe's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report 2niuhe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

ai

31 repositories

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed

Python 437 74 Updated Jun 14, 2023

Tensor library for machine learning

C++ 11,909 1,136 Updated Feb 12, 2025

LLM inference in C/C++

C++ 74,832 10,816 Updated Feb 20, 2025

Notebooks for learning deep learning

Jupyter Notebook 5,673 5,339 Updated Oct 3, 2023

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,054 768 Updated Oct 16, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,351 520 Updated Jan 6, 2025

OpenAI Triton backend for Intel® GPUs

MLIR 165 50 Updated Feb 20, 2025

A throughput-oriented high-performance serving framework for LLMs

Cuda 739 29 Updated Sep 21, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 2,106 217 Updated Feb 20, 2025

Minimalistic large language model 3D-parallelism training

Python 1,491 151 Updated Feb 19, 2025

A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support

Python 15,418 527 Updated Feb 18, 2025

Felafax is building AI infra for non-NVIDIA GPUs

Jupyter Notebook 554 34 Updated Jan 24, 2025

Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/

Python 2,239 154 Updated Feb 15, 2025

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,520 818 Updated Sep 1, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 28,065 3,173 Updated Feb 20, 2025

how to optimize some algorithm in cuda.

Cuda 1,906 165 Updated Feb 19, 2025

Distribute and run LLMs with a single file.

C++ 21,768 1,138 Updated Jan 30, 2025

Productive, portable, and performant GPU programming in Python.

C++ 26,749 2,327 Updated Jan 6, 2025

Material for gpu-mode lectures

Jupyter Notebook 3,746 379 Updated Feb 9, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 20,967 4,203 Updated Feb 20, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 7,062 538 Updated Dec 25, 2024

Quick, visual, principled introduction to pytorch code through five colab notebooks.

Jupyter Notebook 414 65 Updated Jan 13, 2025

A library to analyze PyTorch traces.

Python 333 51 Updated Feb 13, 2025

A Python package for extending the official PyTorch that can easily obtain performance on Intel platform

Python 1,741 262 Updated Feb 18, 2025

Examples of how to use PyTorch's TensorIterator in C++

C++ 1 1 Updated Apr 9, 2021

A PyTorch native library for large model training

Python 3,329 279 Updated Feb 20, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 10,986 1,160 Updated Feb 19, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 6,702 665 Updated Feb 20, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,403 1,341 Updated Feb 1, 2025

Open Thoughts: Fully Open Data Curation for Thinking Models

Python 1,237 104 Updated Feb 17, 2025