Guide: Fine-tune GPT-2 XL (1.5 billion parameters) and GPT-Neo (2.7B) on a single GPU with Hugging Face Transformers using DeepSpeed
Notebooks for learning deep learning
BELLE: Be Everyone's Large Language Model Engine (an open-source Chinese conversational large language model)
The official PyTorch implementation of Google's Gemma models
OpenAI Triton backend for Intel® GPUs
A throughput-oriented high-performance serving framework for LLMs
FlashInfer: Kernel Library for LLM Serving
Minimalistic large language model 3D-parallelism training
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
Felafax is building AI infra for non-NVIDIA GPUs
Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/
You like pytorch? You like micrograd? You love tinygrad! ❤️
How to optimize algorithms in CUDA.
Distribute and run LLMs with a single file.
Productive, portable, and performant GPU programming in Python.
ncnn is a high-performance neural network inference framework optimized for the mobile platform
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o (an open-source multimodal dialogue model with performance approaching GPT-4o)
Quick, visual, principled introduction to pytorch code through five colab notebooks.
A library to analyze PyTorch traces.
A Python package that extends official PyTorch to easily obtain extra performance on Intel platforms
Examples of how to use PyTorch's TensorIterator in C++
A PyTorch native library for large model training
🚀🚀 Train a 26M-parameter GPT completely from scratch in just 2 hours!
Accessible large language models via k-bit quantization for PyTorch.
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Open Thoughts: Fully Open Data Curation for Thinking Models