Skip to content
View harveyp123's full-sized avatar

Block or report harveyp123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Implementation of "RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs"

Cuda 9 1 Updated Apr 2, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 607 30 Updated Mar 19, 2025

LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation (ICLR 2025)

Python 28 2 Updated Feb 5, 2025

Codes for "Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective" (AAAI 2025)

Python 11 Updated Dec 5, 2023

RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.

Python 52 11 Updated Feb 19, 2025

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,456 120 Updated Apr 17, 2024

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,472 1,449 Updated Apr 2, 2025

Simple RL training for reasoning

Python 3,380 248 Updated Mar 31, 2025

NanoGPT (124M) in 3 minutes

Python 2,445 272 Updated Apr 1, 2025

Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)

Python 204 22 Updated Feb 21, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 103 7 Updated Nov 19, 2024

maximal update parametrization (µP)

Jupyter Notebook 1,486 98 Updated Jul 17, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,197 495 Updated Jan 16, 2025

A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.

Python 90 9 Updated Dec 17, 2024

GRadient-INformed MoE

261 16 Updated Sep 25, 2024

Sparse Backpropagation for Mixture-of-Expert Training

Python 28 5 Updated Jul 2, 2024

Official inference repo for FLUX.1 models

Python 21,175 1,497 Updated Feb 6, 2025

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,173 90 Updated Feb 16, 2025

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 516 43 Updated Apr 1, 2025

A PyTorch native library for large model training

Python 3,536 328 Updated Apr 3, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,997 354 Updated Jul 21, 2024
Python 2,488 227 Updated Mar 20, 2025

Large Language Model Text Generation Inference

Python 9,961 1,176 Updated Apr 3, 2025

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 789 51 Updated Apr 3, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,854 509 Updated Sep 25, 2024

[CVPR 2024] Rewrite the Stars

Python 363 20 Updated May 7, 2024
Python 234 15 Updated May 1, 2024

open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality

Python 180 18 Updated Aug 2, 2024

Ring attention implementation with flash attention

Python 723 62 Updated Feb 24, 2025
Next