mengbingrock

Highlights

  • Pro

Organizations

@trthackthonFighters

Starred repositories

A simple tutorial of Variational AutoEncoders with Pytorch

Jupyter Notebook 323 74 Updated Feb 15, 2024
Jupyter Notebook 73 16 Updated Aug 7, 2023

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 580 50 Updated Apr 7, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,236 340 Updated Sep 27, 2024

Low-bit LLM inference on CPU with lookup table

C++ 477 34 Updated Oct 12, 2024

A simple, easy-to-hack GraphRAG implementation

Python 943 94 Updated Oct 10, 2024

Nightly release of ControlNet 1.1

Python 4,694 373 Updated Aug 8, 2024

12 Weeks, 24 Lessons, AI for All!

Jupyter Notebook 34,435 5,757 Updated Aug 30, 2024

A plugin for Jupyter Notebook to run CUDA C/C++ code

Jupyter Notebook 195 87 Updated Sep 13, 2024

An ML Systems Onboarding list

521 20 Updated Jul 23, 2024

Apple G13 GPU architecture docs and tools

HTML 539 38 Updated May 6, 2024

[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".

Jupyter Notebook 53 3 Updated Aug 1, 2024

flash attention tutorial written in python, triton, cuda, cutlass

Cuda 174 14 Updated Jun 18, 2024

LoRA (Low-Rank Adaptation) inspector for Stable Diffusion

Python 83 5 Updated Sep 20, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 9,972 819 Updated Jun 10, 2024

Metal Guide

Swift 82 9 Updated Sep 23, 2023

Generative Models by Stability AI

Python 24,338 2,709 Updated Sep 4, 2024

A simplified POC implementation of a RAG-based virtual assistant

Python 3 Updated May 15, 2024

Stable Diffusion web UI

Python 281 44 Updated Jun 26, 2024
Swift 427 33 Updated Sep 26, 2024

List of papers related to neural network quantization in recent AI conferences and journals.

438 37 Updated Sep 22, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 26,525 2,932 Updated Oct 14, 2024

Everything we actually know about the Apple Neural Engine (ANE)

2,042 75 Updated Sep 23, 2024

MLX: An array framework for Apple silicon

C++ 16,748 963 Updated Oct 13, 2024

List of Tech Company OAs. Save your time from finding them all over the internet.

1,364 87 Updated Oct 11, 2024

LLM training in simple, raw C/CUDA

Cuda 23,989 2,685 Updated Oct 2, 2024

Algorithm competition template library by 灵茶山艾府 💭💡🎈

Go 5,073 553 Updated Oct 13, 2024

Apple GPU microarchitecture

Metal 464 18 Updated Sep 22, 2024
Python 751 138 Updated Nov 29, 2023

Distribute and run LLMs with a single file.

C++ 19,735 995 Updated Oct 14, 2024