-
02:17
(UTC -04:00)
Highlights
- Pro
Starred repositories
A simple tutorial of Variational AutoEncoders with Pytorch
Flash Attention in ~100 lines of CUDA (forward pass only)
Speech To Speech: an effort for an open-sourced and modular GPT4-o
A simple, easy-to-hack GraphRAG implementation
Nightly release of ControlNet 1.1
12 Weeks, 24 Lessons, AI for All!
A plugin for Jupyter Notebook to run CUDA C/C++ code
[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".
flash attention tutorial written in python, triton, cuda, cutlass
LoRA (Low-Rank Adaptation) inspector for Stable Diffusion
QLoRA: Efficient Finetuning of Quantized LLMs
Generative Models by Stability AI
A simplified POC implementation of a RAG-based virtual assistant
Stable Diffusion web UI
List of papers related to neural network quantization in recent AI conferences and journals.
You like pytorch? You like micrograd? You love tinygrad! ❤️
Everything we actually know about the Apple Neural Engine (ANE)
List of Tech Company OAs. Save your time from finding them all over the internet.
Distribute and run LLMs with a single file.