thevasudevgupta

🎓

enjoying hard work!

Vasudev Gupta thevasudevgupta

🎓

enjoying hard work!

trying to learn what AI learns

201 followers · 110 following

Achievements

Organizations

Lists (2)

Sort

🔮 Future ideas

1 repository

Important

Important resources which you should read again & again

1 repository

Stars

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python 1,236 89 Updated Feb 9, 2025

lucidrains / ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 506 30 Updated Oct 25, 2024

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 43,401 1,224 Updated Mar 12, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 11,713 2,636 Updated Mar 12, 2025

srush / Triton-Puzzles

Puzzles for learning Triton

Jupyter Notebook 1,492 111 Updated Nov 18, 2024

Deep-Learning-Profiling-Tools / triton-viz

Python 187 15 Updated Feb 20, 2025

lucidrains / triton-transformer

Implementation of a Transformer, but completely in Triton

Python 259 16 Updated Apr 5, 2022

sovrasov / flops-counter.pytorch

Flops counter for convolutional networks in pytorch framework

Python 2,867 309 Updated Jan 20, 2025

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 49,237 5,810 Updated Sep 18, 2024

pytorch-labs / gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,876 541 Updated Dec 14, 2024

pytorch-labs / attention-gym

Helpful tools and examples for working with flex-attention

Python 679 36 Updated Mar 9, 2025

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python 1,675 163 Updated Mar 10, 2025

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 33,575 3,785 Updated Mar 11, 2025

tlkh / asitop

Perf monitoring CLI tool for Apple Silicon

Python 3,899 160 Updated Apr 18, 2024

dask / dask

Parallel computing with task scheduling

Python 13,009 1,750 Updated Mar 7, 2025

pola-rs / polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 32,319 2,113 Updated Mar 11, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,721 1,776 Updated Mar 11, 2025

mosaicml / streaming

A Data Streaming Library for Efficient Neural Network Training

Python 1,250 156 Updated Mar 5, 2025

huggingface / transformers-bloom-inference

Fast Inference Solutions for BLOOM

Python 563 113 Updated Oct 9, 2024

mlfoundations / datacomp

DataComp: In search of the next generation of multimodal datasets

Python 685 56 Updated Jan 2, 2024

sayakpaul / ml-deployment-k8s-fastapi

This project shows how to serve an ONNX-optimized image classification model as a web service with FastAPI, Docker, and Kubernetes.

Jupyter Notebook 209 37 Updated Jul 27, 2022

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 3,841 298 Updated Aug 31, 2024

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,876 4,056 Updated Jul 17, 2024

tloen / alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,832 2,227 Updated Jul 29, 2024

thevasudevgupta / gpu-programming

GPU Programming @ IIT Madras

Cuda 2 Updated May 10, 2022

thevasudevgupta / gsoc-wav2vec2

GSoC'2021 | TensorFlow implementation of Wav2Vec2

Jupyter Notebook 91 29 Updated Jan 11, 2022

cloneofsimo / lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,236 487 Updated Mar 22, 2024

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,472 722 Updated Dec 17, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,992 6,563 Updated Dec 9, 2024

LambdaLabsML / lambda-diffusers

Jupyter Notebook 574 90 Updated Oct 18, 2024