Skip to content
View thevasudevgupta's full-sized avatar
🎓
enjoying hard work!
🎓
enjoying hard work!

Organizations

@analytics-club-iitm @Unbox-AI

Block or report thevasudevgupta

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Recipes to train reward model for RLHF.

Python 1,236 89 Updated Feb 9, 2025

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 506 30 Updated Oct 25, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 43,401 1,224 Updated Mar 12, 2025

Ongoing research training transformer models at scale

Python 11,713 2,636 Updated Mar 12, 2025

Puzzles for learning Triton

Jupyter Notebook 1,492 111 Updated Nov 18, 2024

Implementation of a Transformer, but completely in Triton

Python 259 16 Updated Apr 5, 2022

Flops counter for convolutional networks in pytorch framework

Python 2,867 309 Updated Jan 20, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 49,237 5,810 Updated Sep 18, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,876 541 Updated Dec 14, 2024

Helpful tools and examples for working with flex-attention

Python 679 36 Updated Mar 9, 2025

Minimalistic large language model 3D-parallelism training

Python 1,675 163 Updated Mar 10, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 33,575 3,785 Updated Mar 11, 2025

Perf monitoring CLI tool for Apple Silicon

Python 3,899 160 Updated Apr 18, 2024

Parallel computing with task scheduling

Python 13,009 1,750 Updated Mar 7, 2025

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 32,319 2,113 Updated Mar 11, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,721 1,776 Updated Mar 11, 2025

A Data Streaming Library for Efficient Neural Network Training

Python 1,250 156 Updated Mar 5, 2025

Fast Inference Solutions for BLOOM

Python 563 113 Updated Oct 9, 2024

DataComp: In search of the next generation of multimodal datasets

Python 685 56 Updated Jan 2, 2024

This project shows how to serve an ONNX-optimized image classification model as a web service with FastAPI, Docker, and Kubernetes.

Jupyter Notebook 209 37 Updated Jul 27, 2022

An open-source framework for training large multimodal models.

Python 3,841 298 Updated Aug 31, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,876 4,056 Updated Jul 17, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,832 2,227 Updated Jul 29, 2024

GPU Programming @ IIT Madras

Cuda 2 Updated May 10, 2022

GSoC'2021 | TensorFlow implementation of Wav2Vec2

Jupyter Notebook 91 29 Updated Jan 11, 2022

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,236 487 Updated Mar 22, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,472 722 Updated Dec 17, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,992 6,563 Updated Dec 9, 2024
Jupyter Notebook 574 90 Updated Oct 18, 2024
Next