Stars
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
CoreNet: A library for training deep neural networks
Fast and Accurate ML in 3 Lines of Code
This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Open weights language model from Google DeepMind, based on Griffin.
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
For educational materials related to the spinning up workshops.
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
VMamba: Visual State Space Models,code is based on mamba
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
PyTorch code and models for V-JEPA self-supervised learning from video.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI
Unofficial Implementation of Evolutionary Model Merging
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.