Skip to content
View fkhawar's full-sized avatar

Block or report fkhawar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

Python 1,125 185 Updated Mar 25, 2025
Jupyter Notebook 7,411 1,302 Updated Sep 22, 2024

The n-gram Language Model

C 1,416 100 Updated Aug 5, 2024

CoreNet: A library for training deep neural networks

Jupyter Notebook 7,007 544 Updated Oct 14, 2024

Fast and Accurate ML in 3 Lines of Code

Python 8,727 1,000 Updated Apr 23, 2025

This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.

Jupyter Notebook 303 29 Updated Apr 2, 2025
Python 713 79 Updated Mar 19, 2025

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,655 285 Updated Aug 14, 2024

Open weights language model from Google DeepMind, based on Griffin.

Python 635 29 Updated Feb 20, 2025

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…

Jupyter Notebook 1,024 176 Updated Dec 27, 2020

LLM training in simple, raw C/CUDA

Cuda 26,424 3,040 Updated Oct 2, 2024

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 7,344 815 Updated Aug 24, 2023

For educational materials related to the spinning up workshops.

TeX 200 48 Updated Feb 12, 2019

An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.

Jupyter Notebook 1,145 364 Updated Jul 14, 2023
Jupyter Notebook 150 121 Updated Jan 17, 2023

Code behind Arxiv Papers

Python 513 60 Updated Apr 2, 2024
Jupyter Notebook 4,113 543 Updated Mar 28, 2024

VMamba: Visual State Space Models,code is based on mamba

Python 2,556 175 Updated Mar 7, 2025

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,378 231 Updated Feb 13, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 43,600 6,565 Updated Apr 23, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,953 294 Updated Feb 27, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 40,850 6,763 Updated Dec 9, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,597 912 Updated Jul 1, 2024

Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI

Python 30 1 Updated Nov 11, 2024

Unofficial Implementation of Evolutionary Model Merging

Python 38 2 Updated Mar 28, 2024

Grok open release

Python 50,235 8,347 Updated Aug 30, 2024

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 11,885 1,391 Updated Apr 17, 2025

3D Visualization of an GPT-style LLM

TypeScript 4,638 520 Updated Aug 24, 2024

Notes on the Mistral AI model

Jupyter Notebook 19 5 Updated Dec 27, 2023
Next