Starred repositories
A Python scikit for building and analyzing recommender systems
Slides, notes, and materials for the workshop
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Learn ML engineering for free in 4 months!
Free MLOps course from DataTalks.Club
Utility for behavioral and representational analyses of Language Models
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Python scripts to interact with the Tractive GPS API with extended functionality.
An ongoing list of pandas quirks
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
Approaching (Almost) Any Machine Learning Problem
Data augmentation for NLP, presented at EMNLP 2019
Multitask-learning of a BERT backbone. Allows to easily train a BERT model with state-of-the-art method such as PCGrad, Gradient Vaccine, PALs, Scheduling, Class imbalance handling and many optimiz…
🗨 Repository to host our minBert implementation for the course 'Deep Learning for Natural Language Processing' at the University of Göttingen.
CS 224N Winter 2023 Default Final Project: Multitask BERT
Prefix-Tuning: Optimizing Continuous Prompts for Generation
CMATH: Can your language model pass Chinese elementary school math test?
Code for 3D-LLM: Injecting the 3D World into Large Language Models
CS231n: Deep Learning for Computer Vision, Stanford - Spring 2023
Public facing notes page
A quick guide (especially) for trending instruction finetuning datasets
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
some bravo or inspiring research works on the topic of curriculum learning
Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.
🚨 GROW YOUR AUDIENCE WITH HUGOBLOX! 🚀 HugoBlox is an easy, fast no-code website builder for researchers, entrepreneurs, data scientists, and developers. Build stunning sites in minutes. 适合研究人员、企业家、…
This repo contains the dataset and code in the EMNLP'23 paper: StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding.
Notes for Stanford CS224N: Natural Language Processing with Deep Learning.