🥕
making things

Starred repositories

A Python scikit for building and analyzing recommender systems

Python 6,458 1,018 Updated Jun 16, 2024

Slides, notes, and materials for the workshop

308 31 Updated Jun 1, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,731 817 Updated Jan 9, 2025

Learn ML engineering for free in 4 months!

Jupyter Notebook 9,798 2,314 Updated Jan 6, 2025

Free MLOps course from DataTalks.Club

Jupyter Notebook 11,346 2,183 Updated Sep 9, 2024

Utility for behavioral and representational analyses of Language Models

Python 126 32 Updated Dec 1, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 40,968 4,375 Updated Jul 28, 2024

Python scripts to interact with the Tractive GPS API with extended functionality.

Python 6 Updated Aug 3, 2023

An ongoing list of pandas quirks

Jupyter Notebook 950 133 Updated May 8, 2023

Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.

Jupyter Notebook 3,544 557 Updated Dec 27, 2019

Approaching (Almost) Any Machine Learning Problem

7,655 1,084 Updated Mar 25, 2023

Data augmentation for NLP, presented at EMNLP 2019

Python 1,616 316 Updated Mar 19, 2023

Multitask learning of a BERT backbone. Makes it easy to train a BERT model with state-of-the-art methods such as PCGrad, Gradient Vaccine, PALs, Scheduling, Class imbalance handling and many optimiz…

Python 15 3 Updated Oct 8, 2023

🗨 Repository to host our minBert implementation for the course 'Deep Learning for Natural Language Processing' at the University of Göttingen.

Python 3 Updated Sep 4, 2023

CS 224N Winter 2023 Default Final Project: Multitask BERT

Python 25 50 Updated Mar 23, 2023

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Python 905 160 Updated Apr 26, 2024

CMATH: Can your language model pass Chinese elementary school math test?

Python 40 4 Updated Jul 3, 2023

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Python 992 59 Updated Jun 6, 2024

CS231n: Deep Learning for Computer Vision, Stanford - Spring 2023

Jupyter Notebook 4 1 Updated Mar 19, 2024

Public facing notes page

Jupyter Notebook 10,292 4,087 Updated Aug 1, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,755 176 Updated Nov 28, 2023

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 20,770 2,674 Updated Aug 15, 2024

Some excellent or inspiring research works on the topic of curriculum learning

230 16 Updated Aug 7, 2022

Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.

Jupyter Notebook 789 213 Updated Mar 31, 2022

🚨 GROW YOUR AUDIENCE WITH HUGOBLOX! 🚀 HugoBlox is an easy, fast no-code website builder for researchers, entrepreneurs, data scientists, and developers. Build stunning sites in minutes. Suitable for researchers, entrepreneurs, …

HTML 8,402 2,913 Updated Dec 23, 2024

This repo contains the dataset and code in the EMNLP'23 paper: StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding.

8 Updated Jan 4, 2025

Jupyter Notebook 2 Updated Jul 24, 2022

Notes for Stanford CS224N: Natural Language Processing with Deep Learning.

Jupyter Notebook 69 33 Updated Oct 11, 2021