Stars
My learning notes/codes for ML SYS.
Different matrix multiplication implementation and benchmarking on CPUs
Providing simple methods to locate deep learning model problems
An address generator for populating real addresses for China, USA, UK, Germany, France and 22 other countries.一个用于填充真实地址的地址生成器,可以生成中国,美国,英国,德国,法国等22个国家的真实地址
Large Language Model (LLM) Systems Paper List
Awesome-LLM: a curated list of Large Language Model
Machine Learning Engineering Open Book
PyTorch emulation library for Microscaling (MX)-compatible data formats
Automatic GPU+CPU memory profiling, re-use and memory leaks detection using jupyter/ipython experiment containers
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
An open-source academic paper management tool.
Chinese NewsTitle Generation Project by GPT2.带有超级详细注释的中文GPT2新闻标题生成项目。
PyTorch extensions for high performance and large scale training.
Collected Alfred Workflows & Proof of Concept
Step-by-step optimization of CUDA SGEMM
An unofficial cuda assembler, for all generations of SASS, hopefully :)
Evaluating different memory managers for dynamic GPU memory
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Track emissions from Compute and recommend ways to reduce their impact on the environment.
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
Dataset of GPT-2 outputs for research in detection, biases, and more