Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
A list of awesome papers and resources of recommender system on large language model (LLM).
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
An unnecessarily tiny implementation of GPT-2 in NumPy.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
GitHub's code repository is all you need
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
LlamaIndex is the leading framework for building LLM-powered agents over your data.
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Instruction Tuning with GPT-4
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Coursera Deep Learning Specialization
Stable Diffusion web UI
🦜🔗 Build context-aware reasoning applications
Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
The RedPajama-Data repository contains code for preparing large datasets for training large language models.