paperfactory

paperfactory

16 followers · 45 following

Lists (2)

Sort

✨ Inspiration

3 repositories

llm

6 repositories

Stars

anthropics / courses

Anthropic's educational courses

Jupyter Notebook 8,522 679 Updated Nov 26, 2024

feizc / DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Python 229 10 Updated Sep 9, 2024

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,065 1,430 Updated Dec 23, 2024

lucidrains / MEGABYTE-pytorch

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

Python 632 53 Updated Dec 27, 2024

yuval-alaluf / Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

Jupyter Notebook 712 62 Updated Jan 26, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,247 6,152 Updated Dec 9, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 70,096 10,119 Updated Jan 2, 2025

karpathy / llama2.c

Inference Llama 2 in one file of pure C

C 17,751 2,133 Updated Aug 6, 2024

oobabooga / text-generation-webui

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 41,423 5,392 Updated Jan 2, 2025

geekan / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 46,338 5,513 Updated Dec 18, 2024

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,136 827 Updated Jun 10, 2024

OpenGVLab / LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,789 377 Updated Mar 14, 2024

Lightning-AI / lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,016 517 Updated Sep 6, 2024

tloen / alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,739 2,223 Updated Jul 29, 2024

nlpxucan / WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,308 725 Updated Aug 5, 2024

yuxiaochen1103 / FDT

Python 61 5 Updated Jun 16, 2023

jeykigung / P5

Python 319 45 Updated Oct 9, 2023

dair-ai / ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

16,108 1,926 Updated Jan 22, 2024

ufoym / recursive-bf

A lightweight C++ library for recursive bilateral filtering [Yang, Qingxiong. "Recursive bilateral filtering". European Conference on Computer Vision, 2012].

C++ 352 57 Updated Dec 4, 2021