alpaca-lora (WIP)

This repository contains code for reproducing the Stanford Alpaca results on consumer hardware using low-rank adaptation (LoRA). Users will need to install a fork of transformers that includes LLaMA support (see Setup below).

Setup

  1. Install dependencies, including zphang's transformers fork (which adds LLaMA support):

pip install -q datasets accelerate loralib sentencepiece
pip install -q git+https://github.com/zphang/transformers@llama_push
pip install -q git+https://github.com/huggingface/peft.git

  2. Install bitsandbytes from source. (A quick check that the installed packages import correctly is sketched after this list.)
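
After installation, a snippet like the following can confirm the environment is set up. It is not part of this repository; it is a minimal, hypothetical smoke test that only checks that the packages above are importable.

# Hypothetical smoke test (not part of this repository): verify that the
# dependencies installed above can be imported in the current environment.
import importlib

for pkg in ("datasets", "accelerate", "loralib", "sentencepiece",
            "transformers", "peft", "bitsandbytes"):
    importlib.import_module(pkg)

import transformers
print("transformers version:", transformers.__version__)
print("All imports succeeded.")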

Inference

See generate.py. This script loads the decapoda-research/llama-7b-hf base model from the Hugging Face Hub together with the LoRA weights from tloen/alpaca-lora-7b, and runs inference on a specified input. Treat it as example code for using the model and modify it as needed.
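
As a rough illustration, the sketch below loads the base model in 8-bit, attaches the LoRA weights with peft, and generates from a prompt. It is an approximation of what generate.py does rather than a copy of it; the class names (LLaMAForCausalLM, LLaMATokenizer), the Alpaca-style prompt, and the generation settings are assumptions that may need adjusting for the installed transformers version.

# Rough sketch of the load-and-generate pattern; see generate.py for the
# actual implementation. Class names follow the zphang fork and may differ
# in other transformers versions.
import torch
from peft import PeftModel
from transformers import LLaMAForCausalLM, LLaMATokenizer

tokenizer = LLaMATokenizer.from_pretrained("decapoda-research/llama-7b-hf")
model = LLaMAForCausalLM.from_pretrained(
    "decapoda-research/llama-7b-hf",
    load_in_8bit=True,          # 8-bit weights via bitsandbytes
    torch_dtype=torch.float16,
    device_map="auto",          # device placement via accelerate
)
model = PeftModel.from_pretrained(model, "tloen/alpaca-lora-7b")  # apply LoRA weights
model.eval()

# Alpaca-style instruction prompt (assumed format).
prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\nTell me about alpacas.\n\n### Response:\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))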

Training

Under construction.
