OLMo: Open Language Model

Modeling, training, eval, and inference code for OLMo

Installation

pip install ai2-olmo
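
As a quick check that the install worked, you can load a released checkpoint through Hugging Face transformers. This is a minimal sketch, not part of this README's official instructions: the model name is an example, and it assumes the hf_olmo module that ships with ai2-olmo, which registers OLMo with the transformers Auto* classes.

import hf_olmo  # noqa: F401  (side effect: registers OLMo with transformers)
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("allenai/OLMo-1B")
model = AutoModelForCausalLM.from_pretrained("allenai/OLMo-1B")

inputs = tokenizer("Language modeling is", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output[0]))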

Fine-tuning

To fine-tune an OLMo model you'll first need to prepare your dataset by tokenizing and saving it to a numpy memory-mapped array. See scripts/prepare_tulu_data.py for an example with the Tulu V2 dataset, which can be easily modified for other datasets.
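
For reference, the on-disk format is just fixed-shape numpy memory-mapped arrays of token IDs plus a boolean label mask. The following is a minimal sketch, not the script itself: the sequence length, the uint16 dtype, and the pad-token fallback are assumptions you should match to your model and config; scripts/prepare_tulu_data.py remains the authoritative example.

import numpy as np

import hf_olmo  # noqa: F401  (registers the OLMo tokenizer with transformers)
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("allenai/OLMo-1B")  # example model name

seq_len = 2048  # assumption: must match max_sequence_length in your training config
texts = ["First formatted training example.", "Second formatted training example."]  # placeholder data

# Raw memory-mapped arrays, one fixed-length row of token IDs per example.
input_ids = np.memmap("input_ids.npy", dtype=np.uint16, mode="w+", shape=(len(texts), seq_len))
label_mask = np.memmap("label_mask.npy", dtype=np.bool_, mode="w+", shape=(len(texts), seq_len))

for i, text in enumerate(texts):
    ids = tokenizer(text, truncation=True, max_length=seq_len)["input_ids"]
    pad_id = tokenizer.pad_token_id if tokenizer.pad_token_id is not None else 0  # fallback assumption
    input_ids[i] = pad_id
    input_ids[i, : len(ids)] = ids
    label_mask[i] = False
    label_mask[i, : len(ids)] = True  # only real tokens contribute to the loss

input_ids.flush()
label_mask.flush()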

Next, prepare your training config. There are many examples in the configs/ directory. Make sure the model parameters match the model you're fine-tuning. To be safe, you can always start from the config that comes with the model checkpoint.
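
One quick way to check those parameters is to load the config in Python. This is a sketch that assumes olmo.config.TrainConfig, the same class scripts/train.py uses to parse configs; the path below is an example.

from olmo.config import TrainConfig

cfg = TrainConfig.load("configs/official/OLMo-7B.yaml")  # example path
# These architecture fields must match the checkpoint you pass to --load_path.
print(cfg.model.d_model, cfg.model.n_layers, cfg.model.n_heads)
print(cfg.model.max_sequence_length)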

Then launch the training job:

torchrun --nproc_per_node=8 scripts/train.py {path_to_train_config} \
    --data.paths=[{path_to_data}/input_ids.npy] \
    --data.label_mask_paths=[{path_to_data}/label_mask.npy] \
    --load_path={path_to_checkpoint} \
    --reset_trainer_state
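
Here --load_path points at the checkpoint to fine-tune from, and --reset_trainer_state discards the optimizer and learning-rate scheduler state saved with that checkpoint, so training starts fresh from the loaded weights.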
