Fine-tune Llama model with MLX

Pre-requisites

You need a Hugging Face account and agreed to the terms for Meta-Llama 3.1 8B

Checkout code

git clone https://github.com/sampot/mlx-llama-finetune

Install dependencies

poetry Install
pip install -r llama.cpp/requirements.txt

Prepare training dataset

This is a time-consuming process. In this case, the dataset from mlx-examples is used instead.

Convert to GGUF format

As Ollama can only use GGUF-formated model for inference, so we need to convert the model to GGUF with the script from llama.cpp project.

Before executing the convertion script, we need to ensure the llama.cpp version is compatible with the one in Ollama. Otherewise, Ollama might not be able to create the model due to error such as expecting tensor layers 292, got 291 instead

The tag branch b3418 is used.

cd llama.cpp
git checkout b3418

python ./llama.cpp/convert-hf-to-gguf.py --outfile ./models/model.gguf --outtype q8_0 ./models/llama3.1-spk1

Run the fine-tuned model locally with Ollama

ollama run llama3.1-spk1

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
llama.cpp @ f299aa9		llama.cpp @ f299aa9
.gitignore		.gitignore
.gitmodules		.gitmodules
Modelfile		Modelfile
README.md		README.md
lora_config.yaml		lora_config.yaml
mlx-ft.sh		mlx-ft.sh
poetry.lock		poetry.lock
prepare_data.py		prepare_data.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fine-tune Llama model with MLX

Pre-requisites

Checkout code

Install dependencies

Prepare training dataset

Convert to GGUF format

Run the fine-tuned model locally with Ollama

About

Releases

Packages

Languages

sampot/mlx-llama-finetune

Folders and files

Latest commit

History

Repository files navigation

Fine-tune Llama model with MLX

Pre-requisites

Checkout code

Install dependencies

Prepare training dataset

Convert to GGUF format

Run the fine-tuned model locally with Ollama

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages