Introduction | Installation | Get Started | Contributing
TorchTune is a native-PyTorch library for easily authoring, fine-tuning, and experimenting with LLMs.
The library provides:
- Native-PyTorch implementations of popular LLMs, with converters to transform checkpoints into TorchTune's format
- Training recipes for popular fine-tuning techniques with reference benchmarks and comprehensive correctness checks
- Integration with HuggingFace Datasets for training and EleutherAI's Eval Harness for evaluation (see the dataset sketch below)
- Support for distributed training using FSDP from PyTorch Distributed
- YAML configs for easily configuring training runs
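For a flavor of the HuggingFace Datasets integration, the sketch below loads the Alpaca instruction dataset directly; the dataset id `tatsu-lab/alpaca` is an assumption for illustration, and torchtune's own dataset wrappers are not shown here.

```python
# Illustrative only: loading the Alpaca dataset directly through HuggingFace
# Datasets. The dataset id "tatsu-lab/alpaca" is an assumption for this
# sketch; torchtune's own dataset wrappers are not shown.
from datasets import load_dataset

alpaca = load_dataset("tatsu-lab/alpaca", split="train")
print(alpaca[0]["instruction"])
```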
| Model | Sizes | Finetuning Methods |
|---|---|---|
| Llama2 | 7B | Full Finetuning, LoRA |
TorchTune embodies PyTorch’s design philosophy [details], especially "usability over everything else".
TorchTune is a native-PyTorch library. While we provide integrations with the surrounding ecosystem (e.g., HuggingFace Datasets, EleutherAI Eval Harness), all of the core functionality is written in PyTorch.
TorchTune is designed to be easy to understand, use and extend.
- Composition over implementation inheritance - layers of inheritance for code re-use make the code hard to read and extend
- No training frameworks - explicitly outlining the training logic makes it easy to extend for custom use cases (see the sketch after this list)
- Code duplication is preferred over unnecessary abstractions
- Modular building blocks over monolithic components
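To make the "no training frameworks" principle concrete, here is a minimal sketch of the kind of explicit, plain-PyTorch training loop this implies; `model`, `dataloader`, and `loss_fn` are placeholders, not torchtune APIs.

```python
# Minimal sketch of the "no training frameworks" idea: the full training step
# is written out in plain PyTorch, with nothing hidden behind trainer
# callbacks. `model`, `dataloader`, and `loss_fn` are placeholders, not
# torchtune APIs.
import torch

def train(model, dataloader, loss_fn, epochs=1, lr=2e-5):
    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
    model.train()
    for _ in range(epochs):
        for inputs, labels in dataloader:
            optimizer.zero_grad()
            loss = loss_fn(model(inputs), labels)
            loss.backward()   # the backward pass is explicit and easy to modify
            optimizer.step()
```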
TorchTune provides well-tested components with a high bar for correctness. The library will never be the first to provide a feature, but available features will be thoroughly tested. We provide:
- Extensive unit tests to ensure component-level numerical parity with reference implementations (illustrated after this list)
- Checkpoint tests to ensure model-level numerical parity with reference implementations
- Integration tests to ensure recipe-level performance parity with reference implementations on standard benchmarks
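As a hypothetical illustration of what a component-level parity check looks like (this is not a copy of torchtune's actual test suite):

```python
# Hypothetical illustration of a component-level numerical-parity check;
# not a copy of torchtune's actual test suite.
import torch

def check_parity(component, reference, atol=1e-5, rtol=1e-5):
    x = torch.randn(2, 8, 64)  # arbitrary activations with a plausible shape
    torch.testing.assert_close(component(x), reference(x), atol=atol, rtol=rtol)
```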
This library requires PyTorch >= 2.0. Please install locally using this guide.
Currently, torchtune must be built from source by cloning the repository and installing as follows:
```bash
git clone https://github.com/pytorch-labs/torchtune
cd torchtune
pip install -e .
```
To verify successful installation, you can run:

```bash
tune recipe list
```
And as an example, the following import should work:

```python
from torchtune.modules import TransformerDecoder
```
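For a slightly fuller sanity check, the short script below (illustrative only) combines the import above with the PyTorch >= 2.0 requirement:

```python
# Optional sanity check combining the torchtune import with the
# PyTorch >= 2.0 requirement; the prints are illustrative only.
import torch
from torchtune.modules import TransformerDecoder

print(torch.__version__)   # expect 2.0 or later
print(TransformerDecoder)  # confirms the torchtune import resolved
```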
Follow the instructions on the official meta-llama repository to ensure you have access to the Llama2 model weights. Once you have confirmed access, you can run the following command to download the weights to your local machine. This will also download the tokenizer model and a responsible use guide.
Set your environment variable `HF_TOKEN` or pass in `--hf-token` to the command in order to validate your access.
```bash
tune download --repo-id meta-llama/Llama-2-7b --hf-token <HF_TOKEN>
```
On a single GPU:

```bash
tune finetune_llm --config alpaca_llama2_finetune
```
On multiple GPUs using FSDP:

```bash
tune --nnodes 1 --nproc_per_node 4 finetune_llm --config alpaca_llama2_finetune --fsdp True
```
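Under the hood, multi-GPU runs rely on FSDP from PyTorch Distributed. As a rough sketch of that underlying primitive (plain PyTorch, not torchtune's recipe code):

```python
# Rough sketch of the underlying PyTorch Distributed primitive, not
# torchtune's recipe code; `model` is a placeholder module. When launched
# via torchrun, the environment variables read by init_process_group are
# set automatically.
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

def shard(model):
    dist.init_process_group(backend="nccl")
    return FSDP(model)  # shards parameters and gradients across ranks
```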
To copy a recipe in order to customize it, and then run it:

```bash
tune recipe cp finetune_llm my_recipe/finetune_llm.py
tune config cp alpaca_llama2_finetune my_recipe/alpaca_llama2_finetune.yaml
tune my_recipe/finetune_llm.py --config my_recipe/alpaca_llama2_finetune.yaml
```
`tune` provides functionality for launching torchtune recipes as well as local recipes. Aside from torchtune recipe utilities, it integrates with `torch.distributed.run` to support distributed job launching by default. `tune` offers everything that `torchrun` does, with the following additional functionality:
- `tune <recipe> <recipe_args>` with no optional `torchrun` options launches a single python process
- `<recipe>` and the recipe arg `<config>` can both be passed in as names instead of paths if they're included in torchtune
- `tune <path/to/recipe.py> <recipe_args>` can be used to launch local recipes
- `tune <torchrun_options> <recipe> <recipe_args>` will launch a torchrun job
- The `tune recipe` and `tune config` commands provide utilities for listing and copying packaged recipes and configs
We welcome any feature requests, bug reports, or pull requests from the community. See the CONTRIBUTING file for how to help out.