# Changelog

All notable changes to this project will be documented in this file.

The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/), and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## Unreleased

### Fixed

- Fixed the default value of the `--tokenizer` argument to `scripts/prepare_tulu_data.py` to be an absolute path rather than a relative path, so the script can be run from other directories.
- Added the option to directly pass input embeddings to `OLMo` and `OLMoForCausalLM`.
- Added support for Python 3.8.
- Added code to throw an error if `output_attentions` is set to `True` in a forward call to `OLMoForCausalLM`, since this functionality hasn't been implemented yet.
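The guard described in the last item can be sketched as follows; the class and argument names mirror the entry above, but the body is illustrative, not the actual OLMo implementation:

```python
class OLMoForCausalLM:
    """Illustrative stand-in for the real model class."""

    def forward(self, input_ids=None, inputs_embeds=None, output_attentions=False):
        # Attention outputs are not implemented yet, so fail loudly
        # rather than silently ignoring the flag.
        if output_attentions:
            raise NotImplementedError(
                "output_attentions is not yet supported by OLMoForCausalLM"
            )
        # ... run the forward pass on input_ids or inputs_embeds ...
```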

### Added

- Added an `output_hidden_states` argument and associated functionality to `OLMo` and `OLMoForCausalLM` to return the model's intermediate hidden states.
- Added MMLU downstream evaluation tasks.
- Added support for PyTorch v2.2.
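As a rough sketch of what the `output_hidden_states` behavior above amounts to, the hypothetical minimal function below (not the OLMo code itself) shows the usual pattern of optionally collecting every intermediate state while running a stack of layers:

```python
def forward(x, layers, output_hidden_states=False):
    """Run x through a stack of layers, optionally collecting every
    intermediate hidden state (including the input and final output)."""
    hidden_states = [x] if output_hidden_states else None
    for layer in layers:
        x = layer(x)
        if output_hidden_states:
            hidden_states.append(x)
    return x, hidden_states

# With three increment "layers", the collected states trace the computation:
out, states = forward(0, [lambda v: v + 1] * 3, output_hidden_states=True)
# out == 3, states == [0, 1, 2, 3]
```

When the flag is left at its default, no extra list is built, so the common path pays no memory cost.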

## v0.2.4 - 2024-02-02

### Fixed

- Fixed an issue with the Hugging Face integration where we were inadvertently using a feature introduced in Python 3.10, causing an error on older Python versions.

## v0.2.3 - 2024-01-31

## v0.2.2 - 2023-12-10

## v0.2.1 - 2023-12-10

## v0.2.0 - 2023-12-08

### Added

- GPT-based model.
- Tokenizer and data pre-processing pipeline.
- Training script.
- Triton-based FlashAttention.