This repository contains scripts and resources for fine-tuning the Falcon-7B model for dialogue generation tasks using the Persona-Chat dataset. The project includes custom preprocessing, training, and evaluation pipelines with BLEU and ROUGE-L metrics.
- Choice of LLM: Falcon-7B
- Fine-Tuning Method: LoRA with 4-bit precision
- Justifications: Explained below
- Link to Model Weights: Hugging Face Repository
We chose Falcon-7B as our base language model because:
- It is a high-performing, open-source causal language model designed for text generation tasks.
- It is lightweight (7 billion parameters) compared to larger models, making it computationally efficient for fine-tuning.
- It supports advanced quantization techniques such as 4-bit precision, reducing memory requirements without significant performance loss.
- LoRA is a parameter-efficient fine-tuning technique that modifies only a subset of model parameters while keeping the base model frozen.
- In this project we use the following LoRA hyperparameters (see the configuration sketch after this list):
  - Rank (`r`): 16
  - Alpha: 32
  - Dropout: 0.05
- LoRA allows efficient training on consumer-grade GPUs (e.g., NVIDIA RTX 3090) while maintaining strong downstream task performance.
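A minimal sketch of how these hyperparameters could be wired up with the Hugging Face `peft` library (the `target_modules` entry is an assumption based on Falcon's fused attention projection; the repository's own training script may configure this differently):

```python
# Sketch: attach LoRA adapters (r=16, alpha=32, dropout=0.05) to Falcon-7B.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("tiiuae/falcon-7b")

lora_config = LoraConfig(
    r=16,                    # rank of the low-rank update matrices
    lora_alpha=32,           # scaling factor applied to the LoRA updates
    lora_dropout=0.05,       # dropout on the LoRA layers
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["query_key_value"],  # assumption: Falcon's fused QKV projection
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```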
- Quantization reduces the memory footprint and computational overhead.
- We use NF4 (4-bit NormalFloat) quantization, which is tailored to the roughly normal distribution of pretrained transformer weights and preserves accuracy better than plain integer 4-bit formats (see the loading sketch below).
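A minimal sketch of loading the base model in 4-bit NF4 precision with `transformers` and `bitsandbytes` (the compute dtype and double-quantization flags are illustrative assumptions, not confirmed settings of this repository):

```python
# Sketch: load Falcon-7B with 4-bit NF4 quantization.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # NF4, as described above
    bnb_4bit_compute_dtype=torch.bfloat16,  # assumption: bf16 compute
    bnb_4bit_use_double_quant=True,         # assumption: nested quantization
)

model = AutoModelForCausalLM.from_pretrained(
    "tiiuae/falcon-7b",
    quantization_config=bnb_config,
    device_map="auto",  # place layers across available devices
)
```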
- **Efficiency:**
  - The combination of LoRA and 4-bit quantization makes the fine-tuning process both memory- and compute-efficient.
  - This enables fine-tuning even on limited hardware without sacrificing model quality.
- **Reproducibility:**
  - The scripts in this repository are designed to be modular and reusable for similar text generation tasks.
- **Metrics for Evaluation:**
  - BLEU and ROUGE-L are industry-standard metrics for evaluating text generation.
  - Both quantify n-gram overlap with reference responses, providing insight into the fluency and informativeness of the generated replies (see the evaluation sketch below).
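A minimal sketch of scoring generated replies with both metrics via the Hugging Face `evaluate` library (the library choice and the example strings are assumptions for illustration):

```python
# Sketch: compute BLEU and ROUGE-L for a batch of generated responses.
import evaluate

bleu = evaluate.load("bleu")
rouge = evaluate.load("rouge")

predictions = ["i love hiking on the weekends"]        # model outputs
references = [["i really love hiking every weekend"]]  # gold responses

bleu_result = bleu.compute(predictions=predictions, references=references)
rouge_result = rouge.compute(predictions=predictions, references=references)

print(f"BLEU: {bleu_result['bleu']:.4f}")
print(f"ROUGE-L: {rouge_result['rougeL']:.4f}")
```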
The training objective is causal language modeling, where the model predicts the next token in a sequence based on the previous context.
The loss minimized is the standard token-level cross-entropy:

$$\mathcal{L} = -\frac{1}{T} \sum_{t=1}^{T} \log P_\theta(x_t \mid x_{<t})$$

where $x_t$ is the token at position $t$ and $x_{<t}$ is the preceding context.
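For illustration, here is a minimal sketch of how this loss is obtained with `transformers`: passing `labels` equal to `input_ids` makes the model shift the targets by one position internally and return the mean next-token cross-entropy (the example text is a placeholder):

```python
# Sketch: compute the causal LM cross-entropy loss for one example.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("tiiuae/falcon-7b")
model = AutoModelForCausalLM.from_pretrained("tiiuae/falcon-7b")

batch = tokenizer("Hello! How are you today?", return_tensors="pt")
outputs = model(**batch, labels=batch["input_ids"])  # labels trigger the loss
print(outputs.loss.item())  # mean cross-entropy over predicted tokens
```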
The fine-tuned model weights and tokenizer are available on Hugging Face:
Install the required libraries using:
```bash
pip install -r requirements.txt
```