Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
DominikLindorfer authored Aug 10, 2023
1 parent c202f01 commit 1897d2a
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -199,6 +199,7 @@ The SQL-LLaMA models are fine-tuned using HuggingFace's Trainer an the following
| Weight decay | 0.1 | 0.1 | 0.1 | 0.1 |
| Warm-Up-Ratio | - | - | - | - |

**SQL-LLaMA 13B with 5 and 10 Epoch Training are now available too! (Same other parameters as SQL-LLaMA-13B)**

Please note that both SQL-LLaMA-small 7B & 13B use the same LIMA training strategy proposed in Ref. [7] except that no dropout has been used and a cosine LR scheduler was employed.

Expand Down

0 comments on commit 1897d2a

Please sign in to comment.