Example code for parsing the dataset needed to train Stanford Alpaca.
This contains both a function, process_data.py
, which shows how to transform the given alpaca data into the format expected by cog train
. It also contains an example parsed dataset as a reference for that {'prompt': ..., 'completion':...}
format.