forked from decodingml/llm-twin-course
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
e0f268d
commit 6621975
Showing
4 changed files
with
33 additions
and
92 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
# Generate Data for LLM finetuning task component | ||
|
||
## Component Structure | ||
|
||
### File Handling | ||
- `file_handler.py`: Manages file I/O operations, enabling reading and writing of JSON formatted data. | ||
|
||
### LLM Communication | ||
- `llm_communication.py`: Handles communication with OpenAI's LLMs, sending prompts and processing responses. | ||
|
||
### Data Generation | ||
- `generate_data.py`: Orchestrates the generation of training data by integrating file handling, LLM communication, and data formatting. | ||
|
||
|
||
### Usage | ||
|
||
The project includes a `Makefile` for easy management of common tasks. Here are the main commands you can use: | ||
|
||
- `make help`: Displays help for each make command. | ||
- `make local-start`: Build and start mongodb, mq and qdrant. | ||
- `make local-test-github`: Insert data to mongodb | ||
- `make generate-dataset`: Generate dataset for finetuning and version it in CometML |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters