Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
feizc authored Aug 30, 2024
1 parent 4978dc8 commit bca95d5
Showing 1 changed file with 7 additions and 1 deletion.
8 changes: 7 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,8 @@ torchrun --nnodes=1 --nproc_per_node=8 train.py \
--global_batch_size 128
```

More scripts can reference to `scripts` file direction.


### 2. Inference

Expand All @@ -33,7 +35,7 @@ python sample.py \
```


### 3. Download Models and Data
### 3. Download Ckpts and Data

We use VAE and Vocoder in AudioLDM2, CLAP-L, and T5-XXL. You can download in the following table directly, we also provide the training scripts in our experiments.
Note that as in actual experiments, a restart experiment was performed due to machine malfunction, so there will be resume options in some scripts.
Expand All @@ -51,6 +53,10 @@ Note that as in actual experiments, a restart experiment was performed due to ma
| FluxMusic-Giant | [link](https://huggingface.co/feizhengcong/FluxMusic) | [link](https://github.com/feizc/FluxMusic/blob/main/scripts/train_g.sh) |


The construction of training data can refer to the `test.py` file. Considering copyright issues, the data used in the paper needs to be downloaded by oneself.
A quick download link can be found in [Huggingface](https://huggingface.co/datasets?search=music) : ).


### Acknowledgments

The codebase is based on the awesome [Flux](https://github.com/black-forest-labs/flux) and [AudioLDM2](https://github.com/haoheliu/AudioLDM2) repos.
Expand Down

0 comments on commit bca95d5

Please sign in to comment.