This script is an example of preparing a WebDataset for an image/video + text dataset using distributed processing with the Cosmos Tokenizer. It processes each sample by generating a continuous image/video latent with the Cosmos video tokenizer and a T5 embedding from the text caption, then stores the processed data in a WebDataset-compatible format.
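A minimal sketch of that per-sample flow is below. The encoder functions (`encode_video_latent`, `embed_caption`) and the dummy `samples` list are illustrative stand-ins, not the script's actual API; only the `webdataset.ShardWriter` usage reflects a real library call.

```python
import io

import torch
import webdataset as wds


def encode_video_latent(video: torch.Tensor) -> torch.Tensor:
    # Stand-in for the Cosmos video tokenizer, which maps pixel-space
    # video to a continuous latent tensor.
    return torch.randn(16, 8, 8)


def embed_caption(caption: str) -> torch.Tensor:
    # Stand-in for the T5 text encoder.
    return torch.randn(512, 1024)


def tensor_bytes(t: torch.Tensor) -> bytes:
    # Serialize a tensor so it can be stored as a .pth entry in the tar shard.
    buf = io.BytesIO()
    torch.save(t, buf)
    return buf.getvalue()


# Dummy (video, caption) pairs standing in for the real dataset.
samples = [(torch.randn(8, 3, 64, 64), "a cat playing piano")]

# Write WebDataset shards; each tar holds up to `maxcount` samples.
with wds.ShardWriter("shard-%06d.tar", maxcount=1000) as sink:
    for idx, (video, caption) in enumerate(samples):
        sink.write(
            {
                "__key__": f"{idx:08d}",
                "video_latent.pth": tensor_bytes(encode_video_latent(video)),
                "t5_embedding.pth": tensor_bytes(embed_caption(caption)),
                "caption.txt": caption,
            }
        )
```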
- Dependencies:
  - Please use the latest NeMo dev container: `nvcr.io/nvidia/nemo:dev`.
  - You may also need to install `jammy` and `mediapy`, depending on your dev container version.
- Data:
  - The script uses an example dataset that comes in parquet format. To use a custom dataset, you will need to write a custom `process_func` and create a new factory recipe that uses your new `process_func` (see the sketch after this list).
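As a rough illustration of that customization, here is a hedged sketch of a `process_func` for a parquet source. The signature and return contract expected by `prepare_energon_dataset.py` are not documented here, so the row-to-dict mapping, the column names (`id`, `video`, `caption`), and the placeholder encoders are all assumptions for illustration.

```python
import pandas as pd
import torch


def cosmos_encode(video) -> torch.Tensor:
    # Placeholder for the Cosmos tokenizer's continuous latent encoder.
    return torch.randn(16, 8, 8)


def t5_encode(caption: str) -> torch.Tensor:
    # Placeholder for the T5 embedding step.
    return torch.randn(512, 1024)


def my_process_func(row: pd.Series) -> dict:
    # Map one parquet row to a WebDataset-style sample dict. The column
    # names below are hypothetical; rename them to match your schema.
    return {
        "__key__": str(row["id"]),
        "video_latent.pth": cosmos_encode(row["video"]),
        "t5_embedding.pth": t5_encode(row["caption"]),
        "caption.txt": row["caption"],
    }


# Tiny in-memory stand-in for a real file read via pd.read_parquet(...).
df = pd.DataFrame(
    {
        "id": [0],
        "video": [torch.randn(8, 3, 64, 64).numpy()],
        "caption": ["a cat playing piano"],
    }
)
sample = my_process_func(df.iloc[0])
```

You would then register a function like this in a new factory recipe so each distributed worker applies it per sample.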
1. Set up your environment: pull and launch the NeMo dev container to run your script.
2. Customize the cache path: set the T5 cache directory in the script via the `t5_cache_dir` variable (a hedged example follows the launch command below).
3. Run the script: to run the script on 8 GPUs, use the following command:
```bash
torchrun --nproc_per_node=8 nemo/collections/diffusion/data/prepare_energon_dataset.py
```
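For step 2, the cache path is a plain assignment inside the script; the path below is illustrative only.

```python
# Illustrative only: pick any writable path visible to all ranks so the
# T5 weights are downloaded once and reused across processes.
t5_cache_dir = "/workspace/cache/t5"
```

`torchrun` launches one process per GPU, so adjust `--nproc_per_node` to match the number of GPUs on your node.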