
Why is dit.safetensors 40 GB in size? #115

Open
david-beckham-315 opened this issue Dec 16, 2024 · 3 comments
Labels: question (Further information is requested)

Comments

@david-beckham-315

Hi,

The diffusion model has 10B parameters, but I found that dit.safetensors is 40 GB in size. What dtype is stored in the model? Is it TF32?

Looking forward to your feedback, thanks.

@ajayjain
Contributor

ajayjain commented Dec 16, 2024

The checkpoint is stored in float32, not TF32. It should be fine to convert it to bfloat16, but a few parameters should ideally stay in float32 (one pos_frequencies tensor, and q_norm_x.weight, q_norm_y.weight, k_norm_x.weight, k_norm_y.weight in each block).
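
For context on the size: roughly 10 billion parameters at 4 bytes each in float32 comes to about 40 GB, which matches the file; the same weights in bfloat16 would be roughly 20 GB. Below is a minimal conversion sketch based on the comment above. It assumes the checkpoint is a single safetensors file and that the tensors to keep in float32 can be identified by the name substrings listed there; neither assumption is confirmed in this thread.

```python
# Sketch: convert dit.safetensors to bfloat16, keeping the numerically
# sensitive tensors in float32 (per the maintainer's comment above).
import torch
from safetensors.torch import load_file, save_file

# Name substrings of tensors to leave in float32 (assumed to match
# the actual checkpoint keys).
KEEP_FP32 = (
    "pos_frequencies",
    "q_norm_x.weight",
    "q_norm_y.weight",
    "k_norm_x.weight",
    "k_norm_y.weight",
)

def convert_to_bf16(src_path: str, dst_path: str) -> None:
    state_dict = load_file(src_path)  # loads all tensors onto CPU
    converted = {}
    for name, tensor in state_dict.items():
        if tensor.is_floating_point() and not any(k in name for k in KEEP_FP32):
            converted[name] = tensor.to(torch.bfloat16)  # 4 bytes -> 2 bytes per weight
        else:
            converted[name] = tensor  # keep listed tensors (and non-float data) as-is
    save_file(converted, dst_path)

convert_to_bf16("dit.safetensors", "dit.bf16.safetensors")
```

The tensors kept in float32 are small (positional frequencies and per-block QK-norm scales), so they add almost nothing to the file size while presumably avoiding precision loss in those operations.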

@david-beckham-315
Author

> The checkpoint is stored in float32, not TF32. It should be fine to convert it to bfloat16, but a few parameters should ideally stay in float32 (one pos_frequencies tensor, and q_norm_x.weight, q_norm_y.weight, k_norm_x.weight, k_norm_y.weight in each block).

Thanks for your answer!

@david-beckham-315
Author

> The checkpoint is stored in float32, not TF32. It should be fine to convert it to bfloat16, but a few parameters should ideally stay in float32 (one pos_frequencies tensor, and q_norm_x.weight, q_norm_y.weight, k_norm_x.weight, k_norm_y.weight in each block).

Hi,
I'd like to ask two questions:

  1. The checkpoint is stored in float32, so why is inference run in bfloat16?
  2. How can I run inference in float32? I changed model_dtype="bf16" to model_dtype="fp32" in cli.py, but it fails with the assertion: assert self.kwargs["model_dtype"] == "bf16", "FP8 is not supported for multi-GPU inference"

ajayjain added the question label Dec 20, 2024