Tags · subhankar-ghosh/NeMo

24.09-alpha.rc0

Support TE-DPA For Stable Diffusion (NVIDIA#10314)

* [SD] Add te-dpa support

Signed-off-by: Wil Kong <[email protected]>

* [SD] Add te-dpa support, resolve compatibility with TE-master

Signed-off-by: Wil Kong <[email protected]>

* [SD] Add te-dpa support, add check for attention configs.

Signed-off-by: Wil Kong <[email protected]>

* Fix bugs of flash-attn and dpa in SD.

Signed-off-by: Wil Kong <[email protected]>

* Fix the issue of DPA API change.

Signed-off-by: Wil Kong <[email protected]>

* Apply isort and black reformatting

Signed-off-by: alpha0422 <[email protected]>
Signed-off-by: Wil Kong <[email protected]>

* [SD] TE-DPA: disable use of te-dpa in inference flow.

---------

Signed-off-by: Wil Kong <[email protected]>
Signed-off-by: alpha0422 <[email protected]>
Co-authored-by: Mengdi Wang <[email protected]>

Sep 16, 2024
4068955
zip
tar.gz

r2.0.0rc1

add manifest file (NVIDIA#10161)

Signed-off-by: Oliver Koenig <[email protected]>

Aug 15, 2024
579983f
zip
tar.gz

v2.0.0rc0

Merge branch 'r2.0.0rc0' of github.com:NVIDIA/NeMo into r2.0.0rc0

Jun 5, 2024
265bd73
zip
tar.gz

stable

Merge branch 'r2.0.0rc0' of github.com:NVIDIA/NeMo into r2.0.0rc0

Jun 5, 2024
265bd73
zip
tar.gz

v2.0.0.rc0.beta

Add option for mutex timeout in distributed optimizer backward hook (NVIDIA#9087)

* Tim: Add option for timeout in distopt callback mutex

Signed-off-by: Jaemin Choi <[email protected]>

* Replace parent's _lock

Signed-off-by: Jaemin Choi <[email protected]>

* Revert "Replace parent's _lock"

This reverts commit 972d1b6.

Signed-off-by: Jaemin Choi <[email protected]>

* Raise RuntimeError when timeout

Signed-off-by: Jaemin Choi <[email protected]>

* Change RuntimeError to print

Signed-off-by: Jaemin Choi <[email protected]>

---------

Signed-off-by: Jaemin Choi <[email protected]>
Co-authored-by: Jaemin Choi <[email protected]>

May 2, 2024
b2eccd2
zip
tar.gz

v1.23.0

update github raw content link (NVIDIA#8517)

Signed-off-by: Chen Cui <[email protected]>

Feb 26, 2024
d2283e3
zip
tar.gz

v1.22.0

Merge branch 'r1.22.0' of github.com:NVIDIA/NeMo into r1.22.0

Jan 10, 2024
0b7467e
zip
tar.gz

v1.21.0

Update Apex install command in Dockerfile (NVIDIA#7794)

* move core install to /workspace (NVIDIA#7706)

Signed-off-by: Abhinav Khattar <[email protected]>

* update apex install in dockerfile

Signed-off-by: eharper <[email protected]>

* use fetch head

Signed-off-by: eharper <[email protected]>

---------

Signed-off-by: Abhinav Khattar <[email protected]>
Signed-off-by: eharper <[email protected]>
Co-authored-by: Abhinav Khattar <[email protected]>

Oct 25, 2023
c0022ae
zip
tar.gz

nvidia-mlperf

tag for mlperf Oct submission

Sep 29, 2023
32f592c
zip
tar.gz

v1.20.0

Eagerly accumulate embedding grads into fp32 buffer (NVIDIA#6958)

Signed-off-by: Tim Moon <[email protected]>

Aug 2, 2023
2baef81
zip
tar.gz
