Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

是否支持Megatron-Deepspeed的模型并行? #7033

Closed
RyanOvO opened this issue Feb 21, 2025 · 1 comment
Closed

是否支持Megatron-Deepspeed的模型并行? #7033

RyanOvO opened this issue Feb 21, 2025 · 1 comment
Labels
wontfix This will not be worked on

Comments

@RyanOvO
Copy link

RyanOvO commented Feb 21, 2025

背景:
目前deepspeed推出了类似Metagtron的模型并行功能,即Deepspeed版的Megatron-LM的模型并行功能。但当前LLaMA-Factory所推出的ds并行训练配置模板yaml中并没有相关的配置,现阶段LLaMA-Factory提供的ds功能都是zero数据并行的配置。

期望:
LLaMA-Factory能提供Megatron-Deepspeed的模型并行的配置模板,且支持到昇腾体系嘛?

@hiyouga
Copy link
Owner

hiyouga commented Feb 21, 2025

不支持

@hiyouga hiyouga closed this as completed Feb 21, 2025
@hiyouga hiyouga closed this as not planned Won't fix, can't repro, duplicate, stale Feb 21, 2025
@hiyouga hiyouga added the wontfix This will not be worked on label Feb 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

2 participants