We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
背景: 目前deepspeed推出了类似Metagtron的模型并行功能,即Deepspeed版的Megatron-LM的模型并行功能。但当前LLaMA-Factory所推出的ds并行训练配置模板yaml中并没有相关的配置,现阶段LLaMA-Factory提供的ds功能都是zero数据并行的配置。
期望: LLaMA-Factory能提供Megatron-Deepspeed的模型并行的配置模板,且支持到昇腾体系嘛?
The text was updated successfully, but these errors were encountered:
不支持
Sorry, something went wrong.
No branches or pull requests
背景:
目前deepspeed推出了类似Metagtron的模型并行功能,即Deepspeed版的Megatron-LM的模型并行功能。但当前LLaMA-Factory所推出的ds并行训练配置模板yaml中并没有相关的配置,现阶段LLaMA-Factory提供的ds功能都是zero数据并行的配置。
期望:
LLaMA-Factory能提供Megatron-Deepspeed的模型并行的配置模板,且支持到昇腾体系嘛?
The text was updated successfully, but these errors were encountered: