-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[QUESTION] Are there targeted optimizations for the ada architecture? #56
Comments
most of them are not tuned. you can tune it yourself. for GEMM+RS and AG+GEMM, use tools here: https://github.com/bytedance/flux/tree/main/tools for MOE related: no tools yet. A PR is welcome. |
thx, i will try. |
We recently open source the moe part, and related tuning script can be find here. You can use that for reference. |
I pulled the latest code immediately. When I compiled it on ada, the following error occurred. Could you please tell me what suggestions there are for fixing it? @wenlei-bao
|
try clean your workspace then follow the README.md and try recompiles it. NOTE that you have to run this before build.sh https://github.com/bytedance/flux/blob/main/install_deps.sh also make sure you compile with the right image. we use NVCC 12.4 + gcc 12. |
Hi,I noticed that you've been running benchmarks on the L20. May I ask if there are targeted optimizations for the ada architecture?
The text was updated successfully, but these errors were encountered: