-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Geometry optimization fails in multi-GPU version #281
Comments
Hi @akhilshajan, can you please provide me a smaller example that can be used to reproduce this issue? The one I have (attached above) is too big for debugging. |
Hi @Madu86, I have tried out few examples which takes ~40 iterations with serial, MPI and cuda.MPI and it works out fine giving same results. I was not able to find something that would be helpful for debugging. I am still working on few other molecules there is some discrepancy I will update you. |
@akhilshajan Any update on this? |
Hi @Madu86, I apologize for my delayed response. I have tried out some molecules and it appears that the issue we are experiencing with MPI arises when using the D3BJ keyword. I did not encounter any errors when running the calculations without this keyword. Attached is the input file I used to test benzene molecule, where the SCF calculation failed. |
Hi @Madu86, I tried to run this test case with modifications to the MPI we made for DL-Find just to confirm if the cause was with MPI. I see some discrepancies in the results as I ran this system on single CPU(took 53 iterations), CUDA(122 iterations) and multi-GPU(still fails!!). I have attached my results including the old results shared by you. I have also attached the slurm out file for multi-CUDA calculation. |
The geometry optimization in multi-GPU version of the latest version fails for some reason. See the attached .zip file for CUDA serial and multi-GPU output files of a test case.
1077.out.zip
The text was updated successfully, but these errors were encountered: