The previous FLOPs calculation using the thop library is not accurate, because it ignores the scan FLOPs.
With the latest code, which uses fvcore (adapted from the VMamba paper, which in turn follows Albert Gu's method of counting FLOPs for SSMs), I am getting the results below.
1 MambaIR Block
GFlops: 61.45901 Params: 106097
1 Restormer Block
GFlops: 7.2507 Params: 31729
So my questions are:
Why is the MambaIR block roughly 8.5 times more expensive to compute than the Restormer block?
Why does it have more than three times as many parameters as the Restormer block?
I also didn't see any FLOPs comparison with Restormer for the real denoising case in the MambaIR v1 paper.
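To illustrate why a counter that skips the scan undercounts an SSM block, here is a minimal sketch of the kind of estimate VMamba-style counters use for the selective scan. The formula (about 9·B·L·D·N, plus optional terms for the skip connection and the gate) is my understanding of that approach, not a copy of this repo's flops_param_fvcore.py. The function name, argument names, and example sizes are hypothetical.

```python
def selective_scan_flops(batch, seq_len, d_inner, d_state,
                         with_d=True, with_z=False):
    """Rough FLOP estimate for one selective-scan pass.

    Assumption: the scan costs about 9 * B * L * D * N, the estimate
    commonly used in VMamba-style counters. This is a sketch, not the
    exact code used in this repo.
    """
    flops = 9 * batch * seq_len * d_inner * d_state
    if with_d:
        # Skip connection term (D * u): one multiply per element.
        flops += batch * d_inner * seq_len
    if with_z:
        # Optional gating by z: one multiply per element.
        flops += batch * d_inner * seq_len
    return flops


if __name__ == "__main__":
    # Hypothetical sizes: a 128x128 feature map flattened to L = 16384,
    # with d_inner = 96 and d_state = 16 (illustrative values only).
    per_scan = selective_scan_flops(1, 128 * 128, 96, 16)
    print(f"{per_scan / 1e9:.3f} GFLOPs per scan direction")
```

The cost grows linearly with the sequence length, so at image resolutions the scan term is large, and a block that scans in several directions pays it once per direction. A profiler that does not register the scan op reports none of this.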
Hi, thanks for your interest!
In the ./analysis folder of this repo, we have already updated the FLOPs calculation code, which now uses fvcore.
As for model complexity, we suggest evaluating the whole model instead of a single block, since different methods use different numbers of layers.
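The point about layer counts can be shown with a small sketch: whole-model cost is the sum over stages of per-block cost times the number of blocks, so a per-block gap does not by itself determine the whole-model gap. All numbers below are hypothetical placeholders chosen for illustration, not measurements of MambaIR or Restormer.

```python
def total_model_cost(stages):
    """Sum cost over stages given (cost_per_block, num_blocks) pairs."""
    return sum(cost * depth for cost, depth in stages)


# Hypothetical placeholder values, not measured results.
# Model A: a few expensive blocks per stage.
model_a = [(10.0, 2), (5.0, 4)]
# Model B: many cheaper blocks per stage.
model_b = [(4.0, 6), (2.0, 8)]

if __name__ == "__main__":
    print("model A total:", total_model_cost(model_a))
    print("model B total:", total_model_cost(model_b))
```

In this toy case the blocks of model A cost 2.5 times as much as those of model B at each stage, yet both totals are equal because model B stacks more blocks.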
I have used the latest code from ./analysis. The results I pasted above are from flops_param_fvcore.py.
As for model complexity, I have also compared the whole-model FLOPs of Restormer and MambaIR for the denoising task. As mentioned in the paper and code, the denoising task uses a UNet-shaped MambaIR model, similar to Restormer, for a fair comparison. Below are my results on an image of shape (3, 128, 128).