Fix adversarial training #23

rithik83 · 2024-08-30T00:57:31Z

To generate an adversarial example given a model and a clean example, gradient-based techniques generally move along the loss gradients of the clean example trying to maximise loss.

in my old adv-training implementation, I notice I generate a batch of adversarial examples by perturbing a batch of clean examples. My way tended to aggregate the cross-entropy loss of all the examples in the batch and used that loss for all the points, instead of each point's own CE loss to perturb it. No wonder pure AT did not work in my thesis, elementary error

The fix is manually perturbing each clean example in the batch on its own, computing and using CE loss for that point alone.

Not sure if my word salad here is comprehensible but hey

rithik83 self-assigned this Aug 30, 2024

rithik83 mentioned this issue Aug 30, 2024

Adv training #24

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix adversarial training #23

Fix adversarial training #23

rithik83 commented Aug 30, 2024

Fix adversarial training #23

Fix adversarial training #23

Comments

rithik83 commented Aug 30, 2024