Stars
Official GitHub page for the paper "Evaluating Deep Unlearning in Large Language Models"
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
New ways of breaking app-integrated LLMs
An easy-to-use Python framework to generate adversarial jailbreak prompts.
Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
Universal and Transferable Attacks on Aligned Language Models
A PyTorch implementation of Model Agnostic Meta-Learning (MAML) that faithfully reproduces the results from the original paper.
A new adversarial purification method that uses the forward and reverse processes of diffusion models to remove adversarial perturbations.
Official Code for ICLR2022 Paper: Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap
A loss function (Weighted Hausdorff Distance) for object localization in PyTorch
Codes for NeurIPS 2020 paper "Adversarial Weight Perturbation Helps Robust Generalization"
A PyTorch implementation of the method found in "Adversarially Robust Few-Shot Learning: A Meta-Learning Approach"
Official code for dynamic convolution decomposition
Code and data for the ICLR 2021 paper "Perceptual Adversarial Robustness: Defense Against Unseen Threat Models".
Unofficial implementation of the DeepMind papers "Uncovering the Limits of Adversarial Training against Norm-Bounded Adversarial Examples" & "Fixing Data Augmentation to Improve Adversarial Robustn…
Empirical tricks for training robust models (ICLR 2021)
Code relating to "Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks"
[ICML 2021] This is the official GitHub repo for training L_inf dist nets with high certified accuracy.
A Python package to assess and improve fairness of machine learning models.
This repository contains implementations and illustrative code to accompany DeepMind publications
Systematic Evaluation of Membership Inference Privacy Risks of Machine Learning Models