Hello, I am Tinghao Xie 谢廷浩, a second year ECE PhD candidate at Princeton advised by Prof. Prateek Mittal. I received my Bachelor degree from Computer Science and Technology at Zhejiang University. Check my website for more information!
-
Princeton University
- Princeton, NJ
-
20:38
- 4h behind - https://tinghaoxie.com
- @VitusXie
Highlights
- Pro
Pinned Loading
-
SORRY-Bench/sorry-bench
SORRY-Bench/sorry-bench PublicBenchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal" (ICLR 2025)
-
LLM-Tuning-Safety/LLMs-Finetuning-Safety
LLM-Tuning-Safety/LLMs-Finetuning-Safety PublicWe jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
-
backdoor-toolbox
backdoor-toolbox PublicA compact toolbox for backdoor attacks and defenses.
-
Unispac/Subnet-Replacement-Attack
Unispac/Subnet-Replacement-Attack PublicOfficial implementation of (CVPR 2022 Oral) Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks.
-
Unispac/Fight-Poison-With-Poison
Unispac/Fight-Poison-With-Poison PublicCode repository for the paper --- [USENIX Security 2023] Towards A Proactive ML Approach for Detecting Backdoor Poison Samples
-
ain-soph/trojanzoo
ain-soph/trojanzoo PublicTrojanZoo provides a universal pytorch platform to conduct security researches (especially backdoor attacks/defenses) of image classification in deep learning.
94 contributions in the last year
Day of Week | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | |||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |