Skip to content
View TkSTynCHD's full-sized avatar

Block or report TkSTynCHD

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for the NeurIPS 2024 submission: "DAGER: Extracting Text from Gradients with Language Model Priors"

Python 5 2 Updated Oct 31, 2024

Papers and resources related to the security and privacy of LLMs 🤖

Python 467 35 Updated Nov 27, 2024

The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".

Python 39 5 Updated Oct 23, 2024

A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)

HTML 32 4 Updated Nov 28, 2024

[arXiv:2411.10023] "Model Inversion Attacks: A Survey of Approaches and Countermeasures"

150 10 Updated Jan 5, 2025

Code for replicating experiments in our paper (accepted by AAAI-24).

Python 5 Updated Aug 2, 2024

[IJCAI-2021] Contrastive Model Inversion for Data-Free Knowledge Distillation

Python 69 17 Updated Apr 7, 2022
Python 50 5 Updated Jun 13, 2024

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,123 73 Updated Feb 5, 2025

LAMP: Extracting Text from Gradients with Language Model Priors (NeurIPS '22)

Python 24 7 Updated Feb 7, 2023

Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"

Python 102 12 Updated Dec 28, 2022

Learning Sparse Neural Networks through L0 regularization

Python 238 47 Updated Jul 17, 2020

The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word prediction language models.

Python 88 22 Updated Aug 13, 2024
Python 2 Updated Jun 27, 2024

This project explores training data extraction attacks on the LLaMa 7B, GPT-2XL, and GPT-2-IMDB models to discover memorized content using perplexity, perturbation scoring metrics, and large scale …

Python 9 1 Updated Jun 15, 2023

The Memory layer for AI Agents

Python 24,377 2,258 Updated Feb 5, 2025

😜Constrative Learning of Sentence Embedding using LoRA (EECS487 final project)

Jupyter Notebook 13 Updated Apr 19, 2023

Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation"

Python 22 Updated May 8, 2023

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,153 1,716 Updated Feb 4, 2025

Curated list of project-based tutorials

215,672 28,131 Updated Aug 15, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 13,596 1,533 Updated Jan 15, 2025

[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey

86 6 Updated Aug 7, 2024

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

1,146 58 Updated Feb 3, 2025

Code for Findings-ACL 2023 paper: Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence

Python 43 14 Updated Jun 3, 2024

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 52,980 5,166 Updated Jan 21, 2025
Python 60 17 Updated Jun 2, 2023

An easy-to-use federated learning platform

Python 1,360 221 Updated Aug 10, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,219 706 Updated Dec 17, 2024
Python 4 1 Updated Mar 19, 2023
Next