AlphaPav
🏠
Working from home

Highlights

  • Pro

Organizations

@AI-secure

Fully open reproduction of DeepSeek-R1

Python 24,115 2,212 Updated Apr 23, 2025

Dataset and evaluation benchmark for Privacy Leakage Evaluation of Autonomous Web Agents

Python 11 1 Updated Mar 25, 2025

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

630 56 Updated Apr 20, 2025

Comprehensive benchmark for RAG

Jupyter Notebook 170 19 Updated Nov 5, 2024

Code for watermarking language models

Python 78 10 Updated Sep 7, 2024

Code for the paper "Firewalls to Secure Dynamic LLM Agentic Networks"

Python 14 Updated Apr 22, 2025

Specification and documentation for the Model Context Protocol

TypeScript 2,376 321 Updated Apr 24, 2025

A lightweight, powerful framework for multi-agent workflows

Python 9,347 1,217 Updated Apr 24, 2025

Official repo for GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts

Python 482 67 Updated Sep 24, 2024

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,138 153 Updated Apr 23, 2025

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Python 1,353 122 Updated Apr 3, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,319 153 Updated Mar 20, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 7,075 777 Updated Apr 24, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,642 1,470 Updated Apr 2, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,049 144 Updated Apr 11, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,467 105 Updated Apr 24, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,711 273 Updated Apr 14, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6,401 629 Updated Apr 24, 2025

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 4,133 413 Updated Apr 10, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 15,580 1,588 Updated Apr 23, 2025

Continual Memorization of Factoids in Large Language Models

Python 7 1 Updated Nov 20, 2024

Jupyter Notebook 31 5 Updated Jan 7, 2025

Attribute (or cite) statements generated by LLMs back to in-context information.

Jupyter Notebook 228 18 Updated Oct 8, 2024

Multi AI agents for customer support email automation built with Langchain & Langgraph

Python 93 28 Updated Feb 13, 2025

[ICLR 2025] Dissecting Adversarial Robustness of Multimodal LM Agents

Python 81 6 Updated Feb 19, 2025

Autonomous Agents (LLMs) research papers. Updated Daily.

780 39 Updated Apr 23, 2025

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,377 87 Updated Apr 24, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 43,617 6,574 Updated Apr 24, 2025