AlphaPav

🏠

Working from home

148 followers · 68 following

University of Illinois Urbana-Champaign
https://alphapav.github.io/

Stars

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,115 2,212 Updated Apr 23, 2025

facebookresearch / ai-agent-privacy

Dataset and evaluation benchmark for Privacy Leakage Evaluation of Autonomous Web Agents

Python 11 1 Updated Mar 25, 2025

yueliu1999 / Awesome-Jailbreak-on-LLMs

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

630 56 Updated Apr 20, 2025

facebookresearch / CRAG

Comprehensive benchmark for RAG

Jupyter Notebook 170 19 Updated Nov 5, 2024

multi-agent-systems-failure-taxonomy / MAST

Python 125 8 Updated Apr 24, 2025

jthickstun / watermark

Code for watermarking language models

Python 78 10 Updated Sep 7, 2024

microsoft / Firewalled-Agentic-Networks

Code for the paper "Firewalls to Secure Dynamic LLM Agentic Networks"

Python 14 Updated Apr 22, 2025

modelcontextprotocol / modelcontextprotocol

Specification and documentation for the Model Context Protocol

TypeScript 2,376 321 Updated Apr 24, 2025

openai / openai-agents-python

A lightweight, powerful framework for multi-agent workflows

Python 9,347 1,217 Updated Apr 24, 2025

sherdencooper / GPTFuzz

Official repo for GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts

Python 482 67 Updated Sep 24, 2024

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,138 153 Updated Apr 23, 2025

openai / SWELancer-Benchmark

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Python 1,353 122 Updated Apr 3, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,319 153 Updated Mar 20, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 7,075 777 Updated Apr 24, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,642 1,470 Updated Apr 2, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,049 144 Updated Apr 11, 2025

RAGEN-AI / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,467 105 Updated Apr 24, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,711 273 Updated Apr 14, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6,401 629 Updated Apr 24, 2025

X-PLUG / MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 4,133 413 Updated Apr 10, 2025

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 15,580 1,588 Updated Apr 23, 2025

princeton-nlp / continual-factoid-memorization

Continual Memorization of Factoids in Large Language Models

Python 7 1 Updated Nov 20, 2024

katiekang1998 / reasoning_generalization

Jupyter Notebook 31 5 Updated Jan 7, 2025

MadryLab / context-cite

Attribute (or cite) statements generated by LLMs back to in-context information.

Jupyter Notebook 228 18 Updated Oct 8, 2024

kaymen99 / langgraph-email-automation

Multi AI agents for customer support email automation built with Langchain & Langgraph

Python 93 28 Updated Feb 13, 2025

ChenWu98 / agent-attack

[ICLR 2025] Dissecting Adversarial Robustness of Multimodal LM Agents

Python 81 6 Updated Feb 19, 2025

OSU-NLP-Group / EIA_against_webagent

Python 23 1 Updated Oct 2, 2024

tmgthb / Autonomous-Agents

Autonomous Agents (LLMs) research papers. Updated Daily.

780 39 Updated Apr 23, 2025

ThuCCSLab / Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,377 87 Updated Apr 24, 2025

microsoft / autogen

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 43,617 6,574 Updated Apr 24, 2025
