Stars
A benchmark to evaluate situated inductive reasoning
Learn online intrinsic rewards from LLM feedback
Sandboxed code execution for AI agents, locally or on the cloud.
A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.
Benchmarking Agentic LLM and VLM Reasoning On Games
RAG that intelligently adapts to your use case, data, and queries
The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.
Synthetic data derived via templating, few-shot prompting, transformations of public-domain corpora, and Monte Carlo tree search.
Long context evaluation for large language models
Code for Quiet-STaR
Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"
Playing Pokemon Red with Reinforcement Learning
Open-source framework for exporting your personal data.
rbren / OpenHands
Forked from All-Hands-AI/OpenHands. OpenDevin: Code Less, Make More
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
Source code for Self-Evaluation Guided MCTS for online DPO.
Code for paper "Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent"
A comprehensive repository of reasoning tasks for LLMs (and beyond)
A research-friendly codebase for fast experimentation with single-agent reinforcement learning in JAX • End-to-End JAX RL