Skip to content
View JoshuaPurtell's full-sized avatar
πŸ’­
Working
πŸ’­
Working

Block or report JoshuaPurtell

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Rust 1 Updated Dec 27, 2024

a benchmark to evaluate the situated inductive reasoning

Python 10 2 Updated Dec 10, 2024

Learn online intrinsic rewards from LLM feedback

Python 27 Updated Dec 17, 2024

Sandboxed code execution for AI agents, locally or on the cloud.

Python 25 4 Updated Dec 10, 2024

A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.

Python 795 26 Updated Dec 18, 2024

Benchmarking Agentic LLM and VLM Reasoning On Games

Python 81 12 Updated Dec 18, 2024

RAG that intelligently adapts to your use case, data, and queries

Python 2,614 118 Updated Dec 23, 2024

The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.

Python 3,260 238 Updated Dec 17, 2024

Routing on Random Forest (RoRF)

Python 92 5 Updated Sep 24, 2024

AWM: Agent Workflow Memory

Python 224 19 Updated Nov 25, 2024

Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.

Python 24 1 Updated Dec 6, 2024

home of hitorilabs

Astro 1 1 Updated Aug 30, 2024

Long context evaluation for large language models

Python 194 15 Updated Dec 23, 2024

Your first AI prompt engineer

Python 352 14 Updated Nov 7, 2024

Code for Quiet-STaR

Python 5 Updated Aug 21, 2024

Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"

27 Updated Jun 16, 2024

Playing Pokemon Red with Reinforcement Learning

Jupyter Notebook 14 Updated Jun 22, 2024

Open-source framework for exporting your personal data.

TypeScript 1,291 56 Updated Dec 25, 2024

🐚 OpenDevin: Code Less, Make More

Python 1 Updated Aug 26, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery πŸ§‘β€πŸ”¬

Jupyter Notebook 8,475 1,215 Updated Nov 8, 2024
Python 27 1 Updated Sep 23, 2024

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Python 108 5 Updated Dec 3, 2024

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 252 26 Updated Aug 6, 2024
Python 66 8 Updated Aug 21, 2024

LLMs + Lean, on your laptop or in the cloud

Lean 130 17 Updated Oct 23, 2024

Code for paper "Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent"

Jupyter Notebook 6 Updated Jul 18, 2024
Jupyter Notebook 349 27 Updated Jul 22, 2024

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 291 40 Updated Sep 27, 2024

πŸ›οΈA research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX β€’ End-to-End JAX RL

Python 252 26 Updated Dec 4, 2024
Next