willccbb

Follow

will brown willccbb

Follow

434 followers · 4 following

Achievements

Achievements

Lists (1)

Sort

Gyms

Stars

openai / codex

Lightweight coding agent that runs in your terminal

TypeScript 10,034 759 Updated Apr 17, 2025

openai / TestUGxlYXNlIGlnbm9yZQo

Test Please ignore

13 3 Updated Apr 17, 2025

OpenPipe / ART

OpenPipe ART (Agent Reinforcement Trainer): train LLM agents

Python 87 1 Updated Apr 17, 2025

huggingface / yourbench

Forked from sumukshashidhar/yourbench

🤗 Benchmark Large Language Models Reliably On Your Data

Python 234 16 Updated Apr 14, 2025

junegunn / fzf

🌸 A command-line fuzzy finder

Go 69,457 2,494 Updated Apr 13, 2025

simple-bench / SimpleBench

Python 100 11 Updated Dec 20, 2024

cpldcpu / MisguidedAttention

A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information

Python 399 21 Updated Apr 10, 2025

dmtrKovalenko / my-nvim-config

My neovim configs (yes I use neovim instead of tmux and it is good 😱)

Lua 95 5 Updated Apr 9, 2025

Saghen / blink.cmp

Performant, batteries-included completion plugin for Neovim

Lua 4,154 230 Updated Apr 15, 2025

brendanhogan / DeepSeekRL-Extended

Exploring Applications of GRPO

Python 179 18 Updated Apr 11, 2025

donnemartin / system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 296,513 49,299 Updated Dec 2, 2024

groundlight / r1_vlm

Build your own visual reasoning model

Python 338 18 Updated Apr 17, 2025

KellerJordan / modded-nanogpt

NanoGPT (124M) in 3 minutes

Python 2,492 284 Updated Apr 1, 2025

ezyang / codemcp

Coding assistant MCP for Claude Desktop

Python 1,093 90 Updated Apr 15, 2025

instructor-ai / instructor

structured outputs for llms

Python 10,153 773 Updated Apr 14, 2025

bradhilton / temporal-clue

Clue inspired puzzles for testing LLM deduction abilities

Python 33 3 Updated Mar 24, 2025

OpenPipe / deductive-reasoning

Train your own SOTA deductive reasoning model

Python 86 6 Updated Mar 6, 2025

doomslide / attention-graph

JavaScript 55 2 Updated Mar 5, 2025

willccbb / claude-code-mcp

Letting Claude Code develop his own MCP tools :)

TypeScript 99 18 Updated Mar 8, 2025

haizelabs / verdict

Verdict is a library for scaling judge-time compute.

Jupyter Notebook 196 8 Updated Mar 18, 2025

willccbb / verifiers

Verifiers for LLM Reinforcement Learning

Python 806 91 Updated Apr 2, 2025

ZeroSumEval / ZeroSumEval

A framework for pitting LLMs against each other in an evolving library of games ⚔

Python 33 4 Updated Mar 29, 2025

gkamradt / SnakeBench

TypeScript 84 9 Updated Apr 14, 2025

eddycmu / demystify-long-cot

Python 281 16 Updated Mar 16, 2025

LeonGuertler / TextArena

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 118 23 Updated Apr 16, 2025

qpwo / dsv3-lowmem

Forked from deepseek-ai/DeepSeek-V3

run deepseek v3 on a single node. Drops unused experts from memory.

Python 14 1 Updated Jan 26, 2025

open-thought / reasoning-gym

procedural reasoning datasets

Python 562 57 Updated Apr 16, 2025

ezetech / anthropic-gui

A user-friendly, feature-rich UI enhancing interaction with Anthropic's Claude AI, enabling model selection, chat saving, and improved prompt editing.

TypeScript 108 28 Updated Jul 25, 2023

ekzhang / jute

A native Jupyter notebook frontend with local + remote kernels, reactive cells, and IDE features, implemented in Rust

Rust 113 9 Updated Feb 3, 2025

yetone / avante.nvim

Use your Neovim like using Cursor AI IDE!

Lua 12,749 528 Updated Apr 17, 2025