Skip to content
View willccbb's full-sized avatar

Block or report willccbb

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Lightweight coding agent that runs in your terminal

TypeScript 10,034 759 Updated Apr 17, 2025

Test Please ignore

13 3 Updated Apr 17, 2025

OpenPipe ART (Agent Reinforcement Trainer): train LLM agents

Python 87 1 Updated Apr 17, 2025

🤗 Benchmark Large Language Models Reliably On Your Data

Python 234 16 Updated Apr 14, 2025

🌸 A command-line fuzzy finder

Go 69,457 2,494 Updated Apr 13, 2025
Python 100 11 Updated Dec 20, 2024

A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information

Python 399 21 Updated Apr 10, 2025

My neovim configs (yes I use neovim instead of tmux and it is good 😱)

Lua 95 5 Updated Apr 9, 2025

Performant, batteries-included completion plugin for Neovim

Lua 4,154 230 Updated Apr 15, 2025

Exploring Applications of GRPO

Python 179 18 Updated Apr 11, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 296,513 49,299 Updated Dec 2, 2024

Build your own visual reasoning model

Python 338 18 Updated Apr 17, 2025

NanoGPT (124M) in 3 minutes

Python 2,492 284 Updated Apr 1, 2025

Coding assistant MCP for Claude Desktop

Python 1,093 90 Updated Apr 15, 2025

structured outputs for llms

Python 10,153 773 Updated Apr 14, 2025

Clue inspired puzzles for testing LLM deduction abilities

Python 33 3 Updated Mar 24, 2025

Train your own SOTA deductive reasoning model

Python 86 6 Updated Mar 6, 2025
JavaScript 55 2 Updated Mar 5, 2025

Letting Claude Code develop his own MCP tools :)

TypeScript 99 18 Updated Mar 8, 2025

Verdict is a library for scaling judge-time compute.

Jupyter Notebook 196 8 Updated Mar 18, 2025

Verifiers for LLM Reinforcement Learning

Python 806 91 Updated Apr 2, 2025

A framework for pitting LLMs against each other in an evolving library of games ⚔

Python 33 4 Updated Mar 29, 2025
TypeScript 84 9 Updated Apr 14, 2025
Python 281 16 Updated Mar 16, 2025

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 118 23 Updated Apr 16, 2025

run deepseek v3 on a single node. Drops unused experts from memory.

Python 14 1 Updated Jan 26, 2025

procedural reasoning datasets

Python 562 57 Updated Apr 16, 2025

A user-friendly, feature-rich UI enhancing interaction with Anthropic's Claude AI, enabling model selection, chat saving, and improved prompt editing.

TypeScript 108 28 Updated Jul 25, 2023

A native Jupyter notebook frontend with local + remote kernels, reactive cells, and IDE features, implemented in Rust

Rust 113 9 Updated Feb 3, 2025

Use your Neovim like using Cursor AI IDE!

Lua 12,749 528 Updated Apr 17, 2025
Next