alicogniai

⚔️

grinding

Don't fool yourself!

Stars

Animation engine for explanatory math videos

Python 73,686 6,441 Updated Dec 28, 2024

An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO

Python 25 7 Updated Jan 6, 2025

"Improving Mathematical Reasoning with Process Supervision" by OpenAI

Python 96 10 Updated Dec 16, 2024

Forked from NVlabs/verilog-eval

Verilog evaluation benchmark for large language models

SystemVerilog 1 Updated Sep 17, 2024

Python 111 23 Updated Jul 17, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,522 245 Updated Dec 11, 2024