In this directory we write benchmarks for Mentat's performance on different tasks.
./benchmarks/exercism_practice.py
Flags that control the performance of the benchmarks are defined here and set conservatively so benchmarks without flags will run relatively quickly and cheaply. To run the exercism benchmark with multiple workers on all the tests with one retry for the clojure language run the following:
./benchmarks/exercism_practice.py --max_benchmarks 134 --max_iterations 2 --max_workers 2 --language clojure
Warning: If you increase max_workers
much higher you'll start to get rate limited.
./benchmarks/benchmark_runner.py
Real world benchmarks can either be samples or python files.