Name		Name	Last commit message	Last commit date
parent directory ..
benchmarks/mentat		benchmarks/mentat
exercise_runners		exercise_runners
migrations		migrations
resources/templates		resources/templates
README.md		README.md
__init__.py		__init__.py
arg_parser.py		arg_parser.py
benchmark_result.py		benchmark_result.py
benchmark_result_list.py		benchmark_result_list.py
benchmark_run.py		benchmark_run.py
benchmark_run_summary.py		benchmark_run_summary.py
benchmark_runner.py		benchmark_runner.py
context_benchmark.py		context_benchmark.py
edit_rubric_benchmark.py		edit_rubric_benchmark.py
exercism_practice.py		exercism_practice.py
plot_generator.py		plot_generator.py
run_sample.py		run_sample.py

README.md

Benchmarks

In this directory we write benchmarks for Mentat's performance on different tasks.

Running Exercism Benchmarks

./benchmarks/exercism_practice.py

Flags that control the performance of the benchmarks are defined here and set conservatively so benchmarks without flags will run relatively quickly and cheaply. To run the exercism benchmark with multiple workers on all the tests with one retry for the clojure language run the following:

./benchmarks/exercism_practice.py  --max_benchmarks 134 --max_iterations 2 --max_workers 2 --language clojure

Warning: If you increase max_workers much higher you'll start to get rate limited.

Running Real World Benchmarks

./benchmarks/benchmark_runner.py

Making Real World Benchmarks

Real world benchmarks can either be samples or python files.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

benchmarks

benchmarks

README.md

Benchmarks

Running Exercism Benchmarks

Running Real World Benchmarks

Making Real World Benchmarks

Files

benchmarks

Directory actions

More options

Directory actions

More options

Latest commit

History

benchmarks

Folders and files

parent directory

README.md

Benchmarks

Running Exercism Benchmarks

Running Real World Benchmarks

Making Real World Benchmarks