Run the baseline:
PYTHONPATH=. python experiments/run_mmlu.py --mode=DirectAnswer
Run the fully-connected swarm ablation:
PYTHONPATH=. python experiments/run_mmlu.py --num-truthful-agents=3 --mode=FullConnectedSwarm
Run the randomly-connected swarm ablation:
PYTHONPATH=. python experiments/run_mmlu.py --num-truthful-agents=3 --mode=RandomSwarm
Run the main experiment, which optimizes the swarm and then evaluates it:
PYTHONPATH=. python experiments/run_mmlu.py --num-truthful-agents=3 --mode=OptimizedSwarm
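The four MMLU commands above differ only in `--mode` and in whether `--num-truthful-agents=3` is passed, so they can be driven from one loop. The following is a minimal sketch, not part of the repository: the `mmlu_command` function and the `DRY_RUN` variable are hypothetical names introduced here for illustration, and the script only prints the commands unless told otherwise.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not shipped with the repo): print, and optionally run,
# the four MMLU commands listed above, in the same order.
set -euo pipefail

# Assumption: DRY_RUN is our own switch. The default (1) only prints commands.
DRY_RUN="${DRY_RUN:-1}"

# Build the command line for a given mode, exactly as written in this README.
mmlu_command() {
  local mode="$1"
  if [ "$mode" = "DirectAnswer" ]; then
    # The baseline is shown above without --num-truthful-agents.
    echo "PYTHONPATH=. python experiments/run_mmlu.py --mode=$mode"
  else
    echo "PYTHONPATH=. python experiments/run_mmlu.py --num-truthful-agents=3 --mode=$mode"
  fi
}

for mode in DirectAnswer FullConnectedSwarm RandomSwarm OptimizedSwarm; do
  cmd="$(mmlu_command "$mode")"
  echo "$cmd"
  if [ "$DRY_RUN" = "0" ]; then
    # eval is needed so the leading PYTHONPATH=. assignment is honored.
    eval "$cmd"
  fi
done
```

Run from the repository root, this prints the four commands; setting `DRY_RUN=0` executes them one after another and stops at the first failure because of `set -e`.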
Run the REINFORCE algorithm for edge optimization with three agents, as described in the paper:
PYTHONPATH=. python experiments/run_crosswords.py
Run node optimization, which improves the demonstration examples of each node:
PYTHONPATH=. python experiments/run_humaneval.py --learn_demonstration True
Run the general assistant tasks:
PYTHONPATH=. python experiments/run_gaia.py