GitHub - MrZilinXiao/LLMHeuristicReHEAT: [COLM'24] Official Implementation of `Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs`

Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs

This repository contains the code and data for the paper "Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs" by Pengda Wang, Zilin Xiao, Hanjie Chen and Frederick L. Oswald.

[arxiv]

Data

We release the first version of data in test_cases_0320.jsonl with the following schema:

{"test_case": "Suppose we know that the number of graduate students...", "type": "Base Rate Fallacy", "ground_truth": "[2] Social science and social work > [4] Engineering"}
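The record above can be read with a few lines of standard-library Python. This is a minimal sketch, not part of the released toolkit: the helper names `load_test_cases` and `count_by_type` are hypothetical, and it assumes every line of the file is a JSON object carrying the three keys shown in the schema example.

```python
import json
from collections import Counter
from pathlib import Path

# Keys taken from the schema example in this README.
REQUIRED_KEYS = {"test_case", "type", "ground_truth"}


def load_test_cases(path):
    """Read one JSON object per non-empty line of a JSONL file (hypothetical helper)."""
    cases = []
    with Path(path).open(encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            record = json.loads(line)
            missing = REQUIRED_KEYS - record.keys()
            if missing:
                raise ValueError(f"line {line_no}: missing keys {sorted(missing)}")
            cases.append(record)
    return cases


def count_by_type(cases):
    """Count test cases per value of the "type" field (hypothetical helper)."""
    return Counter(case["type"] for case in cases)
```

Calling `load_test_cases("test_cases_0320.jsonl")` from the repository root should return a list of dictionaries, and `count_by_type` on that list gives the number of test cases per heuristic type.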

We also release the evaluation toolkit in eval_kit/eval_0219.py.

Code

See the run_heuristic_*.py scripts (run_heuristic_gpt.py, run_heuristic_llama.py, run_heuristic_palm.py) for running the LLM queries against different endpoints.

BibTeX

If you use this code or data, it would be great if you could cite our paper:

@article{wang2024real,
  title   = {Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs},
  author  = {Pengda Wang and Zilin Xiao and Hanjie Chen and Frederick L. Oswald},
  year    = {2024},
  journal = {arXiv preprint arXiv:2404.01461}
}

Folders and files

eval_kit
.gitignore
LICENSE
README.md
common_prompt_kit.py
run_heuristic_gpt.py
run_heuristic_llama.py
run_heuristic_palm.py
run_self_con_eval.py
test_cases_0320.jsonl
