Skip to content

[COLM'24] Official Implementation of `Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs`

License

Notifications You must be signed in to change notification settings

MrZilinXiao/LLMHeuristicReHEAT

Repository files navigation

Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs

This repository contains the code and data for the paper "Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs" by Pengda Wang, Zilin Xiao, Hanjie Chen and Frederick L. Oswald.

[arxiv]

Data

We release the first version of data in test_cases_0320.jsonl with the following schema:

{"test_case": "Suppose we know that the number of graduate students...", "type": "Base Rate Fallacy", "ground_truth": "[2] Social science and social work > [4] Engineering"}

We also release the evaluation toolkit in eval_kit/eval_0219.py.

Code

Check all run_heuristic_*.py files for running the LLM queries on different endpoints.

BibTex

If you use this code or data, it would be great if you could cite our paper:

@article{wang2024real,
  title   = {Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs},
  author  = {Pengda Wang and Zilin Xiao and Hanjie Chen and Frederick L. Oswald},
  year    = {2024},
  journal = {arXiv preprint arXiv: 2404.01461}
}

About

[COLM'24] Official Implementation of `Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs`

Resources

License

Stars

Watchers

Forks

Languages