The Black Spatula Project

Overview

The goal of this project is to evaluate whether AI models (initially OpenAI's "o1" and possibly "o1-pro") can reliably identify factual, logical, and mathematical errors in published scientific papers. We will measure:

Number of errors detected
Severity of identified errors
False positive rate
Effort required to verify AI findings
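
As an illustration only, the four measurements above could be tallied from human-verified findings along the following lines. This is a minimal sketch, not the project's actual pipeline: the `Finding` record, its field names, the 1-to-3 severity scale, and the `summarize` helper are all hypothetical. It also assumes "false positive rate" means the share of AI-flagged findings that a human reviewer could not confirm.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Finding:
    """One issue flagged by the model (hypothetical record layout)."""
    paper_id: str
    severity: int          # assumed scale: 1 = minor, 2 = moderate, 3 = major
    confirmed: bool        # True if a human reviewer verified a real error
    review_minutes: float  # human time spent checking this finding


def _mean(values: list) -> Optional[float]:
    """Arithmetic mean, or None for an empty list."""
    return sum(values) / len(values) if values else None


def summarize(findings: list) -> dict:
    """Aggregate the four measurements over a list of findings."""
    confirmed = [f for f in findings if f.confirmed]
    unconfirmed = len(findings) - len(confirmed)
    return {
        "errors_detected": len(confirmed),
        "mean_severity": _mean([f.severity for f in confirmed]),
        # Assumption: unconfirmed flags divided by all flags raised.
        "false_positive_rate": unconfirmed / len(findings) if findings else None,
        "mean_review_minutes": _mean([f.review_minutes for f in findings]),
    }


if __name__ == "__main__":
    # Made-up example data, for demonstration only.
    sample = [
        Finding("paper-a", 3, True, 10.0),
        Finding("paper-a", 1, True, 20.0),
        Finding("paper-b", 2, True, 30.0),
        Finding("paper-b", 1, False, 40.0),
    ]
    print(summarize(sample))
```

Review time is averaged over every finding, confirmed or not, because rejected findings still cost reviewer effort.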

Background

This project is named after a recent high-profile paper on black plastic kitchen utensils that contained a simple but consequential math error. This mistake, which passed peer review, could have been flagged by an AI reviewer.

udosreis/black-spatula-project

Folders and files

Latest commit

History

Repository files navigation

The Black Spatula Project

Overview

Background

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages