Model Evaluation and Threat Research (METR)

METR is a research nonprofit that works on assessing whether cutting-edge AI systems could pose catastrophic risks to society.

We build the science of accurately assessing risks, so that humanity is informed before developing transformative AI systems.

Read more about our work here.

Our Software

Popular repositories

  1. task-standard (Public)

    METR Task Standard

    TypeScript, 147 stars, 32 forks

  2. vivaria (Public)

    Vivaria is METR's tool for running evaluations and conducting agent elicitation research.

    TypeScript, 89 stars, 31 forks

  3. public-tasks (Public)

    HTML, 88 stars, 10 forks

  4. RE-Bench (Public)

    Python, 71 stars, 6 forks

  5. eval-analysis-public (Public)

    Public repository containing METR's DVC pipeline for eval data analysis

    Python, 43 stars, 9 forks

  6. hcast-public (Public)

    HTML, 10 stars, 1 fork

