forked from stanford-crfm/helm
-
Notifications
You must be signed in to change notification settings - Fork 4
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add index page and navigation to documentation
- Loading branch information
Showing
6 changed files
with
48 additions
and
12 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,20 +1,17 @@ | ||
<!--intro-start--> | ||
# Holistic Evaluation of Language Models | ||
|
||
[comment]: <> (When using the img tag, which allows us to specify size, src has to be a URL.) | ||
<img src="https://github.com/stanford-crfm/helm/raw/main/src/helm/benchmark/static/images/helm-logo.png" alt="" width="800"/> | ||
|
||
Welcome! This repository contains all the assets for [Holistic Evaluation of Language Models](https://arxiv.org/abs/2211.09110), | ||
which includes the following features: | ||
Welcome! The **`crfm-helm`** Python package contains code used in the **Holistic Evaluation of Language Models** project ([paper](https://arxiv.org/abs/2211.09110), [website](https://crfm.stanford.edu/helm/v1.0/)) by [Stanford CRFM](https://crfm.stanford.edu/). This package includes the following features: | ||
|
||
- Collection of datasets in a standard format (e.g., NaturalQuestions) | ||
- Collection of models accessible via a unified API (e.g., GPT-3, MT-NLG, OPT, BLOOM) | ||
- Collection of metrics beyond accuracy (efficiency, bias, toxicity, etc.) | ||
- Collection of perturbations for evaluating robustness and fairness (e.g., typos, dialect) | ||
- Modular framework for constructing prompts from datasets | ||
- Proxy server for managing accounts and providing unified interface to access models | ||
<!--intro-end--> | ||
|
||
To read more: | ||
|
||
- [Setup](docs/setup.md): how to run the code | ||
- [Code](docs/code.md): how to contribute new scenarios or models | ||
- [Running the proxy server](docs/proxy-server.md) | ||
- [Running the benchmark](docs/benchmark.md) | ||
- [Deployment](docs/deployment.md): for CRFM maintainers of the proxy server | ||
To get started, refer to [the documentation on Read the Docs](https://crfm-helm.readthedocs.io/) for how to install and run the package. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
{% | ||
include-markdown "../README.md" | ||
start="<!--intro-start-->" | ||
end="<!--intro-end-->" | ||
%} | ||
|
||
The code is [hosted on GitHub here](https://github.com/stanford-crfm/helm/). | ||
|
||
To run the code, refer to the User Guide's chapters: | ||
|
||
- [Installation](installation.md) | ||
- [Quick Start](quick_start.md) | ||
- [Tutorial](tutorial.md) | ||
|
||
To add new models and scenarios, refer to the Developer Guide's chapters: | ||
|
||
- [Developer Setup](developer_setup.md) | ||
- [Code Structure](code.md) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,4 @@ | ||
mkdocs==1.4.2 | ||
mkdocstrings[python]==0.19.0 | ||
mkdocs-macros-plugin==0.7.0 | ||
mkdocs-include-markdown-plugin==4.0.0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters