Skip to content

Commit

Permalink
Update ClusterData2019.md
Browse files Browse the repository at this point in the history
Tweaked and clarified description.
  • Loading branch information
johnwilkes authored May 1, 2020
1 parent 53f120e commit a27ddd7
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions ClusterData2019.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,17 +8,17 @@ The `clusterdata-2019` trace dataset provides information about eight different
* information about alloc sets (shared resource reservations used by jobs);
* job-parent information for master/worker relationships such as MapReduce jobs.

Just like the [version 2](ClusterData2011_2.md) trace, these new traces focus on resource requests and usage, and contain no information about end users, their data, or access patterns to storage systems and other services.
The 2019 traces focus on resource requests and usage, and contain no information about end users, their data, or access patterns to storage systems and other services.

In addition to providing a downloadable format, we are also making the trace data available via Google BigQuery so that sophisticated analyses can be performed without requiring local resources.
Because of it's size (about 2.4TiB compressed), we are only making the trace data available via [Google BigQuery](https://cloud.google.com/bigquery) so that sophisticated analyses can be performed without requiring local resources.

**The `clusterdata-2019` traces are described in this document:
[Google cluster-usage traces v3](https://drive.google.com/file/d/10r6cnJ5cJ89fPWCgj7j4LtLBqYN9RiI9/view).** You can find the download and access instructions there, as well as many more details about what is in the traces, and how to interpret them. For additional background information, please refer to the 2015 Borg paper, [Large-scale cluster management at Google with Borg](https://ai.google/research/pubs/pub43438).

* If you haven't already joined our
[mailing list](https://groups.google.com/forum/#!forum/googleclusterdata-discuss),
please do so now.
*Important: to avoid spammers, you MUST fill out the "reason" field, or your application will be rejected.*

**The `clusterdata-2019` traces are described in this document:
[Google cluster-usage traces v3](https://drive.google.com/file/d/10r6cnJ5cJ89fPWCgj7j4LtLBqYN9RiI9/view).** You can find the download and access instructions there, as well as many more details about what is in the traces, and how to interpret them. For additional background information, please refer to the 2015 Borg paper, [Large-scale cluster management at Google with Borg](https://ai.google/research/pubs/pub43438).

![Creative Commons CC-BY license](https://i.creativecommons.org/l/by/4.0/88x31.png)
The data and trace documentation are made available under the
Expand Down

0 comments on commit a27ddd7

Please sign in to comment.