Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cluster-trace-gpu-v2020中pai_sensor_table 中cpu_usage, gpu_wrk_util, avg_gpu_wrk_mem , max_gpu_wrk_mem 疑问 #182

Closed
Jackjiayou opened this issue Apr 9, 2023 · 2 comments

Comments

@Jackjiayou
Copy link

请问pai_sensor_table 中cpu_usage, gpu_wrk_util, avg_gpu_wrk_mem , max_gpu_wrk_mem代表一个任务(以worker_name区分)所用资源还是在这个任务执行的同时还会有别的任务一起使用cpu或者gpu从而代表着这个执行任务那一时刻这台机器所被占用的资源(包含其他任务)

@qzweng
Copy link
Collaborator

qzweng commented Apr 10, 2023

data cpu_usage, gpu_wrk_util, avg_gpu_wrk_mem, max_gpu_wrk_mem in pai_sensor_table is the metric for each instance, not the machine.

@qzweng qzweng closed this as completed Apr 10, 2023
@BhAem
Copy link

BhAem commented Sep 23, 2024

What does it mean if the gpu_wrk_util is greater than 100%?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants