[UR][Benchmarks] Add flamegraphs to benchmark results #19678

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

mateuszpn wants to merge 3 commits into intel:sycl from mateuszpn:flamegraphs

Contributor

mateuszpn commented Aug 1, 2025

Adds presentation of perf results as flamegraphs

mateuszpn added 2 commits

July 31, 2025 15:43


          extend data.js with necessary variables

261c837


          Add flamegraphs to benchmarks

a51c767

Signed-off-by: Mateusz P. Nowak <[email protected]>

mateuszpn changed the title ~~[UR][Benchmarks]~~ [UR][Benchmarks] Add flamegraphs to benchmark reslts

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 13:58

— with

GitHub Actions Inactive

mateuszpn changed the title ~~[UR][Benchmarks] Add flamegraphs to benchmark reslts~~ [UR][Benchmarks] Add flamegraphs to benchmark results

mateuszpn marked this pull request as ready for review

August 4, 2025 13:59

mateuszpn requested a review from a team as a code owner

August 4, 2025 13:59

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 14:19

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 14:19

— with

GitHub Actions Inactive

mateuszpn force-pushed the flamegraphs branch from dfbfad0 to e4dd31b Compare

August 4, 2025 14:31

mateuszpn requested a review from a team as a code owner

August 4, 2025 14:31

mateuszpn had a problem deploying to WindowsCILock

August 4, 2025 14:31

— with

GitHub Actions Error

mateuszpn force-pushed the flamegraphs branch from e4dd31b to e0390c8 Compare

August 4, 2025 14:32

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 14:32

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 15:07

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 15:07

— with

GitHub Actions Inactive


          Merge remote-tracking branch 'upstream/sycl' into flamegraphs

e0390c8

Signed-off-by: Mateusz P. Nowak <[email protected]>

pbalcer reviewed

View reviewed changes

devops/scripts/benchmarks/benches/base.py

Comment on lines 128 to +131

                       run_unitrace=False,
                       extra_unitrace_opt=None,
+                      run_flamegraph=False,
+                      extra_perf_opt=None,  # VERIFY

Contributor

pbalcer Aug 6, 2025

You already added a tracing type enum. I'd extend this to be some sort of generic "TraceTool", and here, in run_bench, I suggest simply accepting a generic trace tool (I imagine you wouldn't want to enable two at the same time).

devops/scripts/benchmarks/output_html.py

@@ @@ -24,24 +25,67 @@ def _write_output_to_file( @@
                   if options.output_html == "local":
                       data_path = os.path.join(html_path, f"{filename}.js")
+                      # Check if the file exists and has flamegraph data that we need to preserve
+                      existing_flamegraph_data = None

Contributor

pbalcer Aug 6, 2025

why do we need to do this? we store all the other results separately from the html output.

pbalcer reviewed

View reviewed changes

devops/scripts/benchmarks/utils/flamegraph.py

+                          options.workdir,
+                          "flamegraph-repo",
+                          "https://github.com/brendangregg/FlameGraph.git",
+                          "master",

Contributor

pbalcer Aug 6, 2025

don't clone master, use a fixed commit.

devops/scripts/benchmarks/utils/flamegraph.py

+                          "master",
+                      )
+                      # FlameGraph doesn't need building, just verify scripts exist and are executable

Contributor

pbalcer Aug 6, 2025

We don't check this anywhere else. I'm not sure if this would ever find an issue?

devops/scripts/benchmarks/utils/flamegraph.py

+                          )
+                  def _prune_flamegraph_dirs(self, res_dir: str, FILECNT: int = 10):
+                      """Keep only the last FILECNT files in the flamegraphs directory."""

Contributor

pbalcer Aug 6, 2025

this seems similar to what you have for unitrace. can we share code?

devops/scripts/benchmarks/utils/flamegraph.py

+                              "record",
+                              "-g",  # Enable call-graph recording
+                              "-F",
+                              "99",  # Sample frequency

Contributor

pbalcer Aug 6, 2025

this seems low. we should experiment with different values and pick what gives us the best flamecharts.

devops/scripts/benchmarks/utils/flamegraph.py

Comment on lines +142 to +227

+                  def handle_output(self, bench_name: str, perf_data_file: str):
+                      """
+                      Generate SVG flamegraph from perf data file.
+                      Returns the path to the generated SVG file.
+                      """
+                      if not os.path.exists(perf_data_file) or os.path.getsize(perf_data_file) == 0:
+                          raise FileNotFoundError(
+                              f"Perf data file not found or empty: {perf_data_file}"
+                          )
+                      # Generate output SVG filename following same pattern as perf data
+                      svg_file = perf_data_file.replace(".perf.data", ".svg")
+                      folded_file = perf_data_file.replace(".perf.data", ".folded")
+                      try:
+                          # Step 1: Convert perf script to folded format
+                          log.debug(f"Converting perf data to folded format: {folded_file}")
+                          with open(folded_file, "w") as f_folded:
+                              # Run perf script to get the stack traces
+                              perf_script_proc = subprocess.Popen(
+                                  ["perf", "script", "-i", perf_data_file],
+                                  stdout=subprocess.PIPE,
+                                  stderr=subprocess.DEVNULL,
+                                  text=True,
+                              )
+                              # Pipe through stackcollapse-perf.pl
+                              stackcollapse_perf_path = os.path.join(
+                                  self.repo_dir, "stackcollapse-perf.pl"
+                              )
+                              stackcollapse_proc = subprocess.Popen(
+                                  [stackcollapse_perf_path],
+                                  stdin=perf_script_proc.stdout,
+                                  stdout=f_folded,
+                                  stderr=subprocess.DEVNULL,
+                                  text=True,
+                              )
+                              perf_script_proc.stdout.close()
+                              stackcollapse_proc.wait()
+                              perf_script_proc.wait()
+                          # Step 2: Generate flamegraph SVG
+                          log.debug(f"Generating flamegraph SVG: {svg_file}")
+                          flamegraph_pl_path = os.path.join(self.repo_dir, "flamegraph.pl")
+                          with open(folded_file, "r") as f_folded, open(svg_file, "w") as f_svg:
+                              flamegraph_proc = subprocess.Popen(
+                                  [
+                                      flamegraph_pl_path,
+                                      "--title",
+                                      f"{options.save_name} - {bench_name}",
+                                      "--width",
+                                      str(
+                                          self.FLAMEGRAPH_WIDTH
+                                      ),  # Fit within container without scrollbars
+                                  ],
+                                  stdin=f_folded,
+                                  stdout=f_svg,
+                                  stderr=subprocess.DEVNULL,
+                                  text=True,
+                              )
+                              flamegraph_proc.wait()
+                          # Clean up intermediate files
+                          if os.path.exists(folded_file):
+                              os.remove(folded_file)
+                          if not os.path.exists(svg_file) or os.path.getsize(svg_file) == 0:
+                              raise RuntimeError(f"Failed to generate flamegraph SVG: {svg_file}")
+                          log.debug(f"Generated flamegraph: {svg_file}")
+                          # Create symlink immediately after SVG generation
+                          self._create_immediate_symlink(svg_file)
+                          # Prune old flamegraph directories
+                          self._prune_flamegraph_dirs(os.path.dirname(perf_data_file))
+                          return svg_file
+                      except Exception as e:
+                          # Clean up on failure
+                          for temp_file in [folded_file, svg_file]:
+                              if os.path.exists(temp_file):
+                                  os.remove(temp_file)
+                          raise RuntimeError(f"Failed to generate flamegraph for {bench_name}: {e}")

Contributor

pbalcer Aug 6, 2025 •

edited

Loading

use run helpers... I suggest expanding run to support redirecting stdout to a file (and then not printing it to the console).

devops/scripts/benchmarks/utils/flamegraph.py

+                      except Exception as e:
+                          log.debug(f"Failed to create immediate symlink for {svg_file}: {e}")
+                  def _update_flamegraph_manifest(

Contributor

pbalcer Aug 6, 2025

I genuinely don't understand the idea here. data.js is purely an output file. we should not parse it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet