Skip to content

Commit

Permalink
Feat/3414 debug pprof (influxdata#3840)
Browse files Browse the repository at this point in the history
* feat: add v2.1 runtime doc for /debug/pprof profiling, trace, and command-line (influxdata#3414).

* fix: runtime description. (influxdata#3414)

* Apply suggestions from code review

Co-authored-by: kelseiv <[email protected]>

* fix: apply PR suggestions (influxdata#3414).

Co-authored-by: kelseiv <[email protected]>
  • Loading branch information
jstirnaman and kelseiv authored Mar 9, 2022
1 parent 274ad2a commit b7c984e
Showing 1 changed file with 347 additions and 0 deletions.
347 changes: 347 additions & 0 deletions content/influxdb/v2.1/reference/internals/runtime.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,347 @@
---
title: InfluxDB runtime
description: >
Learn how to collect Go runtime profiling and tracing information to help with InfluxDB performance analysis and debugging.
menu:
influxdb_2_1_ref:
name: Runtime
parent: InfluxDB internals
weight: 103
influxdb/v2.1/tags: [go, internals, performance]
---

InfluxDB provides Go runtime profiles, trace, and other information
useful for analyzing and debugging the server runtime execution.

- [Overview of Go runtime profiles](#overview-of-go-runtime-profiles)
- [Analyze Go runtime profiles](#analyze-go-runtime-profiles)
- [Analyze the Go runtime trace](#analyze-the-go-runtime-trace)
- [View the command line that invoked InfluxDB](#view-the-command-line-that-invoked-influxdb)

## Overview of Go runtime profiles

A **Go runtime profile** is a collection of stack traces showing call sequences that led to instances of a particular event. InfluxDB provides profile data for the following events:

- blocks
- CPU usage
- memory allocation
- mutual exclusion (mutex)
- OS thread creation

When you send a profile request to InfluxDB, the [Golang runtime pprof package](https://pkg.go.dev/runtime/pprof) samples the events on the runtime to collect stack traces and statistics (e.g., number of bytes of memory for heap allocation events). For some profiles, you can set the number of seconds that InfluxDB will collect profile data.

Once data collection is complete, InfluxDB returns the profile data. The default response format is a compressed protocol buffer in
[profile.proto](https://github.com/google/pprof/blob/master/proto/profile.proto) format. **profile.proto** files are compatible with the [pprof](https://github.com/google/pprof) and [`go tool pprof`](https://go.dev/blog/pprof) analysis tools. For some profiles, InfluxDB provides an alternative human-readable plain text format with comments that translate to function calls and line numbers, but the `pprof` tools and **profile.proto** format offer the following advantages:

- Read profiles from disk or HTTP.
- Aggregate and compare multiple profiles of the same type.
- Analyze and filter profile data.
- Generate visualizations and reports.

## Analyze Go runtime profiles

Use the `/debug/pprof` InfluxDB endpoints to download all the profiles at once or request them individually.

- [Get all runtime profiles](#get-all-runtime-profiles)
- [Profile all memory allocations](#profile-all-memory-allocations)
- [Profile blocking operations](#profile-blocking-operations)
- [Profile CPU](#profile-cpu)
- [Profile goroutines](#profile-goroutines)
- [Profile heap memory allocations](#profile-heap-memory-allocations)
- [Profile mutual exclusions](#profile-mutual-exclusions-mutexes)
- [Profile thread creation](#profile-thread-creation)

### Get all runtime profiles

To download all runtime profiles at once, use an HTTP client to send a `GET` request to the `/debug/pprof/all` endpoint. `go tool pprof` can't fetch profiles directly from `/debug/pprof/all`.

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/all" %}}

InfluxDB returns a gzipped tar file that contains the following profiles in the **profile.proto** format:

- `profiles/allocs.pb.gz`: [profile all memory allocations](#profile-all-memory-allocations)
- `profiles/block.pb.gz`: [profile blocking operations](#profile-blocking-operations)
- `profiles/cpu.pb.gz`: _(Optional)_ [profile CPU](#profile-cpu).
- `profiles/goroutine.pb.gz`: [profile goroutines](#profile-goroutines)
- `profiles/heap.pb.gz`: [profile heap memory allocations](#profile-heap-memory-allocations)
- `profiles/mutex.pb.gz`: [profile mutual exclusions](#profile-mutual-exclusions-mutexes)
- `profiles/threadcreate.pb.gz`: [profile thread creation](#profile-thread-creation)

| Option | Include by |
|:--------|:-----------|
| Profile CPU | Pass a [duration of seconds](/influxdb/v2.1/reference/glossary/#duration) with the `cpu` query parameter in your request URL |

Use an HTTP client like `curl` or `wget` to download profiles from `/debug/pprof/all`.

#### Example

```sh
# Use `curl` to download a `.tar.gz` of all profiles after 10 seconds of CPU sampling.
# Use `tar` to extract the profiles folder.

curl "http://localhost:8086/debug/pprof/all?cpu=10s" | tar -xz

# Analyze an extracted profile.

go tool pprof profiles/heap.pb.gz
```

### Profile all memory allocations

Profiles memory allocations and sets the default profile display to __alloc_space__,
the total number of bytes allocated since the program began (including garbage-collected bytes).

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/allocs" %}}

| Option | Include by |
|:--------|:-----------|
| Seconds to sample | Pass an [unsigned integer](/influxdb/v2.1/reference/glossary/#unsigned-integer) with the `seconds` query parameter in your request URL |
| Output plain text (mutually exclusive with `seconds`) | Pass `1` with the `debug` query parameter in your request URL |

```sh
# Analyze the profile in interactive mode.

go tool pprof http://localhost:8086/debug/pprof/allocs

# `pprof` returns the following prompt:
# Entering interactive mode (type "help" for commands, "o" for options)
# (pprof)

# At the prompt, get the top N memory allocations.

(pprof) top10
```

### Profile blocking operations

Profiles operations that led to blocking on synchronization primitives and caused Go to suspend goroutine's execution.

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/block" %}}

| Option | Include by |
|:--------|:-----------|
| Output plain text | Pass `1` with the `debug` query parameter in your request URL |

```sh
# Analyze the profile in interactive mode.

go tool pprof http://localhost:8086/debug/pprof/block

# `pprof` returns the following prompt:
# Entering interactive mode (type "help" for commands, "o" for options)
# (pprof)

# At the prompt, get the top N entries.

(pprof) top10
```

### Profile CPU

Profiles program counters sampled from the execution stack. To download the profile, use an HTTP client to send a `GET` request to the `/debug/pprof/profile` endpoint. `go tool pprof` can't fetch the CPU profile directly.

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/profile" %}}

| Option | Include by |
|:--------|:-----------|
| Seconds to sample (default `30`) | Pass an [unsigned integer](/influxdb/v2.1/reference/glossary/#unsigned-integer) with the `seconds` query parameter in your request URL |

Use an HTTP client like `curl` or `wget` to download the profile.

#### Example

```sh
# Get the profile.

curl http://localhost:8086/debug/pprof/profile -o cpu

# Analyze the profile in interactive mode.

go tool pprof ./cpu

# At the prompt, get the top N functions most often running
# or waiting during the sample period.

(pprof) top10
```

Use the `seconds` query parameter to control the sampling duration.

{{% note %}}

`/debug/pprof/profile?seconds=SECONDS` returns the same CPU profile as `/debug/pprof/all?cpu=DURATION`.

{{% /note %}}

#### Example

```sh
# Get the CPU profile after 10 seconds of sampling.

curl "http://localhost:8086/debug/pprof/profile?seconds=10" -o cpu

# Get all profiles after 10 seconds of CPU sampling.

curl "http://localhost:8086/debug/pprof/all?cpu=10s" -o all.tar.gz
```

### Profile goroutines

Profiles all current goroutines.

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/goroutine" %}}

| Option | Include by |
|:--------|:-----------|
| Seconds to sample | Pass an [unsigned integer](/influxdb/v2.1/reference/glossary/#unsigned-integer) with the `seconds` query parameter in your request URL |
| Output plain text (mutually exclusive with `seconds`) | Pass `1` with the `debug` query parameter in your request URL |

#### Example

```sh
# Analyze the profile in interactive mode.

go tool pprof http://localhost:8086/debug/pprof/goroutine

# `pprof` returns the following prompt:
# Entering interactive mode (type "help" for commands, "o" for options)
# (pprof)

# At the prompt, get the top N entries.

(pprof) top10
```

### Profile heap memory allocations

Profiles heap, or memory allocations for live objects.

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/heap" %}}

| Option | Include by |
|:--------|:-----------|
| Run garbage control before sampling | Pass `1` with the `gc` query parameter in your request URL |
| Seconds to sample | Pass an [unsigned integer](/influxdb/v2.1/reference/glossary/#unsigned-integer) with the `seconds` query parameter in your request URL |
| Output plain text (mutually exclusive with `seconds`) | Pass `1` with the `debug` query parameter in your request URL |

#### Example

```sh
# Analyze the profile in interactive mode.

go tool pprof http://localhost:8086/debug/pprof/heap

# `pprof` returns the following prompt:
# Entering interactive mode (type "help" for commands, "o" for options)
# (pprof)

# At the prompt, get the top N memory-intensive nodes.

(pprof) top10

# pprof displays the list:
# Showing nodes accounting for 142.46MB, 85.43% of 166.75MB total
# Dropped 895 nodes (cum <= 0.83MB)
# Showing top 10 nodes out of 143
```

### Profile mutual exclusions (mutexes)

Profiles holders of contended mutual exclusions (mutexes).

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/mutex" %}}

| Option | Include by |
|:--------|:-----------|
| Seconds to sample | Pass an [unsigned integer](/influxdb/v2.1/reference/glossary/#unsigned-integer) with the `seconds` query parameter in your request URL |
| Output plain text (mutually exclusive with `seconds`) | Pass `1` with the `debug` query parameter in your request URL |

#### Example

```sh
# Analyze the profile in interactive mode.

go tool pprof http://localhost:8086/debug/pprof/mutex

# `pprof` returns the following prompt:
# Entering interactive mode (type "help" for commands, "o" for options)
# (pprof)

# At the prompt, get the top N entries.

(pprof) top10
```

### Profile thread creation

Profiles operations that led to the creation of OS threads.

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/threadcreate" %}}

| Option | Include by |
|:--------|:-----------|
| Seconds to sample | Pass an [unsigned integer](/influxdb/v2.1/reference/glossary/#unsigned-integer) with the `seconds` query parameter in your request URL |
| Output plain text (mutually exclusive with `seconds`) | Pass `1` with the `debug` query parameter in your request URL |

#### Example

```sh
# Analyze the profile in interactive mode.

go tool pprof http://localhost:8086/debug/pprof/threadcreate

# `pprof` returns the following prompt:
# Entering interactive mode (type "help" for commands, "o" for options)
# (pprof)

# At the prompt, get the top N entries.

(pprof) top10
```

## Analyze the Go runtime trace

To trace execution events for InfluxDB, use the `/debug/pprof/trace`
endpoint with `go tool trace`.

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/trace" %}}

#### Example

```sh
# Download the trace file.

curl http://localhost:8086/debug/pprof/trace -o trace.out

# Analyze the trace.

go tool trace ./trace.out
```

#### Generate a pprof-like profile from trace

You can use `go tool trace` to generate _pprof-like_ profiles from
a trace file and then analyze them with `go tool pprof`.

#### Example

```sh
# Generate a profile from the downloaded trace file.

go tool trace -pprof=PROFILE_TYPE ./trace.out > PROFILE_TYPE.pprof
```

Replace *`PROFILE_TYPE`* with one of the following [Golang profile types](https://pkg.go.dev/cmd/trace):

- `net`: network blocking profile
- `sync`: synchronization blocking profile
- `syscall`: syscall blocking profile
- `sched`: scheduler latency profile

## View the command line that invoked InfluxDB

To view the command, arguments, and command-line variables that invoked InfluxDB, use the `/debug/pprof/cmdline` endpoint.

{{% api-endpoint method="GET" endpoint="http://localhost:8086/debug/pprof/cmdline" %}}

`/debug/pprof/cmdline` returns the command line invocation in plain text.

0 comments on commit b7c984e

Please sign in to comment.