Skip to content

Commit

Permalink
Fix name in pivotting column (hadley#1418)
Browse files Browse the repository at this point in the history
  • Loading branch information
hadley authored Apr 10, 2023
1 parent d2b27da commit 97b9a1b
Show file tree
Hide file tree
Showing 3 changed files with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions data-tidy.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -360,7 +360,7 @@ There are two columns that are already variables and are easy to interpret: `cou
They are followed by 56 columns like `sp_m_014`, `ep_m_4554`, and `rel_m_3544`.
If you stare at these columns for long enough, you'll notice there's a pattern.
Each column name is made up of three pieces separated by `_`.
The first piece, `sp`/`rel`/`ep`, describes the method used for the diagnosis, the second piece, `m`/`f` is the `gender` (coded as a binary variable in this dataset), and the third piece, `014`/`1524`/`2534`/`3544`/`4554`/`5564/``65` is the `age` range (`014` represents 0-14, for example).
The first piece, `sp`/`rel`/`ep`, describes the method used for the diagnosis, the second piece, `m`/`f` is the `gender` (coded as a binary variable in this dataset), and the third piece, `014`/`1524`/`2534`/`3544`/`4554`/`5564`/`65` is the `age` range (`014` represents 0-14, for example).

So in this case we have six pieces of information recorded in `who2`: the country and the year (already columns); the method of diagnosis, the gender category, and the age range category (contained in the other column names); and the count of patients in that category (cell values).
To organize these six pieces of information in six separate columns, we use `pivot_longer()` with a vector of column names for `names_to` and instructors for splitting the original variable names into pieces for `names_sep` as well as a column name for `values_to`:
Expand Down Expand Up @@ -432,7 +432,7 @@ When you use `".value"` in `names_to`, the column names in the input contribute
#| label: fig-pivot-names-and-values
#| echo: false
#| fig-cap: >
#| Pivoting with `names_to = c(".value", "id")` splits the column names
#| Pivoting with `names_to = c(".value", "num")` splits the column names
#| into two components: the first part determines the output column
#| name (`x` or `y`), and the second part determines the value of the
#| `id` column.
Expand Down
Binary file modified diagrams/tidy-data.graffle
Binary file not shown.
Binary file modified diagrams/tidy-data/names-and-values.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit 97b9a1b

Please sign in to comment.