Dumping some content into code style chapter

This commit is contained in:
Hadley Wickham 2022-02-15 15:49:31 -06:00
parent 5b7f2de32d
commit a3c9bf0f1b
3 changed files with 55 additions and 20 deletions

View File

@ -83,16 +83,3 @@ Must be named.
Doesn't solve problem above, but helps out in lots of other places.
Expect it to continue to evolve.
## When not to use the pipe
The pipe is such fun to use, it's easy to go overboard and use pipes when better alternatives exists.
Pipes are most useful for rewriting a fairly short linear sequence of operations.
I think you should reach for another tool when:
- Your pipes are longer than (say) ten steps.
In that case, create intermediate objects with meaningful names.
That will make debugging easier, because you can more easily check the intermediate results, and it makes it easier to understand your code, because the variable names can help communicate intent.
- You have multiple inputs or outputs.
If there isn't one primary object being transformed, but two or more objects being combined together, don't use the pipe.

View File

@ -15,6 +15,10 @@ Keep experimenting in the console, but once you have written code that works and
RStudio will automatically save the contents of the editor when you quit RStudio, and will automatically load it when you re-open.
Nevertheless, it's a good idea to save your scripts regularly and to back them up.
TODO: Add file naming advice
TODO: Add advice about creating sections
## Running code
The script editor is also a great place to build up complex ggplot2 plots or long sequences of dplyr manipulations.

View File

@ -1,28 +1,72 @@
# Workflow: code style {#workflow-style}
- Indenting and line breaks
```{r, results = "asis", echo = FALSE}
status("drafting")
```
Good coding style is like correct punctuation: you can manage without it, butitsuremakesthingseasiertoread.
<https://style.tidyverse.org>, <https://style.tidyverse.org/pipes.html>
- Variables should use only lowercase letters, numbers, and `_`.
Use underscores (`_`) (so called snake case) to separate words within a name.
Better to use overly long names than overly short.
Autocomplete means that
- Do not put spaces inside or outside parentheses for regular function calls.
Always put a space after a comma, never before, just like in regular English.
- Most operators ([`==`](https://rdrr.io/r/base/Comparison.html), [`+`](https://rdrr.io/r/base/Arithmetic.html), [`-`](https://rdrr.io/r/base/Arithmetic.html), [`<-`](https://rdrr.io/r/base/assignOps.html), etc.) should be surrounded by spaces; the chief exception is `^`.
- `%>%` should always have a space before it, and should usually be followed by a new line.
After the first step, each line should be indented by two spaces.
This structure makes it easier to add new steps (or rearrange existing steps) and harder to overlook a step.
- In a pipeline, put each function on its on line.
And if the function as named arguments (`=`) then put each of those on a single line.
```{r, eval = FALSE}
df %>% mutate(y = x + 1)
df |> mutate(y = x + 1)
# vs
df %>%
df |>
mutate(
y = x + 1
)
```
- `mutate(df, y = x + 1)` vs `df %>% mutate(df, y = x + 1)`
Line up the opening and closing parens.
- It's ok to add extra spaces if it improves alignment of [`=`](https://rdrr.io/r/base/assignOps.html).
- For very short snippets that fit on one line, it's ok to write (e.g.) `mutate(df, y = x + 1)` vs `df %>% mutate(df, y = x + 1)`.
But in my experience, short snippets often grow longer, so you'll save time in the long run but start out how you wish to continue.
- Use empty lines to organize your code into "paragraphs" of related thoughts.
- with ggplot2
```{r, eval = FALSE}
df %>%
df |>
ggplot(aes())
```
Don't forget to switch to plus!
- How long should your pipes be?
Too long vs too short
Too long vs too short.
Your pipes are longer than (say) ten steps.
In that case, create intermediate objects with meaningful names.
That will make debugging easier, because you can more easily check the intermediate results, and it makes it easier to understand your code, because the variable names can help communicate intent.
- Restyling with style.
- Whenever you can give something an informative name, you should give it an informative name.
Don't expect to get it right the first time!
It's highly recommended to regularly spend some time just working on the clarity of your code.
The results might be exactly the same but it's not wasted effort: when you come back to the code in the future, you'll find it easier to remember what you did and easy to adapt to new demands.
- Strive to limit your code to 80 characters per line.
This fits comfortably on a printed page with a reasonably sized font.
If you find yourself running out of room, this is a good indication that you should encapsulate some of the work in a separate function.
- Restyling with styler.
- In data analysis code, use comments to record important findings and analysis decisions.