Update the preface

This commit is contained in:
Hadley Wickham 2022-11-21 08:58:22 -06:00
parent 44d978decc
commit 825a2b2e1b
1 changed files with 15 additions and 17 deletions

View File

@ -1,28 +1,26 @@
# Preface to the second edition {.unnumbered}
Welcome to the second edition of "R for Data Science".
Welcome to the second edition of "R for Data Science"!
This is a major reworking of the first edition, removing material we no longer think is useful, adding material we wish we included in the first edition, and generally updating the text and code to reflect changes in best practices.
We're also very excited to welcome a new co-author: Mine Çetinkaya-Rundel, a noted data science educator and one of our colleagues at Posit (the company formerly known as RStudio).
## Major changes {.unnumbered}
A brief summary of the biggest changes follows:
- The first part is renamed to "whole game" to reflect the entire data science cycle.
It gains a new chapter that briefly introduces the basics of reading data from csv files.
- The first part of the book has been renamed to "Whole game".
The goal of this section is to give you the rough details of the "whole game" of data science before we dive into the details.
- We've added a new part called visualize.
- The wrangle part is now transform and gains new chapters on numbers, logical vectors, and missing values.
- The second part of the book is now called "Transform" and gains chapters new chapters on numbers, logical vectors, and missing values.
These were previously parts of the data transformation chapter, but needed much more room.
- We've added new chapters on column-wise and row-wise operations.
- The third part of the book is called "Wrangle".
It's a new set of chapters that goes beyond reading flat text files to now embrace working with spreadsheets, getting data out of databases, rectangling tree-like data, and scraping data from web sites.
- We've added a new set of chapters on import that goes beyond importing rectangular data to include chapters on working with spreadsheets, databases, and scraping data from the web.
- The "Program" part continues, but has been rewritten from top-to-bottom to focus on the most important parts of function writing and iteration.
Function writing now includes sections on how to wrap tidyverse functions (dealing with the challenges of tidy evaluation), since this has become much easier over the last few years.
We've added a new chapter on important Base R functions that you're likely to see when reading R code found in the wild.
- The modeling part has been removed.
For modeling, we recommend using packages from [tidymodels](https://www.tidymodels.org/) and reading [Tidy Modeling with R](https://www.tmwr.org/) by Max Kuhn and Julia Silge to learn more about them.
We never had enough room to fully do modelling justice, and there are now much better resources available.
We geneally recommend using the [tidymodels](https://www.tidymodels.org/) packages and reading [Tidy Modeling with R](https://www.tmwr.org/) by Max Kuhn and Julia Silge.
- We've switched from the magrittr pipe to the base pipe.
- The communicate part now features writing computational documents with Quarto.
## Acknowledgements {.unnumbered}
*TO DO: Add acknowledgements.*
Other changes include switching from magrittr's pipe (`%>%`) to the base pipe (`|>`) and switching from RMarkdown to Quarto.