r4ds/explore.Rmd

20 lines
1.5 KiB
Plaintext
Raw Normal View History

2016-04-27 15:04:29 +08:00
# (PART) Explore {-}
2016-04-21 21:01:34 +08:00
2016-08-18 20:55:19 +08:00
# Introduction {#explore-intro}
2016-02-12 06:31:34 +08:00
2016-07-22 04:27:19 +08:00
The goal of the first part of this book is to get you up to speed with the basic tools of data exploration as quickly as possible:
2016-07-19 22:39:00 +08:00
```{r echo = FALSE, out.width = "75%"}
knitr::include_graphics("diagrams/data-science-explore.png")
```
You will get frustrated when you start programming in R, because it is such a stickler for mistakes. Even one character out of place will cause it to complain. However, that frustration is both typical and temporary. It happens to everyone, and the only way to get over it is to keep trying.
2016-07-19 22:39:00 +08:00
2016-07-22 04:27:19 +08:00
Visualisation is a great place to start with R programming, because the payoff is so clear: you get to make elegant and informative plots that help you understand data. In [data visualisation] you'll dive into visualisation, learning the basic structure of a ggplot2 plot, and powerful techniques for turning data into plots.
2016-05-19 00:57:20 +08:00
Visualisation alone is typically not enough, so in [data transformation] you'll learn the key verbs that allow you to select important variables, filter out key observations, create new variables, and compute summaries.
2016-07-19 22:39:00 +08:00
2016-07-22 04:27:19 +08:00
Finally, in [exploratory data analysis], you'll combine visualisation and transformation with your curiosity and scepticism to ask and answer interesting questions about data.
2016-05-19 00:57:20 +08:00
Modelling is an important part of the exploratory process, but you don't have the skills to effectively learn or apply it yet. We'll come back to it in [modelling](#model-intro), once you're better equipped with more data wrangling and programming tools.