a missing word inserted (#1298)
* a missing word corrected * CLT versus LLN * summary statistics instead of summary * Update data-transform.qmd --------- Co-authored-by: Mine Cetinkaya-Rundel <cetinkaya.mine@gmail.com>
This commit is contained in:
parent
df90abbde5
commit
e827725d8e
|
@ -699,9 +699,9 @@ ggplot(delays, aes(x = n, y = delay)) +
|
|||
```
|
||||
|
||||
Not surprisingly, there is much greater variation in the average delay when there are few flights for a given plane.
|
||||
The shape of this plot is very characteristic: whenever you plot a mean (or other summary) vs. group size, you'll see that the variation decreases as the sample size increases[^data-transform-4].
|
||||
The shape of this plot is very characteristic: whenever you plot a mean (or other summary statistics) vs. group size, you'll see that the variation decreases as the sample size increases[^data-transform-4].
|
||||
|
||||
[^data-transform-4]: \*cough\* the central limit theorem \*cough\*.
|
||||
[^data-transform-4]: \*cough\* the law of large numbers \*cough\*.
|
||||
|
||||
When looking at this sort of plot, it's often useful to filter out the groups with the smallest numbers of observations, so you can see more of the pattern and less of the extreme variation in the smallest groups:
|
||||
|
||||
|
|
Loading…
Reference in New Issue