Confused price with cut (#190)
This commit is contained in:
parent
149c27ef61
commit
c0ae8ee290
2
EDA.Rmd
2
EDA.Rmd
|
@ -311,7 +311,7 @@ ggplot(data = diamonds, mapping = aes(x = price, y = ..density..)) +
|
|||
geom_freqpoly(aes(colour = cut), binwidth = 500)
|
||||
```
|
||||
|
||||
There's something rather surprising about this plot - it appears that fair diamonds (the lowest quality) have the highest average cut! But maybe that's because frequency polygons are a little hard to interpret - there's a lot going on in this plot.
|
||||
There's something rather surprising about this plot - it appears that fair diamonds (the lowest quality) have the highest average price! But maybe that's because frequency polygons are a little hard to interpret - there's a lot going on in this plot.
|
||||
|
||||
Another alternative to display the distribution of a continuous variable broken down by a categorical variable is the boxplot. A **boxplot** is a type of visual shorthand for a distribution of values that is popular among statisticians. Each boxplot consists of:
|
||||
|
||||
|
|
Loading…
Reference in New Issue