From c0ae8ee290a7c282c5b93be0d4c689ca42cea248 Mon Sep 17 00:00:00 2001 From: OaCantona Date: Mon, 25 Jul 2016 22:00:06 +0200 Subject: [PATCH] Confused price with cut (#190) --- EDA.Rmd | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/EDA.Rmd b/EDA.Rmd index fbc5293..3420ea0 100644 --- a/EDA.Rmd +++ b/EDA.Rmd @@ -311,7 +311,7 @@ ggplot(data = diamonds, mapping = aes(x = price, y = ..density..)) + geom_freqpoly(aes(colour = cut), binwidth = 500) ``` -There's something rather surprising about this plot - it appears that fair diamonds (the lowest quality) have the highest average cut! But maybe that's because frequency polygons are a little hard to interpret - there's a lot going on in this plot. +There's something rather surprising about this plot - it appears that fair diamonds (the lowest quality) have the highest average price! But maybe that's because frequency polygons are a little hard to interpret - there's a lot going on in this plot. Another alternative to display the distribution of a continuous variable broken down by a categorical variable is the boxplot. A **boxplot** is a type of visual shorthand for a distribution of values that is popular among statisticians. Each boxplot consists of: