Confused price with cut (#190)
This commit is contained in:
		
				
					committed by
					
						
						Hadley Wickham
					
				
			
			
				
	
			
			
			
						parent
						
							149c27ef61
						
					
				
				
					commit
					c0ae8ee290
				
			
							
								
								
									
										2
									
								
								EDA.Rmd
									
									
									
									
									
								
							
							
						
						
									
										2
									
								
								EDA.Rmd
									
									
									
									
									
								
							@@ -311,7 +311,7 @@ ggplot(data = diamonds, mapping = aes(x = price, y = ..density..)) +
 | 
			
		||||
  geom_freqpoly(aes(colour = cut), binwidth = 500)
 | 
			
		||||
```
 | 
			
		||||
 | 
			
		||||
There's something rather surprising about this plot - it appears that fair diamonds (the lowest quality) have the highest average cut!  But maybe that's because frequency polygons are a little hard to interpret - there's a lot going on in this plot.
 | 
			
		||||
There's something rather surprising about this plot - it appears that fair diamonds (the lowest quality) have the highest average price!  But maybe that's because frequency polygons are a little hard to interpret - there's a lot going on in this plot.
 | 
			
		||||
 | 
			
		||||
Another alternative to display the distribution of a continuous variable broken down by a categorical variable is the boxplot. A **boxplot** is a type of visual shorthand for a distribution of values that is popular among statisticians. Each boxplot consists of:
 | 
			
		||||
 | 
			
		||||
 
 | 
			
		||||
		Reference in New Issue
	
	Block a user