R for Data Science: Practice Exercises
3.Data visualisation (3.1~3.5)


[3.1.1 Prerequisites]

install.packages('tidyverse')
library("tidyverse")
str(mpg)
?mpg

 

 


[3.2.2 Creating a ggplot]

ggplot(data=mpg)
str(mpg)
ggplot(data = mpg) + 
  geom_point(mapping = aes(x = displ, y = hwy))
# geom_point(): draws a scatterplot
# mapping: sets the visual properties through aes(x = , y = )

 

 


[3.2.4 Exercises]
Q. Run ggplot(data = mpg). What do you see?

ggplot(data = mpg)

A. An empty rectangle: a blank plot area with nothing drawn in it, because no layer has been added yet.


Q. How many rows are in mpg? How many columns?

mpg #234X11
nrow(mpg) #234
ncol(mpg) #11

A. 234 rows and 11 columns


Q. What does the drv variable describe? Read the help for ?mpg to find out.

?mpg

A. the type of drive train, where f = front-wheel drive, r = rear wheel drive, 4 = 4wd


Q. Make a scatterplot of hwy vs cyl.

ggplot (data = mpg) +
  geom_point(mapping = aes(x=cyl, y=hwy))

ggplot(mpg, aes(x = cyl, y = hwy)) +
  geom_point()

 


Q1. What happens if you make a scatterplot of class vs drv?
Q2. Why is the plot not useful?

ggplot(data=mpg)+
  geom_point(mapping = aes(x=class, y=drv))

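A. drv and class are both categorical variables, so they are not well suited to a scatterplot.
Points are placed at (class, drv) combinations (drv takes 3 values, class takes 7), so at most 21 distinct positions can appear. Most of the 234 observations overlap, and the plot does not show how many cars fall in each combination.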

count(mpg, drv, class)
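
Since the overlap hides how many cars sit on each position, one possible alternative is to size each point by its count. The sketch below assumes geom_count() from ggplot2 is acceptable for this exercise; the names combos and p_count are made up for illustration.

```r
library(ggplot2)

# Distinct (class, drv) combinations that actually occur in mpg
combos <- unique(mpg[, c("class", "drv")])
nrow(combos)

# geom_count() sizes each point by the number of rows at that position
p_count <- ggplot(data = mpg) +
  geom_count(mapping = aes(x = class, y = drv))
p_count
```

The count(mpg, drv, class) call above gives the same numbers as a table; geom_count() only makes them visible in the plot.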

 



[3.3 Aesthetic mappings]

ggplot(data = mpg) + 
  geom_point(mapping = aes(x = displ, y = hwy, color = class))


Left

ggplot(data = mpg) + 
  geom_point(mapping = aes(x = displ, y = hwy, alpha = class))

* alpha : controls the transparency of the points

 


Right

ggplot(data = mpg) + 
  geom_point(mapping = aes(x = displ, y = hwy, shape = class))

ggplot(data = mpg) + 
  geom_point(mapping = aes(x = displ, y = hwy), color = "blue")

* color = "blue" : every point is drawn in blue



[3.3.1 Exercises]
Q. ์ด ์ฝ”๋“œ์— ๋ฌด์—‡์ด ๋ฌธ์ œ์˜€์Šต๋‹ˆ๊นŒ? ํฌ์ธํŠธ๊ฐ€ ํŒŒ๋ž€์ƒ‰์ด ์•„๋‹Œ ์ด์œ ๋Š” ๋ฌด์—‡์ž…๋‹ˆ๊นŒ?

ggplot(data = mpg) + 
  geom_point(mapping = aes(x = displ, y = hwy, color = "blue"))
ggplot(data = mpg) + 
  geom_point(mapping = aes(x = displ, y = hwy), color = "blue")

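A. The parentheses are in the wrong place. aes() passes visual properties that are mapped to variables into the mapping argument, so color = "blue" inside aes() is treated as a variable whose only value is the string "blue", and ggplot picks its own default colour for it. To make every point blue, set color outside aes(), as an argument of geom_point().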
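
The difference can be checked without looking at the rendered plot. The sketch below uses layer_data() from ggplot2 to inspect the colours each layer would actually draw; the names p_mapped and p_fixed are made up for illustration.

```r
library(ggplot2)

# color inside aes(): "blue" is treated as data, a one-level categorical variable
p_mapped <- ggplot(data = mpg) +
  geom_point(mapping = aes(x = displ, y = hwy, color = "blue"))

# color outside aes(): "blue" is a fixed setting for the whole layer
p_fixed <- ggplot(data = mpg) +
  geom_point(mapping = aes(x = displ, y = hwy), color = "blue")

# layer_data() returns the computed aesthetics for a layer
unique(layer_data(p_mapped)$colour)  # a default palette colour, not "blue"
unique(layer_data(p_fixed)$colour)   # "blue"
```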


Q. Which variables in mpg are categorical? Which variables are continuous?
(Hint: type ?mpg to read the documentation for the data set.) How can you see this information when you run mpg?

str(mpg)
?mpg

A. Categorical: manufacturer, model, trans, drv, fl, class
A. Continuous: displ, year, cyl, cty, hwy
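
The same split can be reproduced from the column types. This is only a rough proxy, assuming that numeric storage means "continuous": year and cyl are stored as integers even though they take few distinct values. The variable names below are made up for illustration.

```r
library(ggplot2)

# TRUE for numeric (integer or double) columns, FALSE for character columns
is_num <- vapply(mpg, is.numeric, logical(1))

continuous_vars  <- names(mpg)[is_num]
categorical_vars <- names(mpg)[!is_num]

continuous_vars
categorical_vars
```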


Q. Map a continuous variable to color, size, and shape.
Q. How do these aesthetics behave differently for categorical vs. continuous variables?

ggplot(data=mpg)+
  geom_point(mapping = aes(x=displ, y=hwy, color=cty)) # cty: city miles per gallon

A. When mapped to color, a continuous variable gets a gradient: the points shade from dark to light as cty increases, instead of one distinct colour per value.

 

ggplot(data=mpg)+
  geom_point(mapping = aes(x=displ, y=hwy, size=cty))

A. When mapped to size, the point size grows continuously with cty.

 

ggplot(data=mpg)+
  geom_point(mapping = aes(x=displ, y=hwy, shape=cty))

A. Error. A continuous variable cannot be mapped to shape.
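
The error only appears when the plot is built, not when the ggplot object is created. The sketch below captures it with tryCatch() so the message can be inspected; it assumes that ggplot_build() raises the same error as printing the plot, and the names p_shape and result are made up for illustration.

```r
library(ggplot2)

# Creating the plot object succeeds; nothing is evaluated yet
p_shape <- ggplot(data = mpg) +
  geom_point(mapping = aes(x = displ, y = hwy, shape = cty))

# Building the plot selects the scales, which is where the problem surfaces
result <- tryCatch(
  {
    ggplot_build(p_shape)
    "built without error"
  },
  error = function(e) conditionMessage(e)
)
result
```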


Q. What happens if you map the same variable to multiple aesthetics?

ggplot(data=mpg)+
  geom_point(mapping = aes(x=displ, y=hwy, color=hwy, size=displ))

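A. hwy is mapped to both the y axis and the colour intensity; displ is mapped to both the x axis and the point size.
A single variable can be mapped to several aesthetics, but the extra mappings only repeat information that is already shown, so this is best avoided.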


Q. What does the stroke aesthetic do? What shapes does it work with? (Hint: use ?geom_point)

ggplot(data=mpg)+
  geom_point(mapping = aes(x=displ, y=hwy), shape=21, color="yellow", fill="blue", size=3, stroke=3)
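
As far as the ggplot2 documentation describes it, stroke sets the width of a point's border, and the filled shapes 21 to 25 take a separate border colour (color) and interior colour (fill). A minimal sketch comparing a thin and a thick border on the same shape; the stroke values 0.5 and 3 and the names p_thin and p_thick are arbitrary choices for illustration.

```r
library(ggplot2)

# Same filled shape (21) with a thin border
p_thin <- ggplot(data = mpg) +
  geom_point(mapping = aes(x = displ, y = hwy),
             shape = 21, color = "yellow", fill = "blue", size = 3, stroke = 0.5)

# Same filled shape (21) with a thick border
p_thick <- ggplot(data = mpg) +
  geom_point(mapping = aes(x = displ, y = hwy),
             shape = 21, color = "yellow", fill = "blue", size = 3, stroke = 3)

p_thin
p_thick
```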

 


Q. What happens if you map an aesthetic to something other than a variable name, e.g. aes(colour = displ < 5)? Note that you also need to specify x and y.

ggplot(data=mpg)+
  geom_point(mapping=aes(x=displ, y=hwy, color=displ<5))

A. aes() can also be mapped to an expression.
It behaves as if a temporary variable had been added; here the result of displ < 5 is a logical variable that takes TRUE/FALSE values.
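
The "temporary variable" can be made explicit by adding the column by hand and mapping color to it. This is only an equivalent sketch; small_engine, mpg_flagged, and p_flagged are hypothetical names that do not appear in mpg or in the exercise.

```r
library(ggplot2)

# Add the logical column that aes(color = displ < 5) would compute on the fly
mpg_flagged <- mpg
mpg_flagged$small_engine <- mpg_flagged$displ < 5  # hypothetical column name

p_flagged <- ggplot(data = mpg_flagged) +
  geom_point(mapping = aes(x = displ, y = hwy, color = small_engine))
p_flagged
```

The only visible difference from the original plot should be the legend title, which shows the column name instead of the expression.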

 

 

 

 

Source: https://r4ds.had.co.nz/data-visualisation.html
