[datacamp笔记]ggplot2

作者: 琼脂糖 | 来源:发表于2017-12-21 16:39 被阅读91次

[datacamp笔记]ggplot2
时间序列笔记-残差分析
时间序列笔记-双指数模型
时间序列笔记-三指数模型
时间序列笔记-Box-Cox转换
时间序列笔记-auto.arima()自动定阶
时间序列笔记-ETS模型
时间序列笔记-动态回归
时间序列笔记-评估模型准确性
时间序列笔记-朴素预测法

factor
Notice that ggplot2 treats cyl as a factor. This time the x-axis does not contain variables like 5 or 7, only the values that are present in the dataset.
当变量是factor的时候，轴上只出现dataset中的数值。
base

# A scatter plot has been made for you
ggplot(mtcars, aes(x = wt, y = mpg)) +
  geom_point()

# Replace ___ with the correct column
ggplot(mtcars, aes(x = wt, y = mpg, color = disp)+
  geom_point()

# Replace ___ with the correct column
ggplot(mtcars, aes(x = wt, y = mpg, size = disp)) +
  geom_point()

two plot
geom-point + geom-smooth
一层一层往上叠加

color = clarity
将clarity作为factor分组，分别画图。

function

# Inspect the arguments of the mean() function args(mean)

关于函数mean
用法一：mean(x, ...)
用法二：The 'Default S3 method
mean(x, trim = 0, na.rm = FALSE, ...)
省略号 is a way for R to pass arguments along without the function having to name them explicitly.

'''# The linkedin and facebook vectors have already been created for you
linkedin <- c(16, 9, 13, 5, 2, 17, 14)
facebook <- c(17, 7, 5, 16, 8, 13, 14)

Calculate the mean of the sum

avg_sum <- mean(linkedin+facebook)

Calculate the trimmed mean of the sum

avg_sum_trimmed <- mean(linkedin+facebook,trim=0.2)

Inspect both new variables

avg_sum_trimmed
avg_sum
'''

avg_sum_trimmed
[1] 22.6
avg_sum
[1] 22.28571

lapply

1.lapply(X, FUN, ...)
x为向量或list
返回结果为list，长度等于x长度

# The vector pioneers has already been created for you
pioneers <- c("GAUSS:1777", "BAYES:1702", "PASCAL:1623", "PEARSON:1857")

# Split names from birth year
split_math <- strsplit(pioneers, split = ":")
# Convert to lowercase strings: split_low
split_low <- lapply(split_math,tolower)
# Transform: use anonymous function inside lapply

names <- lapply(split_low, function(x){x[1]})

# Transform: use anonymous function inside lapply
years <- lapply(split_low, function(x){x[2]})

# Generic select function
select_el <- function(x, index) {
  x[index]
}
# Use lapply() twice on split_low: names and years
names <- lapply(split_low,select_el,index=1)
years <- lapply(split_low,select_el,index=2)

注意参数index的写法。第一个参数也就是x被直接传入。

lapply(split_low, function(x) {
  if (nchar(x[1]) > 5) {
    return(NULL)
  } else {
    return(x[2])
  }
})

根据条件来选择返回值

sapply

sapply(X,FUN,...)
X为向量或list
返回结果是向量

sapply相比lappy可以更清晰的显示结果，
但如果每列的结果长度不一致，就会选择和lapply一样的方式
identical(freezing_1,freezing_0)

or如果返回NULL，NULL也是list。和lapply相同。因为不是相同长度的vector。which is no longer a vector with the same length as the input

cat("The average temperature is", mean(x), "\n")
identical(x,y)

shell

less file1 file2
:n --remove to next file

网友评论

本文标题：[datacamp笔记]ggplot2

本文链接：https://www.haomeiwen.com/subject/ygzzixtx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

[datacamp笔记]ggplot2

function

Calculate the mean of the sum

Calculate the trimmed mean of the sum

Inspect both new variables

lapply

sapply

shell

相关文章

[datacamp笔记]ggplot2

时间序列笔记-残差分析

时间序列笔记-双指数模型

时间序列笔记-三指数模型

时间序列笔记-Box-Cox转换

时间序列笔记-auto.arima()自动定阶

时间序列笔记-ETS模型

时间序列笔记-动态回归

时间序列笔记-评估模型准确性

时间序列笔记-朴素预测法

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读