data:image/s3,"s3://crabby-images/91569/915690b2b7a1b34df8565f351573bc092b0e7a59" alt=""
一、前言
条形图与柱形图类似,在柱形图的基础上进行了横纵坐标变换。条形图也是最简单的类别比较型图(之一),学术论文中经常使用的条形图主要有2个作用:
- 展示不同类别变量的数值大小;
- 比较不同类别变量之间的大小关系。
1.1 单数据系列条形图
data:image/s3,"s3://crabby-images/5208d/5208dc41b11334c6a045da86082820d4f6d4ef2b" alt=""
data:image/s3,"s3://crabby-images/99373/9937371897e310eace9e933dbb8773120ffe3d6d" alt=""
1.2 多数据系列条形图
data:image/s3,"s3://crabby-images/0ba0c/0ba0c47d9c3b5a62ed3435ed4107ff93260e1cbc" alt=""
data:image/s3,"s3://crabby-images/8e141/8e141d7e7838cfaaf142c2e5fc062f58675671e2" alt=""
1.3 堆积条形图
data:image/s3,"s3://crabby-images/2c59c/2c59cdd6520af09353384b633ff5dabcd4b01bc8" alt=""
data:image/s3,"s3://crabby-images/3b540/3b540283bebba987f91853cf97825162736f8d12" alt=""
1.4 百分比堆积条形图
data:image/s3,"s3://crabby-images/ebb67/ebb67544230c5f359008672a694bc6b8ad99ad7b" alt=""
data:image/s3,"s3://crabby-images/fae48/fae4841bb05bed217976d40467c61e631818f3cf" alt=""
二、R包
本期使用的R包主要有3个。
tidyverse包是一个集成包,包括
- ggplot2包:用于数据可视化;
- dplyr包:用于数据操作;
- tidyr包:用于数据整理;
- readr包:用于数据导入;
- purrr包:用于函数式编程;
- tibble:用于一种新型数据框;
- stringr包:用于字符串;
- forcats包:用于因子。
# load "gWQS" package
library(gWQS)
# load "dlookr" package
library(dlookr)
# load "tidyverse" package
library(tidyverse)
三、演示数据
演示数据集简介:gWQS包中有一个内置数据集,内置数据集的名称叫wqs_data,wqs_data数据集有34种多环芳烃暴露数据、25种邻苯二甲酸酯暴露数据和其他类型数据。
本期仅使用wqs_data数据集的前5种多环芳烃暴露数据和性别。
# PCBs name
PCBs_name <- c("LBX074LA","LBX099LA","LBX105LA","LBX118LA","LBX138LA")
# get the first 5 PCBs exposure data and sex
PCBs <- wqs_data[c(PCBs_name,"sex")]
# get PCBs' absolute value
PCBs[PCBs_name] <- abs(PCBs[PCBs_name])
# view PCBs data
head(PCBs)
data:image/s3,"s3://crabby-images/d8558/d8558eebc9d970edb4f37259f23ce9e38a17ac81" alt=""
四、R语言实现
4.1 单数据系列条形图
PCBs %>%
# calculate describe statistics
describe(statistics=c("mean","sd")) %>%
# draw bar plot
ggplot() +
# geometry layer
geom_bar(aes(x=reorder(described_variables,mean),y=mean),
stat="identity",
# visual channel mapping
fill="blue")+
# coordinate adjustment
scale_x_discrete(name="PCBs") +
scale_y_continuous(name="Concentration of PCBs in urine (mg/kg)") +
coord_flip() +
# theme adjustment
theme_light()
data:image/s3,"s3://crabby-images/3054b/3054b21290dec1cae388a59022c36925879ac0bd" alt=""
4.2 多数据系列条形图
PCBs %>%
# group by sex
group_by(sex) %>%
# calculate describe statistics
describe(statistics=c("mean","sd")) %>%
# draw bar plot
ggplot() +
# geometry layer
geom_bar(aes(x=reorder(described_variables,mean),y=mean,fill=sex),
stat="identity",
# visual channel mapping
position="dodge",
width=0.6) +
# coordinate adjustment
scale_x_discrete(name="PCBs") +
scale_y_continuous(name="Concentration of PCBs in urine (mg/kg)") +
coord_flip() +
# legend adjustment
scale_fill_manual(name="Sex",values=c("red","blue"),labels=c("Male","Female")) +
# theme adustment
theme_light() +
theme(legend.position="top")
data:image/s3,"s3://crabby-images/003b2/003b20d47e4c2febe4ec42992b7729de69455fc6" alt=""
4.3 堆积条形图
PCBs %>%
# group by sex
group_by(sex) %>%
# calculate describe statistics
describe(statistics=c("mean","sd")) %>%
# draw bar plot
ggplot() +
# geometry layer
geom_bar(aes(x=reorder(described_variables,mean),y=mean,fill=sex),
stat="identity",
# visual channel mapping
position="stack",
width=0.6) +
# coordinate adjustment
scale_x_discrete(name="PCBs") +
scale_y_continuous(name="Concentration of PCBs in urine (mg/kg)") +
coord_flip() +
# legend adjustment
scale_fill_manual(name="Sex",values=c("red","blue"),labels=c("Male","Female")) +
# theme adustment
theme_light() +
theme(legend.position="top")
data:image/s3,"s3://crabby-images/d7f87/d7f8768aa874aa9f2ef4555823152dd6c4738145" alt=""
4.4 百分比堆积柱形图
PCBs %>%
# group by sex
group_by(sex) %>%
# calculate describe statistics
describe(statistics=c("mean","sd")) %>%
# draw bar plot
ggplot() +
# geometry layer
geom_bar(aes(x=reorder(described_variables,mean),y=mean,fill=sex),
stat="identity",
# visual channel mapping
position="fill",
width=0.6) +
# coordinate adjustment
scale_x_discrete(name="PCBs") +
scale_y_continuous(name="Concentration of PCBs in urine (mg/kg)") +
coord_flip() +
# legend adjustment
scale_fill_manual(name="Sex",values=c("red","blue"),labels=c("Male","Female")) +
# theme adustment
theme_light() +
theme(legend.position="top")
data:image/s3,"s3://crabby-images/aace4/aace4c37b9401b62cea80d40bf67b6c759e8aa3a" alt=""
五、结果解读
NHANES数据库中多环芳烃的编码与对应名称。
编码 | 多环芳烃 |
---|---|
LBX074LA | PCB74 |
LBX099LA | PCB99 |
LBX105LA | PCB105 |
LBX118LA | PCB118 |
LBX138LA | PCB138 |
- PCB74是人体尿液中含量最高的PCBs,其次是PCB138、PCB105和PCB99,PCB118在人体尿液中的含量最低。
- 除了PCB74,女性尿液中PCBs含量一般高于男性。
本文由mdnice多平台发布
网友评论