data:image/s3,"s3://crabby-images/88a89/88a893b9bbe22cc2f56b2abf3ec77180a993af5a" alt=""
经过分析,由实时计算pisces服务spark streamming 任务日志导致hdfs空间暴增。
data:image/s3,"s3://crabby-images/e2ba8/e2ba832d8dec99d1c9311c8ca0c1fa9e5e6632a2" alt=""
.inprogress 结尾的为未完成任务
hdfs dfs -ls /user/spark/applicationHistory/ | grep application_ | grep -v '.inprogress$'|awk '{print $8}'
hdfs dfs -rm -f $(hdfs dfs -ls /user/spark/applicationHistory/ | grep application_ | grep -v '.inprogress$'|awk '{print $8}')
默认删除入hdfs回收站,所以需要清理/user/hdfs/.Trash/* 内容,才是释放空间。
data:image/s3,"s3://crabby-images/68880/68880135c9fc8a14c50b623a79af6e022a947863" alt=""
crontab -u hdfs -e
0 2 * * * sh hdfs dfs -rm -f $(hdfs dfs -ls /user/spark/applicationHistory/ | grep application_ |awk '{print $8}') && hdfs dfs -rm -r -f /user/hdfs/.Trash/*
此时空间已被正常清理,等待一会hdfs集群服务恢复正常。
data:image/s3,"s3://crabby-images/4fb4a/4fb4ae27e8c5dc24b27d7466b2de7c4b0f196f35" alt=""
data:image/s3,"s3://crabby-images/df357/df357ec06d178374a62fb63d7799fd22230d4e23" alt=""
网友评论