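I. Monitoring current CPU usage
This can be used to check whether a server has gone down: if the value drops below the 0.5 line, raise an alert.
The queries below report per-second CPU usage, averaged over the last 5 minutes.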
//legend - Busy System
sum by (node_exporter)(rate(node_cpu_seconds_total{job='node_exporter',mode ='system'}[5m]))
//legend - Busy User
sum by (node_exporter)(rate(node_cpu_seconds_total{job='node_exporter',mode ='user'}[5m]))
//legend - Busy Iowait
sum by (node_exporter)(rate(node_cpu_seconds_total{job='node_exporter',mode ='iowait'}[5m]))
//legend - Busy IRQs
sum by (node_exporter)(rate(node_cpu_seconds_total{job='node_exporter',mode=~'.*irq'}[5m]))
//legend - Busy Others
sum by (node_exporter) (rate(node_cpu_seconds_total{job = 'node_exporter',mode!='idle',mode!='user',mode!='system',mode!='iowait',mode!='irq',mode!='softirq'}[5m]))
//legend - Idle
sum by (node_exporter)(rate(node_cpu_seconds_total{job='node_exporter',mode ='idle'}[5m]))
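To make the `rate(...[5m])` part of these queries concrete, here is a minimal Python sketch of the underlying arithmetic: the increase of a counter over the window divided by the window length. It is a simplification, not Prometheus's actual implementation, since it ignores counter resets and the extrapolation Prometheus applies at the window boundaries. The `simple_rate` helper and all sample values are hypothetical.

```python
from typing import List, Tuple

# (unix timestamp in seconds, counter value in CPU-seconds)
Sample = Tuple[float, float]


def simple_rate(samples: List[Sample]) -> float:
    """Average per-second increase between the first and last sample.

    Simplified: no counter-reset handling, no boundary extrapolation.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples in the window")
    (t0, v0), (t1, v1) = samples[0], samples[-1]
    if t1 <= t0:
        raise ValueError("samples must be ordered by increasing timestamp")
    return (v1 - v0) / (t1 - t0)


# Hypothetical 5-minute windows for mode="user", scraped every 60 s.
core0 = [(0.0, 1000.0), (60.0, 1012.0), (120.0, 1024.0),
         (180.0, 1036.0), (240.0, 1048.0), (300.0, 1060.0)]
core1 = [(0.0, 500.0), (300.0, 590.0)]

core0_rate = simple_rate(core0)  # 60 CPU-seconds over 300 s
core1_rate = simple_rate(core1)  # 90 CPU-seconds over 300 s

# Analogous to wrapping the per-core rates in sum(...).
busy_user = core0_rate + core1_rate
print(core0_rate, core1_rate, busy_user)
```

A per-core rate of about 0.2 means that core spent roughly 20% of each second in the given mode. Summing across cores, as the queries do, gives a value that is roughly bounded by the total number of cores.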
CPU alert settings:
The thresholds depend on the machine's configuration. For example, raise an alert when the Busy User metric IS ABOVE 240 or IS BELOW 0.5.
![](https://img.haomeiwen.com/i6163304/0f076a9ceb17bab3.png)
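The alert condition described above amounts to a simple range check. The sketch below mirrors it in Python. The function name `should_alert` is hypothetical, and the default thresholds are only the example values from the text, which should be adjusted for each machine.

```python
def should_alert(value: float, lower: float = 0.5, upper: float = 240.0) -> bool:
    """Return True when the metric is below `lower` or above `upper`.

    Defaults are the example thresholds from the text (IS BELOW 0.5,
    IS ABOVE 240); both are assumptions to tune per machine.
    """
    if lower > upper:
        raise ValueError("lower threshold must not exceed upper threshold")
    return value < lower or value > upper


# Hypothetical readings of the Busy User metric.
readings = [0.3, 10.0, 241.0]
alerts = [should_alert(v) for v in readings]
print(alerts)
```

A very low reading is treated as suspicious here because, per the text, it may indicate that the server has stopped reporting normal activity.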
II. Monitoring memory usage
Memory metrics to monitor on the server:
//Total memory on the server
node_memory_MemTotal_bytes{job = 'node_exporter'}
//Memory in use on the server
node_memory_MemTotal_bytes{job = 'node_exporter'} - node_memory_MemFree_bytes{job = 'node_exporter'} - node_memory_Cached_bytes{job = 'node_exporter'} - node_memory_Buffers_bytes{job = 'node_exporter'}
//RAM Cache + Buffer memory on the server
node_memory_Cached_bytes{job = 'node_exporter'} + node_memory_Buffers_bytes{job = 'node_exporter'}
//Free memory on the server
node_memory_MemFree_bytes{job="node_exporter"}
//Swap memory in use on the server
(node_memory_SwapTotal_bytes{job="node_exporter"} - node_memory_SwapFree_bytes{job="node_exporter"})
![](https://img.haomeiwen.com/i6163304/c938b327af5dee4c.png)
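The memory queries above are plain arithmetic over a handful of gauges. As a rough illustration, the Python sketch below applies the same formulas to one set of made-up readings. The `MemInfo` class, the `breakdown` function, and all numbers are hypothetical; only the formulas come from the queries.

```python
from dataclasses import dataclass
from typing import Dict

GIB = 1024 ** 3


@dataclass
class MemInfo:
    """One snapshot of the node_memory_*_bytes gauges used above."""
    mem_total: int
    mem_free: int
    cached: int
    buffers: int
    swap_total: int
    swap_free: int


def breakdown(m: MemInfo) -> Dict[str, int]:
    """Apply the same formulas as the queries in this section."""
    cache_buffer = m.cached + m.buffers
    used = m.mem_total - m.mem_free - m.cached - m.buffers
    swap_used = m.swap_total - m.swap_free
    return {
        "total": m.mem_total,
        "used": used,
        "cache_buffer": cache_buffer,
        "free": m.mem_free,
        "swap_used": swap_used,
    }


# Hypothetical server: 16 GiB RAM, 4 GiB swap.
snapshot = MemInfo(mem_total=16 * GIB, mem_free=2 * GIB, cached=5 * GIB,
                   buffers=1 * GIB, swap_total=4 * GIB, swap_free=3 * GIB)
result = breakdown(snapshot)
for name, value in result.items():
    print(f"{name}: {value / GIB:.1f} GiB")
```

With these definitions, used, cache + buffer, and free always add up to total memory, which is why the three series stack cleanly in a single graph panel.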
To be continued...