每个程序员都应该知道的延迟数字

作者: Yim_ | 来源:发表于2019-02-25 09:23 被阅读0次

每个程序员都应该知道的延迟数字
每个程序员都应该知道的延迟数
如何利用 C# 爬取BigOne交易所的公告！
左耳听风专栏 - 07 | 推荐阅读：每个程序员都该知道的事
坚持输入的第10天
Ruby
程序员需要看看喽
「快学Docker」Docker简介、安装和Hello Worl
Java程序员五年职业生涯——阿里大牛给你的职业建议
Java程序员五年职业生涯——阿里大牛给你的职业建议

动作	时间	换算
L1 缓存访问	0.5 ns	0.5ns
分支预测错误	5 ns	5ns
L2 缓存访问	7 ns	7 ns
互斥锁/解锁	25 ns	25 ns
内存访问	100ns	100ns
使用 Zippy压缩 1KiB	3,000 ns	3 µs
通过 1 Gbps 网络发送 2KiB	20,000 ns	20 µs
SSD 随机读取	150,000 ns	150 µs
内存中连续读取 1 MB	250,000 ns	250 µs
同一个数据中心的来回	500,000 ns	0.5 ms
从 SSD 上连续读取 1 MB*	1,000,000 ns	1 ms
机械磁盘寻道	10,000,000 ns	10 ms
机械磁盘连续读取 1 MB	20,000,000 ns	20 ms
发送数据包加州->荷兰->加州	150,000,000 ns	150 ms

假设～1GB/sec SSD
如果把这些时长都乘以 10 亿的话:

动作	时间	换算
L1 缓存访问	0.5 s	一次心跳 (0.5 s)
分支预测错误	5 s	打个哈欠
L2 缓存访问	7 s	打个长哈欠
互斥锁/解锁	25 s	冲一杯咖啡
内存访问	100 s	刷牙
使用 Zippy压缩 1KiB	50 min	50 min
通过 1 Gbps 网络发送 2KiB	5.5 hr	从午餐到下午工作结束
SSD 随机读取	1.7 days	一个普通的周末
内存中连续读取 1 MB	2.9 days	一个长周末
同一个数据中心的来回	5.8 days	一个普通假期
从 SSD 上连续读取 1 MB*	11.6 days	等快递等了两周
机械磁盘寻道	16.5 weeks	大学里的一个学期
机械磁盘连续读取 1 MB	7.8 months	几乎能造个人了
上面两个加起来	1 year	整整一年
发送数据包加州->荷兰->加州	4.8 years	快能读个博士了

可视化网页： https://people.eecs.berkeley.edu/~rcs/research/interactive_latency.html

这些数字的作用是什么？

了解这些时间的量级有助于比较不同的解决方案。通过这些数字，你可以看出来从远程服务器的内存中读一些数据时比直接从硬盘上读取快的。在一般的程序中，这也就意味着使用磁盘存储比使用数据库服务器要慢，因为数据库通常把所有东西都放在内存里了。而且这也说明了为什么在服务全球客户的时候 CDN 至关重要。从北美到欧洲的一个 ping 就要花上 100+ ms，所以从地理上来说，内容应该是分布式部署的。

The idea is more about knowing the scale of time it takes to help compare different solution. With those number you can see that it's faster to get data in memory from a distant server than to read the same data from disk. In common application that means using disk storage is less efficient that storing it in a database on an other server, since it usually keeps almost everything in memory. It also tells a lot about why CDN are needed if you are serving client worldwide. A ping from North America to Europe will always takes 100+ ms, so having geographically distributed content is a good idea.

对于互斥锁的开启锁定时间 mutex lock/unlock time is important for anyone writing code that depends on code that is accessing the same data structures at the same time. If you don't know that there is a considerable cost to writing such code, now you do; and this number can quantify it.

还需要注意顺序读写和批量读写带来的提速