Problem code
// A plain HashMap shared by all worker threads: this is the source of the problem
Map<String, JSONArray> auditStreamInfoMapping = new HashMap<>();
// Set up the thread pool
Integer totalArraySize = tasks.size();
// Suggested maximum thread count for IO-bound work: CPU cores * 2 + 1
Integer suggestedThreadNum = Runtime.getRuntime().availableProcessors() * 2 + 1;
ExecutorService threadPool = Executors.newFixedThreadPool((totalArraySize < suggestedThreadNum) ? totalArraySize : suggestedThreadNum);
final CountDownLatch countDownLatch = new CountDownLatch(totalArraySize); // one count per task
for (Task task : tasks) {
    threadPool.execute(new Runnable() { // submit the task to the pool
        @Override
        public void run() {
            try {
                auditStreamInfoMapping.put(task.getTask_id(), JSONArray.parseArray(task.getAudit_stream()));
            } catch (Exception e) {
                log.error(e.getMessage());
            } finally {
                countDownLatch.countDown(); // decrement the latch by 1
            }
        }
    });
}
countDownLatch.await(); // block the main thread until the latch reaches 0
threadPool.shutdown(); // shut down the thread pool
Problem analysis
HashMap source code
final V putVal(int hash, K key, V value, boolean onlyIfAbsent,
               boolean evict) {
    Node<K,V>[] tab; Node<K,V> p; int n, i;
    if ((tab = table) == null || (n = tab.length) == 0)
        n = (tab = resize()).length;
    if ((p = tab[i = (n - 1) & hash]) == null) // no hash collision: insert the element directly
        tab[i] = newNode(hash, key, value, null);
    else {
        Node<K,V> e; K k;
        if (p.hash == hash &&
            ((k = p.key) == key || (key != null && key.equals(k))))
            e = p;
        else if (p instanceof TreeNode)
            e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value);
        else {
            for (int binCount = 0; ; ++binCount) {
                if ((e = p.next) == null) {
                    p.next = newNode(hash, key, value, null);
                    if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st
                        treeifyBin(tab, hash);
                    break;
                }
                if (e.hash == hash &&
                    ((k = e.key) == key || (key != null && key.equals(k))))
                    break;
                p = e;
            }
        }
        if (e != null) { // existing mapping for key
            V oldValue = e.value;
            if (!onlyIfAbsent || oldValue == null)
                e.value = value;
            afterNodeAccess(e);
            return oldValue;
        }
    }
    ++modCount;
    if (++size > threshold)
        resize();
    afterNodeInsertion(evict);
    return null;
}
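- This is the main function behind the put operation of HashMap in JDK 1.8. Note line 6 of the code, the check `if ((p = tab[i = (n - 1) & hash]) == null)`: if there is no hash collision, the element is inserted directly.
- Suppose thread A and thread B call put at the same time, their two different entries happen to hash to the same bucket index, and that slot is still null. Both A and B then pass the check on line 6.
- Now suppose thread A is suspended after passing the check but before inserting its data, while thread B runs normally and inserts its node. When thread A gets a CPU time slice again, it does not repeat the hash check, and this is where the problem appears: thread A overwrites the node that thread B just inserted, so B's entry is lost. This is the thread-safety failure.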
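The interleaving described above can be replayed deterministically on a single thread. The following is only a sketch of the check-then-act pattern, not the JDK implementation: `BucketRaceSketch`, its `Node` class, the keys, and the bucket index 5 are all made up for illustration, and the two "threads" are simulated by ordering the statements by hand.

```java
final class BucketRaceSketch {
    /** Minimal stand-in for HashMap.Node (hypothetical, not the JDK class). */
    static final class Node {
        final String key;
        final String value;
        Node next;

        Node(String key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    /** Hypothetical bucket index that both keys are assumed to map to. */
    static final int BUCKET = 5;

    /** Replays the A/B interleaving step by step and returns the resulting table. */
    static Node[] simulateInterleaving() {
        Node[] tab = new Node[16];

        // Step 1: A and B both evaluate the "slot is empty" check before either writes.
        boolean aSawEmpty = tab[BUCKET] == null;
        boolean bSawEmpty = tab[BUCKET] == null;

        // Step 2: A is suspended; B goes ahead and inserts its node.
        if (bSawEmpty) {
            tab[BUCKET] = new Node("keyB", "valueB");
        }

        // Step 3: A resumes and acts on its stale check, replacing B's node
        // instead of linking a new node behind it.
        if (aSawEmpty) {
            tab[BUCKET] = new Node("keyA", "valueA");
        }
        return tab;
    }
}
```

After `simulateInterleaving()` returns, the bucket holds only A's node and its `next` field is null, so B's entry is no longer reachable from the table.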
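Summary
- In JDK 1.7, using HashMap from multiple threads can produce a circular linked list or lose data during a resize.
- In JDK 1.8, using HashMap from multiple threads can cause entries to be overwritten.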
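One common way to avoid both problems is to replace the shared HashMap with a `java.util.concurrent.ConcurrentHashMap`, whose put is safe under concurrent writers. The sketch below mirrors the structure of the problem code under some assumptions: `SafeParallelPut` and `parseAll` are hypothetical names, the input is a plain map from task id to raw string, and `String.split(",")` stands in for `JSONArray.parseArray` so that the example has no external dependencies.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

final class SafeParallelPut {
    /** Parses every raw value on a thread pool and collects the results in a thread-safe map. */
    static Map<String, List<String>> parseAll(Map<String, String> rawById) {
        // ConcurrentHashMap tolerates concurrent put calls (but rejects null keys and values).
        Map<String, List<String>> result = new ConcurrentHashMap<>();
        if (rawById.isEmpty()) {
            return result; // newFixedThreadPool(0) would throw IllegalArgumentException
        }

        // Same sizing rule as the problem code: CPU cores * 2 + 1, capped by the task count.
        int suggested = Runtime.getRuntime().availableProcessors() * 2 + 1;
        ExecutorService pool = Executors.newFixedThreadPool(Math.min(rawById.size(), suggested));
        CountDownLatch latch = new CountDownLatch(rawById.size());
        try {
            for (Map.Entry<String, String> entry : rawById.entrySet()) {
                pool.execute(() -> {
                    try {
                        // Stand-in for JSONArray.parseArray(...): split a comma-separated string.
                        result.put(entry.getKey(), Arrays.asList(entry.getValue().split(",")));
                    } finally {
                        latch.countDown(); // always count down, even if parsing fails
                    }
                });
            }
            latch.await(); // wait until every task has finished
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt status
            throw new IllegalStateException("interrupted while waiting for tasks", e);
        } finally {
            pool.shutdown();
        }
        return result;
    }
}
```

In the original code the equivalent change would be to declare `auditStreamInfoMapping` as a `ConcurrentHashMap`. Wrapping the map with `Collections.synchronizedMap` is another option, at the cost of serializing every access on a single lock.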