Paper title: Importance Estimation for Neural Network Pruning
[link](Importance Estimation for Neural Network Pruning)
Purpose of reading:
- Examine the pruning algorithm in the paper:
(1) Is it weight pruning or neuron pruning?
(2) How are the significant units selected? Also note the experimental settings and results, mainly the impact on inference time.
Reading notes:
- Abstract
- Neuron (filter) pruning
- Uses first- and second-order Taylor expansions to approximate a filter's contribution
- Challenging a common belief
Conventional view: "Many of them rely on the belief that the magnitude of a weight and its importance are strongly correlated."
The authors' challenge: "We question this belief and observe a significant gap in correlation between weight-based pruning decisions and empirically optimal one-step decisions – a gap which our greedy criterion aims to fill."
Proposed criterion:
We define the importance as the squared change in loss induced by removing a specific filter from the network.
Difficulty in applying the new criterion:
computing the exact importance is extremely expensive for large networks
Solution:
approximate it with a Taylor expansion, resulting in a criterion computed from parameter gradients readily available during standard training
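A minimal LaTeX sketch of the idea above, in my own notation rather than the paper's (the symbols $\mathcal{L}$, $\mathbf{W}$, $w_m$, and $g_m$ are assumptions):

```latex
\documentclass{article}
\usepackage{amsmath}
\begin{document}
% Importance of parameter m: squared change in loss when it is zeroed out,
% approximated to first order by the (gradient x weight) product that is
% already available during standard training.
\begin{align}
  \mathcal{I}_m
    &= \bigl(\mathcal{L}(\mathbf{W}) - \mathcal{L}(\mathbf{W} \mid w_m = 0)\bigr)^2
    \approx \bigl(g_m \, w_m\bigr)^2,
  \qquad g_m = \frac{\partial \mathcal{L}}{\partial w_m}
\end{align}
\end{document}
```

A filter's importance is obtained by aggregating the contributions of its weights; the second-order variant adds a Hessian term to the expansion.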
Algorithm
The input is a trained network; prune it, then retrain with a small learning rate.
(1) For each minibatch, we compute parameter gradients and update network weights by gradient descent.
We also compute the importance of each neuron (or filter) using the gradient averaged over the minibatch (introduced in the original paper; see Equations 7 and 8 in the figure below).
(Figure: Equations 7 and 8 from the paper)
(2) After a predefined number of minibatches, we average the importance score of each neuron (or filter) over the minibatches, and remove the N neurons with the smallest importance scores (see the sketch after this list).
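As a concrete illustration of steps (1) and (2), here is a minimal PyTorch-style sketch, not the authors' implementation. The function `taylor_importance_prune`, its default arguments, and the restriction to Conv2d filters are assumptions for illustration; the per-filter score uses the squared sum of gradient times weight over a filter's parameters, one form of the first-order Taylor criterion, and actually removing the selected filters is left out.

```python
import torch
import torch.nn as nn

def taylor_importance_prune(model, dataloader, loss_fn, optimizer,
                            num_minibatches=100, neurons_to_remove=16):
    """Rank Conv2d filters by a first-order Taylor importance estimate (sketch)."""
    scores = {}  # layer name -> running per-filter importance (1-D tensor)
    for step, (inputs, targets) in enumerate(dataloader):
        if step >= num_minibatches:
            break
        optimizer.zero_grad()
        loss = loss_fn(model(inputs), targets)
        loss.backward()                      # gradients averaged over the minibatch

        with torch.no_grad():
            for name, module in model.named_modules():
                if isinstance(module, nn.Conv2d) and module.weight.grad is not None:
                    w, g = module.weight, module.weight.grad
                    # Squared sum of (gradient * weight) over each output filter.
                    filter_scores = (g * w).sum(dim=(1, 2, 3)).pow(2)
                    scores[name] = scores.get(name, 0.0) + filter_scores

        optimizer.step()                     # the usual gradient-descent update

    # Average the accumulated scores over the minibatches, then rank all filters.
    ranked = sorted(
        ((s / num_minibatches).item(), name, idx)
        for name, per_layer in scores.items()
        for idx, s in enumerate(per_layer)
    )
    # Filters with the smallest averaged importance are the pruning candidates;
    # zeroing them out (and repairing downstream shapes) is not shown here.
    return ranked[:neurons_to_remove]
```

After pruning, the network is fine-tuned with a small learning rate, as noted above.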
Experimental results
Compares the number of neurons pruned vs. the resulting loss.
In the supplementary material, the authors evaluate the inference speed of the pruned networks.
- Pruning reduces inference time, especially at larger batch sizes.
- Pruning skip connections gives a larger time reduction than pruning all layers.
(Figure: inference time experiment results)