During training, each feature sequence is zero-padded to the length of the longest feature sequence in its batch.
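After feature extraction, the feature length differs from one utterance to the next. Among the 8 utterances in this batch, the longest feature sequence has length 805,
i.e. max_feature_length = 805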
When the features are placed into inputs, every sequence shorter than max_feature_length is padded with zeros up to that length.
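The padding step described above can be sketched in NumPy as follows. This is only an illustrative sketch, not the original code: the helper name pad_batch, the individual utterance lengths, and feature_size = 26 are assumptions made for the example. Only the batch size of 8 and the maximum length of 805 come from the text.

```python
import numpy as np


def pad_batch(features):
    """Zero-pad a list of [length, feature_size] arrays to the longest length.

    Hypothetical helper for illustration; returns the padded array and the
    maximum feature length found in the batch.
    """
    max_feature_length = max(f.shape[0] for f in features)
    feature_size = features[0].shape[1]
    # Start from an all-zero array so untouched positions are the padding.
    inputs = np.zeros((len(features), max_feature_length, feature_size),
                      dtype=np.float32)
    for i, f in enumerate(features):
        inputs[i, :f.shape[0], :] = f
    return inputs, max_feature_length


# Assumed lengths for 8 utterances; only the maximum (805) is from the text.
lengths = [512, 640, 805, 300, 721, 455, 598, 777]
feature_size = 26  # assumed feature dimension
rng = np.random.default_rng(0)
features = [rng.standard_normal((n, feature_size)).astype(np.float32)
            for n in lengths]

inputs, max_feature_length = pad_batch(features)
print(max_feature_length)  # 805
print(inputs.shape)        # (8, 805, 26)
```

Rows beyond an utterance's true length stay at zero, so for example inputs[3, 300:, :] contains only padding.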
The inputs for one batch therefore end up with the shape
[batch_size, seq_size, feature_size]
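seq_size differs from batch to batch, because each batch is padded only to the length of its own longest feature sequence.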
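To make the per-batch seq_size concrete, here is a small sketch that pads two batches independently and compares their shapes. The helper pad_to_longest, the utterance lengths, batch_size = 4, and feature_size = 26 are all assumptions chosen for the example, not values from the original post.

```python
import numpy as np


def pad_to_longest(features):
    """Zero-pad [length, feature_size] arrays to the longest one in this batch."""
    seq_size = max(len(f) for f in features)
    inputs = np.zeros((len(features), seq_size, features[0].shape[1]),
                      dtype=np.float32)
    for i, f in enumerate(features):
        inputs[i, :len(f)] = f
    return inputs


feature_size = 26  # assumed feature dimension
batch_size = 4     # assumed batch size for this example
all_lengths = [805, 512, 640, 300, 410, 388, 295, 350]  # hypothetical lengths
utterances = [np.ones((n, feature_size), dtype=np.float32)
              for n in all_lengths]

# Pad each batch on its own, so seq_size follows that batch's longest utterance.
batches = [pad_to_longest(utterances[i:i + batch_size])
           for i in range(0, len(utterances), batch_size)]
shapes = [b.shape for b in batches]
print(shapes)  # [(4, 805, 26), (4, 410, 26)]
```

Padding per batch rather than to a global maximum keeps batches of short utterances small, at the cost of a seq_size that changes between batches.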
Article title: Zero-padding features in a batch
Article link: https://www.haomeiwen.com/subject/zxqukltx.html