Common Terms in DL Scope
This blog records common terms used in deep learning. (Continuously updated.) Model Level se
Batch Normalization
Advantages of Batch Normalization
Consider a network that computes:
l = F_2(F_1(u, \Theta_1), \Theta_2) \tag{1}
Learning \Theta_2
Layer Normalization
Background
Problems with using Batch Normalization: The effect of batch normalization is dependent on the mini-ba