#deeplearning #fastai #initialization #kaiming #lesson_8
Recent deep CNNs are mostly initialized by random weights drawn from Gaussian distributions [16]. With fixed standard deviations (e.g., 0.01 in [16]), very deep models (e.g., >8 conv layers) have difficulties to converge
If you want to change selection, open document below and click on "Move attachment"
pdf
owner:
ronaldokun - (no access) - Delving into Rectifiers.pdf, p3
Summary
status | not read | | reprioritisations | |
---|
last reprioritisation on | | | suggested re-reading day | |
---|
started reading on | | | finished reading on | |
---|
Details