MLeasyTraining instabilities~10m
Vanishing Gradient Problem
Problem
Explain what causes the vanishing gradient problem in deep networks and why sigmoid activation functions are particularly prone to causing it.
Reference solution
Reference solution available after you attempt the question.
Ready to solve it?
Start a session on Mockbit #88. You'll get graded with specific critique when you submit.
Related ML questions
← Back homemockbit.io/q/88