Mockbit/#88
ML | easy | Training instabilities | ~10m

Vanishing Gradient Problem

Problem

Explain what causes the vanishing gradient problem in deep networks and why sigmoid activation functions are particularly prone to causing it.
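One way to make the question concrete is the small numeric sketch below. It is not the reference solution; it is an illustration under simplifying assumptions: a chain of scalar sigmoid layers, every weight fixed at 1.0, and every pre-activation at 0, where the sigmoid derivative reaches its maximum of 0.25. The function names (`sigmoid_grad`, `max_gradient_scale`) are hypothetical and chosen only for this example.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic sigmoid."""
    return 1.0 / (1.0 + math.exp(-x))


def sigmoid_grad(x: float) -> float:
    """Derivative of the sigmoid: s(x) * (1 - s(x)), at most 0.25 (at x = 0)."""
    s = sigmoid(x)
    return s * (1.0 - s)


def max_gradient_scale(num_layers: int, weight: float = 1.0) -> float:
    """Best-case factor by which a gradient is scaled after passing backward
    through `num_layers` scalar sigmoid layers (assumption: one shared weight,
    every pre-activation at 0, where the derivative is largest)."""
    scale = 1.0
    for _ in range(num_layers):
        # Chain rule: each layer contributes weight * sigmoid'(pre-activation).
        scale *= weight * sigmoid_grad(0.0)
    return scale


if __name__ == "__main__":
    for depth in (1, 5, 10, 20):
        print(f"depth={depth:2d}  gradient scale <= {max_gradient_scale(depth):.3e}")
    # Saturation makes it worse: far from 0 the derivative is much smaller.
    print(f"sigmoid'(5.0) = {sigmoid_grad(5.0):.5f}")
```

Even in this best case the factor shrinks geometrically with depth, and saturated units (large positive or negative inputs) shrink it further. A full answer would also discuss how the weight magnitudes enter the same product.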


