The above pseudocode to implement grad check is taken directly from the lab assignment. According to my understanding, in the above for each set of parameters, we increase/decrease them by epsilon, and the compute the cost. Then compute gradapprox. Question 1: In step 2 why do we only update weights
⚡
Key Insights
10 editorial insights.
AiFeed24 Team·⏱ 1 min read·News
Deep Analysis
Multi-Source Intelligence
Found this useful? Share it!


