
When we calculate the gradient with respect to each parameter, we treat all the other parameters as constant. But the moment any of those other parameters changes, shouldn't all the previously computed changes become invalid?
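To state what I mean more precisely (the notation is mine, just for illustration): for a loss L(θ₁, …, θₙ), every partial derivative is evaluated at the same current point, and only then are all the coordinates nudged together.

```latex
% Standard (simultaneous) gradient descent step:
% every partial is evaluated at the SAME current point theta^(t),
% then all coordinates are updated at once.
\theta_i^{(t+1)} = \theta_i^{(t)} - \eta \,
\frac{\partial L}{\partial \theta_i}\!\left(\theta_1^{(t)}, \dots, \theta_n^{(t)}\right),
\qquad i = 1, \dots, n
```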

Edit: In the usual gradient descent we treat all the weights and biases of all the layers as the input parameters of the loss function we are trying to minimize. In doing so, we inherently assume that these inputs are independent of each other, which in a multilayer NN is not true: the parameters of a layer are influenced by the layers before it. That is why the parameter updates should at least be done on a layer-by-layer basis, i.e. although inefficient, the correct method should be: if we make a nudge in the parameters of one layer, what would then be the best parameter values for the next layer?
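To make the contrast concrete, here is a minimal sketch (plain NumPy, a made-up two-layer network, and numerical gradients just for illustration) of the two update schemes I am comparing: the usual simultaneous step versus the layer-by-layer scheme I describe above.

```python
import numpy as np

rng = np.random.default_rng(0)

# Made-up tiny 2-layer network and data, only to illustrate the two schemes.
X = rng.normal(size=(16, 3))
y = rng.normal(size=(16, 1))
W1, W2 = rng.normal(size=(3, 4)), rng.normal(size=(4, 1))

def loss(W1, W2):
    h = np.tanh(X @ W1)                  # layer 1
    return np.mean((h @ W2 - y) ** 2)    # layer 2 + MSE

def num_grad(f, W, eps=1e-5):
    """Numerical gradient of f with respect to W (illustration only)."""
    g = np.zeros_like(W)
    for idx in np.ndindex(W.shape):
        Wp, Wm = W.copy(), W.copy()
        Wp[idx] += eps
        Wm[idx] -= eps
        g[idx] = (f(Wp) - f(Wm)) / (2 * eps)
    return g

lr = 0.1

# (a) Usual gradient descent: both gradients are taken at the SAME point,
#     then both layers are updated together.
g1 = num_grad(lambda W: loss(W, W2), W1)
g2 = num_grad(lambda W: loss(W1, W), W2)
W1_sim, W2_sim = W1 - lr * g1, W2 - lr * g2

# (b) The layer-by-layer scheme I describe: first nudge layer 1, then
#     compute layer 2's gradient at the ALREADY-UPDATED layer 1.
W1_seq = W1 - lr * num_grad(lambda W: loss(W, W2), W1)
W2_seq = W2 - lr * num_grad(lambda W: loss(W1_seq, W), W2)

print("simultaneous update loss:  ", loss(W1_sim, W2_sim))
print("layer-by-layer update loss:", loss(W1_seq, W2_seq))
```

My question is whether scheme (a) is in some sense wrong because it ignores that the best nudge for W2 depends on where W1 ends up, which scheme (b) at least partly accounts for.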

Am I missing or misinterpreting something? How is gradient descent able to take the interdependence of parameters across layers into account on its own, given that we never explicitly specify how the parameters of different layers depend on each other?
