After update the gradients, model output all zeros for whatever inputs?

TempKonduitUser1 · November 27, 2022, 10:27am

I have some training code as the following:

Pair<Gradient, INDArray> pair = trainableModel.calculateGradients(state0BatchProcessed, targets, null, masks);
trainableModel.setGradient(pair.getKey());
trainableModel.update(pair.getKey());

but after one step of the above code, run some code as the following:

INDArray qValues = trainableModel.output(state0BatchProcessed);

qValues are all zeros. And for the second round of training step, all gradients become zeros.
What is the cause? How to fix it?

Thanks in advance!

Topic		Replies	Views
How to use the result of calculateGradients api to update the network? DL4J	3	257	November 28, 2022
Cannot update a model using Gradient if it hasn't had a computeGradientAndScore() called on it DL4J	4	494	October 5, 2020
Custom Loss Function and Gradient DL4J	1	268	August 3, 2023
Accessing values of variables after training DL4J	3	210	May 17, 2023
Why is my model only predicting 0 and never 1? DL4J	2	367	June 11, 2021

After update the gradients, model output all zeros for whatever inputs?

Related topics