Arbiter: Cannot perform evaluation with NaNs present in predictions

SidneyLann · November 17, 2020, 6:27am

arbiter trainning throw below exception in some epochs, but arbiter trainning can be done successfully. this exception is normal?

java.lang.IllegalStateException: Cannot perform evaluation with NaNs present in predictions: 24576 NaNs present in predictions INDArray
at org.nd4j.common.base.Preconditions.throwStateEx(Preconditions.java:641)
at org.nd4j.common.base.Preconditions.checkState(Preconditions.java:286)
at org.nd4j.evaluation.classification.Evaluation.eval(Evaluation.java:403)
at org.deeplearning4j.nn.graph.ComputationGraph.doEvaluationHelper(ComputationGraph.java:4192)
at org.deeplearning4j.nn.graph.ComputationGraph.doEvaluationHelper(ComputationGraph.java:4131)
at org.deeplearning4j.nn.graph.ComputationGraph.doEvaluation(ComputationGraph.java:4089)
at org.deeplearning4j.nn.graph.ComputationGraph.evaluate(ComputationGraph.java:3938)
at org.deeplearning4j.nn.graph.ComputationGraph.evaluate(ComputationGraph.java:3900)
at org.deeplearning4j.nn.graph.ComputationGraph.evaluate(ComputationGraph.java:3878)
at org.deeplearning4j.arbiter.scoring.impl.EvaluationScoreFunction.score(EvaluationScoreFunction.java:78)
at org.deeplearning4j.arbiter.scoring.impl.BaseNetScoreFunction.score(BaseNetScoreFunction.java:79)
at org.deeplearning4j.arbiter.scoring.impl.BaseNetScoreFunction.score(BaseNetScoreFunction.java:59)
at org.deeplearning4j.arbiter.task.ComputationGraphTaskCreator$GraphLearningTask.callHelper(ComputationGraphTaskCreator.java:239)
at org.deeplearning4j.arbiter.task.ComputationGraphTaskCreator$GraphLearningTask.call(ComputationGraphTaskCreator.java:133)
at org.deeplearning4j.arbiter.task.ComputationGraphTaskCreator$GraphLearningTask.call(ComputationGraphTaskCreator.java:88)
at org.nd4j.shade.guava.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly(TrustedListenableFutureTask.java:125)
at org.nd4j.shade.guava.util.concurrent.InterruptibleTask.run(InterruptibleTask.java:69)
at org.nd4j.shade.guava.util.concurrent.TrustedListenableFutureTask.run(TrustedListenableFutureTask.java:78)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)

SidneyLann · November 19, 2020, 6:47am

Only tanh can be used for LSTM, Other activation function should cause above exception.

agibsonccc · November 22, 2020, 4:25am

@SidneyLann I’m not quite convinced it’s a simple “yes/no this is an absolute rule” you can use a a few different activation functions with LSTMs.

If you see NANs that’s generally just from an unstable network though. This has the same trade offs as training any neural network.

SidneyLann · November 22, 2020, 8:58am

It has no error in trainning stage, just in validating stage has error.

Topic		Replies	Views
NaNs present in prediction DL4J	3	1411	March 23, 2021
NaN on arm server DL4J	20	1467	December 28, 2020
Error msg in LSTM RNN DL4J	2	44	July 10, 2024
Help for "Sequence lengths do not match for RnnOutputLayer input and labels:Arrays should be rank 3 with shape [minibatch, size, sequenceLength] - mismatch on dimension 2 (sequence length) - input=[32, 15, 20] vs. label=[32, 5, 20]"	2	35	February 23, 2025
Upgraded from 1.0.0-beta7 to 1.0.0-M1 - Getting Exceptions in code that worked prior DL4J	6	379	June 5, 2021

Arbiter: Cannot perform evaluation with NaNs present in predictions

Related topics