Restrict the set of possible predictions

Hi all,

I trained an MNIST model with DL4J.
When I use this model in inference mode:
INDArray prediction = myModel.output(myINDArrayImage);
it gives me a prediction in an INDArray, and it works properly. The size of this INDArray is equal to the number of outputs on my model's OutputLayer (softmax activation).

Is there a way to ask the model to predict among only a certain set of numbers? Or do I have to change my model?

Thanks for your responses! :slight_smile:

As @eduardo has already said, you should multiply your output with a mask.

A mask is an array that contains either a 1 or a 0 for every possible output class. So if you have 10 possible output classes and you want to keep only the results for classes 0, 1, 3, 5 and 7, it looks like this: [1, 1, 0, 1, 0, 1, 0, 1, 0, 0].

Again, as already said in the Stack Overflow answer, you should then use argMax to get the index of the class with the highest probability among the remaining classes.
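The mask-then-argMax step can be sketched in plain Java (the class probabilities below are made up for illustration; with DL4J you could apply the same idea directly to the INDArray returned by model.output(...), e.g. with its mul and argMax operations):

```java
public class MaskedArgMax {
    // Zero out the probabilities of the excluded classes, then take the
    // argmax of whatever remains.
    static int maskedArgMax(double[] probs, double[] mask) {
        int best = -1;
        double bestProb = Double.NEGATIVE_INFINITY;
        for (int i = 0; i < probs.length; i++) {
            double masked = probs[i] * mask[i];
            if (masked > bestProb) {
                bestProb = masked;
                best = i;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Hypothetical softmax output of a 10-class MNIST model.
        double[] probs = {0.01, 0.02, 0.05, 0.30, 0.40, 0.05, 0.02, 0.10, 0.03, 0.02};
        double[] mask  = {0, 0, 0, 1, 0, 0, 0, 1, 0, 0}; // keep only classes 3 and 7
        System.out.println(maskedArgMax(probs, mask)); // prints 3
    }
}
```

Note that class 4 has the highest raw probability, but the mask removes it from consideration, so class 3 wins among the allowed classes.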

Thanks for your response, but I still don't understand :confused:
My model uses a softmax output activation, so in my response I get, for example:
I don't understand how a mask can tell the model "I know the result is 3 or 7, give me a prediction on those values", because my model can return 4 ([0,0,0,0,1,0,0,0,0,0])
and I want a prediction only on 3 or 7 ([0,0,0,1,0,0,0,1,0,0])

If my model gives me that response:
a mask with
will return [0,0,0,0,0,0,0,0,0,0], no?

Usually a model doesn't return responses like that. It is called softmax because it isn't an actual max, so you will not get all-zero results.

But if you absolutely want to do it some other way, you can retrain your full model with an additional bias input, i.e. for each training image you add an extra input vector that uses a multi-hot encoding to denote which classes the current image could be. Make sure you aren't just training with a one-hot encoding there, otherwise your model will learn to ignore the actual picture and just use the answer you are giving it.
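As a sketch of what such a combined input could look like (plain Java; all names here are hypothetical, not actual DL4J API): flatten the image and append a multi-hot vector marking the candidate classes.

```java
public class MultiHotInput {
    // Build one input row: the flattened image pixels followed by a
    // multi-hot vector with a 1 at every class the image could be.
    static double[] withCandidates(double[] imagePixels, int numClasses, int... candidates) {
        double[] combined = new double[imagePixels.length + numClasses];
        System.arraycopy(imagePixels, 0, combined, 0, imagePixels.length);
        for (int c : candidates) {
            combined[imagePixels.length + c] = 1.0;
        }
        return combined;
    }

    public static void main(String[] args) {
        double[] pixels = new double[784];              // a flattened 28x28 MNIST image
        double[] in = withCandidates(pixels, 10, 3, 7); // "the answer is 3 or 7"
        System.out.println(in.length);                  // prints 794
    }
}
```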

As I said, my model uses softmax, so I get only a single 1 and 0 for every other value.

When I make a prediction, my model looks at my input and gives me the best solution among the ten outputs (softmax). I want to tell it:
don't look at all ten solutions, but only 2 of them, for example. I think it's not possible…

I think I have to make 10 models, one for each digit, and call the relevant model for a prediction that can answer "true or false".
Another solution is to use another activation function, like sigmoid, to get a probability like 80% / 20%, but I haven't managed to do it yet.
Thanks anyway for taking the time to consider my problem.

You don’t understand how softmax works. Unless your model is 100% sure that an input is a specific class (which basically never happens), you will never get exactly 0 or exactly 1.

Anyway, if you are fine with creating multiple models, you can also use something different from softmax as your final activation function. Sigmoid will work well enough, but you have to use a different loss function, one that doesn’t expect the output to be a probability distribution.
In that case you can imagine it like 11 models working together: first one model extracts all the necessary features, and then 10 simpler models use those features as input.
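The difference to softmax can be sketched in plain Java: sigmoid squashes each logit independently, so every output is its own stand-alone probability and the values do not have to sum to 1 (the logits below are made up for illustration):

```java
public class SigmoidHeads {
    // Sigmoid maps any real number into (0, 1), independently per output.
    static double sigmoid(double x) {
        return 1.0 / (1.0 + Math.exp(-x));
    }

    public static void main(String[] args) {
        double[] logits = {-2.0, 0.5, 3.0}; // hypothetical pre-activation values
        for (double z : logits) {
            System.out.printf("%.4f%n", sigmoid(z)); // one independent probability each
        }
    }
}
```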

I think I know how softmax works…
My model predicts the correct answer 97% of the time.

In this case, the correct result is the first index, but it gives me a 1 on another value…

Mask out the 1 and sum the result; it will still be more than 0. So selecting from the other outputs should still work for you.

That response is an exception; most of the time I get a single 1 and 0 for all the other values.

I just want to know if there is a way to tell a model that was trained with 10 outputs:
"Give me the best result among 4 of the outputs, for example."

Sorry for my English; I can't explain what I want to do. The mask is not the solution I'm looking for, because I only get one possible solution from my prediction.

OK, I understand: I'm using a loss function that is too restrictive for my model. I have to change it. Thanks!

If you are getting exactly 0 on those outputs (i.e. not just values that are rounded to 0 by the .toString() call, which prints at limited precision), then you are getting huge inputs into your softmax, which forces the values into underflow, and you should probably check your model.

Otherwise the function that defines the softmax cannot return anything that is exactly 0: softmax(z)_i = exp(z_i) / Σ_j exp(z_j), and exp(x) > 0 for every finite x.
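To make that concrete, here is a plain-Java sketch of a numerically stable softmax (subtracting the maximum logit before exponentiating, a standard trick): every output is a positive exponential divided by a positive sum, so it can only reach exactly 0 through floating-point underflow.

```java
public class Softmax {
    static double[] softmax(double[] logits) {
        // Subtract the max logit for numerical stability; this does not
        // change the result, because it cancels in the ratio.
        double max = Double.NEGATIVE_INFINITY;
        for (double z : logits) max = Math.max(max, z);

        double sum = 0.0;
        double[] out = new double[logits.length];
        for (int i = 0; i < logits.length; i++) {
            out[i] = Math.exp(logits[i] - max); // > 0 unless the difference underflows
            sum += out[i];
        }
        for (int i = 0; i < out.length; i++) out[i] /= sum;
        return out;
    }

    public static void main(String[] args) {
        for (double v : softmax(new double[]{1.0, 2.0, 3.0})) {
            System.out.printf("%.4f ", v); // all strictly positive, summing to 1
        }
    }
}
```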

I’ve tried it again with a heavily overfit model on data that it has seen thousands of times, and even then I cannot force softmax to return something that resembles your output. It is always something like this: [ 8.7626e-8, 5.6123e-7, 0.9999, ... 1.9215e-5, 3.3234e-5, 4.4047e-8]

I suggest you check your model as well.

Which loss function do you use?

That was with NEGATIVELOGLIKELIHOOD (which is identical to MCXENT).

Hmm, I use the same one :confused:

Post your network configuration, and I might be able to tell you where things can go wrong.

OK, I was using a ReLU rather than a sigmoid before the last FC layer. Now I'm able to put a mask on the prediction, because I no longer get only 1-or-0 results :slight_smile: Thanks!