Shape of weight

sameDiff.loss.softmaxCrossEntropy("loss", label=[8,128,6], out=[8,128,6], weight=[?,?])

What should the shape of the weight be here?

If output = [miniBatch, sentenceLength, positionCharLabels], what should the shape of the weights be then?

@SidneyLann generally it’s 1 weight per class output. Weights are also broadcastable.

But the miniBatch size is dynamic and the weights are static, so how can I weight the sentences?

@SidneyLann so you want per-example weights. We only support output-oriented weights, which you can find here:

You can approximate this with output-based weighting plus resampling, which we do support; in that case the weights would be the same shape as the output.

weights = [128, 6] and loss = [8, 128] → does not work
weights = [6, 8] and loss = [8, 128] → does not work
weights = [1024] and loss = [1024] → works

So must the weights be a scalar or rank 1?

@SidneyLann are you saying weights that don't have the same shape as the output? No, they don't have to be scalar. I gave you the source code there, and you can see we support different shapes. It sounds like the labels are the wrong shape and the weights are miscalculated. Could you give me something end to end where you actually try what I suggested, with the weights as a placeholder?

sameDiff.loss.softmaxCrossEntropy("loss", label=[8,128,6], out=[miniBatch, sentenceLength, positionCharLabels]=[8,128,6], weight=[128,6])
===>
shapes of weights and loss arrays should be broadcastable, but got weights = [128, 6] and loss = [8, 128] instead!

sameDiff.loss.softmaxCrossEntropy("loss", label=[8,128,6], out=[8,128,6], weight=[6, 8])
===>
shapes of weights and loss arrays should be broadcastable, but got weights = [6, 8] and loss = [8, 128] instead!

sameDiff.loss.softmaxCrossEntropy("loss", label=[1024,6], out=[1024,6], weight=[1024])
===> NO ERROR
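
From these errors it looks like the loss array drops the class dimension (6), so it has shape [miniBatch, sentenceLength] = [8, 128], and the weights have to broadcast against that, not against the full output. A minimal sketch of the flattened case that runs without error, assuming the usual placeholder API (dummy variable names; shapes taken from the errors above):

import org.nd4j.autodiff.samediff.SDVariable;
import org.nd4j.autodiff.samediff.SameDiff;
import org.nd4j.linalg.api.buffer.DataType;

SameDiff sd = SameDiff.create();

// Flattened case from above: the loss array has shape [1024], so a [1024]
// weight vector broadcasts against it element for element.
SDVariable labels  = sd.placeHolder("labels",  DataType.FLOAT, 1024, 6);
SDVariable logits  = sd.placeHolder("logits",  DataType.FLOAT, 1024, 6);  // stand-in for the network output
SDVariable weights = sd.placeHolder("weights", DataType.FLOAT, 1024);

// The failing shapes above ([128, 6] and [6, 8]) do not broadcast against the
// [8, 128] loss array, which is why they are rejected.
SDVariable loss = sd.loss.softmaxCrossEntropy("loss", labels, logits, weights);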

So the weights must be set based on the miniBatch. How can I set them independently of the miniBatch? I want static weights, not dynamic ones.

I can weight a fixed number of sentences, but I can't weight N sentences where N is dynamic.

@SidneyLann could you clarify? The one-sentence replies don't really help me much. I told you a way of passing in dynamic weights, and I don't have any indication that you actually tried it.
You use weights as placeholders alongside inputs and labels, just like you would for training.
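
A minimal sketch of what I mean, using the shapes from your example (the placeholder names are just for illustration, and the exact overloads may differ slightly between versions):

import java.util.HashMap;
import java.util.Map;

import org.nd4j.autodiff.samediff.SDVariable;
import org.nd4j.autodiff.samediff.SameDiff;
import org.nd4j.linalg.api.buffer.DataType;
import org.nd4j.linalg.api.ndarray.INDArray;
import org.nd4j.linalg.factory.Nd4j;

SameDiff sd = SameDiff.create();

// -1 marks the dynamic miniBatch dimension
SDVariable labels  = sd.placeHolder("labels",  DataType.FLOAT, -1, 128, 6);
SDVariable logits  = sd.placeHolder("logits",  DataType.FLOAT, -1, 128, 6);  // stand-in for the network output
SDVariable weights = sd.placeHolder("weights", DataType.FLOAT, -1, 128);     // one weight per char position, per example

SDVariable loss = sd.loss.softmaxCrossEntropy("loss", labels, logits, weights);

// Feed the weights alongside the inputs and labels, like any other placeholder.
Map<String, INDArray> feed = new HashMap<>();
feed.put("labels",  Nd4j.rand(DataType.FLOAT, 8, 128, 6));  // dummy arrays, shapes only
feed.put("logits",  Nd4j.rand(DataType.FLOAT, 8, 128, 6));
feed.put("weights", Nd4j.ones(DataType.FLOAT, 8, 128));     // e.g. a static per-position weight row repeated for each example
sd.output(feed, "loss");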


sameDiff.loss.softmaxCrossEntropy("loss", label=[8,128,6], out=[8,128,6], weight=[8, 128])

This line can be run in SNAPSHOT now. Thanks.

But this should weight every char in a sentence (128), not the labels (6). Am I right?

@SidneyLann if the labels are the characters then yes that should be fine.

The labels are not the characters; each char has 6 possible labels. I want to weight the labels. Should the settings below work?
label=[8,128,6], out=[8,128,6], weight=[8, 128]