passing the dataset to the fit method of the MultiLayerNetwork class

Arasaka · April 26, 2023, 7:01am

Hello everyone, I am forming a dataset from INDArray 3 ranks, the dataset is formed without errors. But when passing it to the fit method of the MultiLayerNetwork class, an error occurs: Rank is [3]; columns() call is not valid. What could be the problem?

agibsonccc · April 26, 2023, 7:06am

@Arasaka can you pass more context? You are probably using columns() somewhere. That shortcut is only available for matrices. (Hence the name columns()). If you want the last dimension you can also use:
arr.size(-1)

Arasaka · April 26, 2023, 7:09am

@agibsonccc thanks for the answer. But the worst part is that I don’t use the columns() method anywhere. Is it possible to pass an INDArray of rank 3 to the fit method? Here is my stack trace:
Exception in thread “main” java.lang.IllegalStateException: Rank is [3]; columns() call is not valid
at org.nd4j.linalg.api.ndarray.BaseNDArray.columns(BaseNDArray.java:4054)
at org.deeplearning4j.nn.layers.feedforward.embedding.EmbeddingLayer.preOutput(EmbeddingLayer.java:85)
at org.deeplearning4j.nn.layers.feedforward.embedding.EmbeddingLayer.activate(EmbeddingLayer.java:126)
at org.deeplearning4j.nn.layers.AbstractLayer.activate(AbstractLayer.java:262)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.ffToLayerActivationsInWs(MultiLayerNetwork.java:1147)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.computeGradientAndScore(MultiLayerNetwork.java:2798)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.computeGradientAndScore(MultiLayerNetwork.java:2756)
at org.deeplearning4j.optimize.solvers.BaseOptimizer.gradientAndScore(BaseOptimizer.java:174)
at org.deeplearning4j.optimize.solvers.StochasticGradientDescent.optimize(StochasticGradientDescent.java:61)
at org.deeplearning4j.optimize.Solver.optimize(Solver.java:52)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.fitHelper(MultiLayerNetwork.java:1767)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.fit(MultiLayerNetwork.java:1688)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.fit(MultiLayerNetwork.java:3614)
at org.deeplearning4j.nn.multilayer.MultiLayerNetwork.fit(MultiLayerNetwork.java:3601)
at Runner.main(Runner.java:58)

agibsonccc · April 26, 2023, 7:11am

@Arasaka ah I see the EmbeddingLayer is calling that. Can you post your network and ideally a reproducer?

It might not be pre processing the data correctly. This is common when you set the inputs manually.

Arasaka · April 26, 2023, 7:13am

@agibsonccc
Would you like to lay out the network build or dataset assembly?

Arasaka · April 26, 2023, 7:14am

@agibsonccc my network build:
public static MultiLayerNetwork buildModel() {
int vocabSize = 10000;
int embeddingSize = 300;
int numberHidden = 256;
int numberClasses = 2;
MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
.seed(123)
.updater(new Adam())
.list()
.layer(new EmbeddingLayer.Builder()
.nIn(vocabSize)
.nOut(embeddingSize)
.build())
.layer(new LSTM.Builder()
.nIn(embeddingSize)
.nOut(numberHidden)
.activation(Activation.TANH)
.build())
.layer(new DenseLayer.Builder()
.nIn(numberHidden)
.nOut(numberClasses)
.activation(Activation.SOFTMAX)
.build())
.layer(new RnnOutputLayer.Builder(LossFunctions.LossFunction.MCXENT)
.nIn(numberHidden)
.nOut(numberClasses)
.build())
.build();
MultiLayerNetwork network = new MultiLayerNetwork(conf);
network.init();

    return network;
}

agibsonccc · April 26, 2023, 7:39am

@Arasaka can you remove the manual nIns and use setInputType?
Something like:
.setInputType(InputType.recurrent(W2V_VECTOR_SIZE, 1000))

See more here;

Arasaka · April 26, 2023, 7:42am

@agibsonccc, i’ll try and post here.

Arasaka · April 26, 2023, 10:11am

@agibsonccc
I did it like this, is it right?

public static MultiLayerNetwork buildModel() {
int vocabSize = 10000;
int embeddingSize = 300;
int numberHidden = 256;
int numberClasses = 2;
MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
.seed(123)
.updater(new Adam())
.list()
.layer(new EmbeddingLayer.Builder()
.nIn(vocabSize)
.nOut(embeddingSize)
.build())
.layer(new LSTM.Builder()
.nOut(numberHidden)
.activation(Activation.TANH)
.build())
.layer(new DenseLayer.Builder()
.nOut(numberClasses)
.activation(Activation.SOFTMAX)
.build())
.layer(new RnnOutputLayer.Builder(LossFunctions.LossFunction.MCXENT)
.nOut(numberClasses)
.build())
.setInputType(InputType.reccurrent(embeddingSize))
.build();
MultiLayerNetwork network = new MultiLayerNetwork(conf);
network.init();
return network;
}

agibsonccc · April 26, 2023, 10:15am

@Arasaka remove your other nIn declaration as well. setInputType goes through each layer and adds pre processors as well as sets the input number for you. Run that let me know if you have any issues.

Arasaka · April 26, 2023, 10:27am

@agibsonccc that is to remove still in EmbeddingLayer?

agibsonccc · April 26, 2023, 10:27am

@Arasaka yes. nIn is not needed anywhere in the network besides in setInputType

Arasaka · April 26, 2023, 10:39am

@agibsonccc understood, thank you. Now I will run in and write to you.

Arasaka · April 26, 2023, 12:06pm

@agibsonccc now when transferring the dataset for training to the model, the following error occurs:
“Cannot do forward pass for embedding layer with input more than one column. Expected input shape: [numExamples,1] with each entry being an integer index (layer name: layer0, layer index: 0, layer type: EmbeddingLayer)”. As I understand it, the error occurs because the input data has the dimension [numExamples, maxLength, vectorLength], and the EmbeddingLayer expects input data of the dimension [numExamples, 1], where each element is the index of the word in the dictionary.

agibsonccc · April 26, 2023, 12:18pm

@Arasaka switch to using the EmbeddingSequenceLayer instead. Sorry I wasn’t thinking about that earlier. You can find tests and the class here: https://github.com/search?q=repo%3Adeeplearning4j%2Fdeeplearning4j%20%20EmbeddingSequenceLayer&type=code

Arasaka · April 26, 2023, 12:43pm

@agibsonccc i changed EmbeddingLayer to EmbeddingSequenceLayer. What will be the result - I will write.

agibsonccc · April 26, 2023, 12:49pm

@Arasaka since you’re using 3d the sequence layer expects 3d. That’s why we had the 2d columns assumption in there.

Arasaka · April 26, 2023, 12:55pm

@agibsonccc i replaced like this:
public static MultiLayerNetwork buildModel() {
int vocabSize = 10000;
int embeddingSize = 300;
int numberHidden = 256;
int numberClasses = 2;
MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
.seed(123)
.updater(new Adam())
.list()
.layer(new EmbeddingSequenceLayer.Builder() //change
.nOut(embeddingSize)
.build())
.layer(new LSTM.Builder()
.nOut(numberHidden)
.activation(Activation.TANH)
.build())
.layer(new DenseLayer.Builder()
.nOut(numberClasses)
.activation(Activation.SOFTMAX)
.build())
.layer(new RnnOutputLayer.Builder(LossFunctions.LossFunction.MCXENT)
.nOut(numberClasses)
.build())
.setInputType(InputType.reccurrent(embeddingSize))
.build();
MultiLayerNetwork network = new MultiLayerNetwork(conf);
network.init();
return network;
}

I hope I did it right

agibsonccc · April 26, 2023, 1:00pm

@Arasaka at first glance that looks good. Give that a shot and I"ll check on you tomorrow.

Arasaka · April 27, 2023, 2:46am

@agibsonccc hello, today and tomorrow will have to wait a while with this. I hope you wait.

Topic		Replies	Views
Illegal State Exception 3D input expected to RNN layer expected, got 2 DL4J	3	788	March 3, 2022
Training CNN error / CNN text classification DL4J	59	2468	May 15, 2021
DenseLayer (index=9, name=ffn0) nIn=0, nOut=3072; nIn and nOut must be > 0 RL4J	18	599	August 1, 2023
Error in training -> Invalid input: expect CNN activations with rank 4 DL4J	3	234	July 29, 2023
Problem removing lstm, shape exception RL4J	5	365	July 3, 2022

passing the dataset to the fit method of the MultiLayerNetwork class

Related topics