Training CNN error / CNN text classification

treo · May 5, 2021, 7:50am

Going by the code in your gist, you aren’t actually creating a character-based input, but let’s get your model problem sorted out first.

What you are looking for is a reshape. In principle you shouldn’t need to add anything if your input type is set up correctly.

In that case it should add the conversion from CNN format to a flat format automatically:

https://github.com/eclipse/deeplearning4j/blob/a1c407f5802231cf86098560c077cc7022bd74a0/deeplearning4j/deeplearning4j-nn/src/main/java/org/deeplearning4j/nn/conf/layers/FeedForwardLayer.java#L102-L105

          case CNN:
              //CNN -> FF
              InputType.InputTypeConvolutional c = (InputType.InputTypeConvolutional) inputType;
              return new CnnToFeedForwardPreProcessor(c.getHeight(), c.getWidth(), c.getChannels(), c.getFormat());

And your original error message actually tells us that it is added.
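
To make the CNN -> flat conversion a bit more concrete, here is a minimal plain-Java sketch of what flattening a single example amounts to, assuming a channels-first [channels][height][width] layout. It is not DL4J's implementation, and `FlattenSketch` and `flatten` are names made up for illustration only.

```java
final class FlattenSketch {

    // Flattens one example of shape [channels][height][width] into a
    // vector of length channels * height * width, in row-major order.
    // Illustration only: this is NOT the DL4J preprocessor itself.
    static float[] flatten(float[][][] activations) {
        int channels = activations.length;
        int height = activations[0].length;
        int width = activations[0][0].length;

        float[] flat = new float[channels * height * width];
        int i = 0;
        for (int c = 0; c < channels; c++) {
            for (int h = 0; h < height; h++) {
                for (int w = 0; w < width; w++) {
                    flat[i++] = activations[c][h][w];
                }
            }
        }
        return flat;
    }

    public static void main(String[] args) {
        float[][][] example = new float[2][3][4];
        // The flat vector has 2 * 3 * 4 elements.
        System.out.println(flatten(example).length);
    }
}
```

The point is only that the layer after the conversion sees one flat vector per example, so the sizes have to multiply out to what that layer expects.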

So it looks like your actual problem is in data loading. And as I said initially, what you are doing certainly is not a character-level one-hot encoding of the input.

As your data probably fits into memory comfortably, I suggest you start with a CollectionSequenceRecordReader and give it a List<List<List<StringWritable>>>. The outermost list contains all of your examples. The middle list contains all the steps of your sequence. The innermost list is then a list of two elements: new StringWritable(char) and new StringWritable(label).

While this does duplicate your label for every character, it simplifies the setup a bit. Once you understand how things work you can split that out again.
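
The nesting described above can be sketched in plain Java. To keep it runnable without DataVec on the classpath, this sketch uses plain `String` values as stand-ins; in real code each string would be wrapped in a writable as described above before the list is handed to the record reader. `SequenceDataSketch`, `toSequence`, and the example texts and labels are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class SequenceDataSketch {

    // Turns one example into a list of steps.
    // Each step is a two-element list: [character, label].
    // The label is repeated on every step, as described above.
    static List<List<String>> toSequence(String text, String label) {
        List<List<String>> steps = new ArrayList<>();
        for (char ch : text.toCharArray()) {
            steps.add(Arrays.asList(String.valueOf(ch), label));
        }
        return steps;
    }

    public static void main(String[] args) {
        // Outermost list: all examples.
        List<List<List<String>>> examples = new ArrayList<>();
        examples.add(toSequence("good", "pos"));
        examples.add(toSequence("bad", "neg"));

        // Middle list: the steps of the second example.
        System.out.println(examples.get(1));
    }
}
```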

You then create a transform that will turn your characters into a one-hot encoding (e.g. see Quickstart with Deeplearning4J – dubs·tech) and create a SequenceRecordReaderDataSetIterator that will then create sequences of one-hot encoded vectors, just as you wanted it to.
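
If it is unclear what a character-level one-hot encoding should look like once it comes out of that pipeline, here is a small plain-Java sketch of the idea. It does not use the DataVec transform API; the alphabet, the choice to leave unknown characters as all-zero rows, and the names `OneHotSketch` and `encode` are assumptions made for illustration.

```java
final class OneHotSketch {

    // Hypothetical alphabet: the index of a character in this string
    // is the position of the 1 in its one-hot vector.
    static final String ALPHABET = "abcdefghijklmnopqrstuvwxyz ";

    // Returns an array of shape [sequenceLength][alphabetSize].
    // Characters that are not in ALPHABET are left as all-zero rows.
    static float[][] encode(String text) {
        float[][] out = new float[text.length()][ALPHABET.length()];
        for (int t = 0; t < text.length(); t++) {
            int idx = ALPHABET.indexOf(text.charAt(t));
            if (idx >= 0) {
                out[t][idx] = 1f;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        float[][] encoded = encode("cab");
        // One row per character, one column per alphabet entry.
        System.out.println(encoded.length + " x " + encoded[0].length);
    }
}
```

Each input then becomes a sequence of vectors with exactly one 1 per step, which is quite different from feeding in a single vector per text.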

If you want to skip all that and just have a sanity check that your model is correct first, you can simply create INDArrays of the correct shape (which you should understand by now) for your inputs and labels, and send them through the model (model.fit(input, labels)).
