Getting output prediction for LSTM

ramarro123 · September 23, 2022, 6:14pm

Hello,

i didn’t understand how to get output for a LSTM network (probably more specifically any network)

i have a set of CSV file with that format, every file has 200 rows. there are 10 “attributes” and label can assume only 0/1 values, so i guess there are 2 labels

0/1 (label),double,double,double…
0/1 (label),double,double,double…

and so on…

so i splitted the file arbitrary from 0-50 are used for train and 51-60 for test

datasetiterator are created in that way

     SequenceRecordReader reader = new CSVSequenceRecordReader(0, ",");
     reader.initialize(new NumberedFileInputSplit("data/csv/%d.csv", 0, 50));
     DataSetIterator trainData = new SequenceRecordReaderDataSetIterator(reader, 100, 2, 0, false);

     SequenceRecordReader reader = new CSVSequenceRecordReader(0, ",");
     reader.initialize(new NumberedFileInputSplit("data/csv/%d.csv", 51, 60));
     DataSetIterator testData= new SequenceRecordReaderDataSetIterator(reader, 100, 2, 0, false);

same apply for test.

net is created with

MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
            .seed(123)    //Random number generator seed for improved repeatability. Optional.
            .weightInit(WeightInit.XAVIER)
            .updater(new Adam(0.005))
            .list()
            .layer(0, new LSTM.Builder().activation(Activation.TANH).nIn(numFeatures).nOut(HID_LAY).build())

            .layer(1, new LSTM.Builder().activation(Activation.TANH).nIn(HID_LAY).nOut(HID_LAY).build())

            .layer(2, new RnnOutputLayer.Builder(LossFunctions.LossFunction.MCXENT)
                    .activation(Activation.SOFTMAX).nIn(HID_LAY).nOut(2).build())
            .build();

    MultiLayerNetwork net = new MultiLayerNetwork(conf);

hid_lay it’s 200.
now, i train for some times with (eventually i also use net.evaluate(test) to see the progress, but it doesn’t matter at this point.

            net.fit(trainData);
            trainData.reset();

and then i am trying to extract data. my idea was to do something like

            INDArray output = net.output(testData);

but i don’t understand at this point, what information i can extract from output.

how to get the “next” prediction for testData?

how to get the prediction for testData at position N? (suppose again that testData is loaded from a set of csv (9x) that have 200 rows each, how can i get the prediction for index YYYY)

thanks for helping hope my english is clear enough

agibsonccc · September 24, 2022, 7:05am

@ramarro123 take a look at rnn time step: Recurrent Neural Network - Deeplearning4j if you have something more specific after reading this feel free to post here.

ramarro123 · September 24, 2022, 8:02am

@agibsonccc can you please elaborate “more specific”?

i review the link, and from scetch (unfortunatley there aren’t many examples, just one for ucisequenceclassification, that doesn’t extract result, jsut validate with eval)

having said so, let me re-ask the specific question, in a more specific way.

specific part (not necessary a clever question)

i want to use this

As with other types of neural networks, predictions can be generated for RNNs using the MultiLayerNetwork.output() and MultiLayerNetwork.feedForward() methods. These methods can be useful in many circumstances;

so, using output() to classify my testSet.

having executed

            INDArray output = net.output(testData);

i can’t understand how to get the “classified” value of a certain index of my testData (containing let’s say 2000 entry)

i hope that’s specific enough

as for the quality of the question, could be a dumb question, so feel free to tell me if silly questions aren’t gonna get covered

agibsonccc · September 24, 2022, 12:13pm

@ramarro123 so if you read the RNN section you’ll see there’s different ways of calling “output” on the network. Try to understand which type you need.

That could be the full series or that could be the next time step. It doesn’t really look like you read the docs or I would have saw some additional questions.

If you don’t quite get what’s going on there please try to ask more questions rather than ignore it.

“More specific” should be additional questions whether it be on the structure of the input data or how time series works.

Beyond that…try looking at this example:

github.com

deeplearning4j/deeplearning4j-examples/blob/686db99fee3d4825ee70663e1a15aa8d6216f2c2/oreilly-book-dl4j-examples/dl4j-examples/src/main/java/org/deeplearning4j/examples/recurrent/seqClassification/UCISequenceClassificationExample.java

package org.deeplearning4j.examples.recurrent.seqClassification;

import org.apache.commons.io.FileUtils;
import org.apache.commons.io.IOUtils;
import org.deeplearning4j.nn.conf.layers.LSTM;
import org.nd4j.linalg.primitives.Pair;
import org.datavec.api.records.reader.SequenceRecordReader;
import org.datavec.api.records.reader.impl.csv.CSVSequenceRecordReader;
import org.datavec.api.split.NumberedFileInputSplit;
import org.deeplearning4j.datasets.datavec.SequenceRecordReaderDataSetIterator;
import org.deeplearning4j.eval.Evaluation;
import org.deeplearning4j.nn.conf.GradientNormalization;
import org.deeplearning4j.nn.conf.MultiLayerConfiguration;
import org.deeplearning4j.nn.conf.NeuralNetConfiguration;
import org.deeplearning4j.nn.conf.layers.GravesLSTM;
import org.deeplearning4j.nn.conf.layers.RnnOutputLayer;
import org.deeplearning4j.nn.multilayer.MultiLayerNetwork;
import org.deeplearning4j.nn.weights.WeightInit;
import org.deeplearning4j.optimize.listeners.ScoreIterationListener;
import org.nd4j.linalg.activations.Activation;

This file has been truncated. show original

Beyond that, evaluate actually does use output underneath. You should want to know how well your network does.

The output call for the full time series will just give you an output with the classifications. Those are going to usually be standard softmax outputs with probabilities.

For the labels when you’re using output you use argMax. We actually do have more examples of this take a look here:

github.com

deeplearning4j/deeplearning4j-examples/blob/686db99fee3d4825ee70663e1a15aa8d6216f2c2/dl4j-examples/src/main/java/org/deeplearning4j/examples/advanced/modelling/sequenceprediction/TrainLotteryModelSeqPrediction.java#L102


      
          System.out.println("=============run time=====================" + (endTime - startTime));
          
          
// save model to disk
          model.save(modelFile, true);
          
          
int luckySize = 5;
          if (modelType) {
              while (testIterator.hasNext()) {
                  DataSet ds = testIterator.next();
                  //predictions all at once
                  INDArray output = model.output(ds.getFeatures());
                  INDArray label = ds.getLabels();
                  INDArray preOutput = Nd4j.argMax(output, 2);
                  INDArray realLabel = Nd4j.argMax(label, 2);
                  StringBuilder peLabel = new StringBuilder();
                  StringBuilder reLabel = new StringBuilder();
                  for (int dataIndex = 0; dataIndex < 5; dataIndex++) {
                      peLabel.append(preOutput.getInt(dataIndex));
                      reLabel.append(realLabel.getInt(dataIndex));
                  }
                  log.info("test-->real lottery {}  prediction {} status {}", reLabel.toString(), peLabel.toString(), peLabel.toString().equals(reLabel.toString()));

Topic		Replies	Views
What method should i use for a time series prediction? DL4J	0	351	December 29, 2020
Prediction of Disk Space DL4J	8	413	June 10, 2021
LSTM prediction problems DL4J	0	376	August 21, 2020
A post in "Weird results from my LSTM prediction" requires staff attention DL4J	2	406	January 11, 2021
DL4J Need help with my input data DL4J	5	462	December 25, 2020

Getting output prediction for LSTM

Related topics