How to adapt the LSTM neural network example with sea temperature data to a different dataset?

LeandroZanatta · March 19, 2023, 9:17pm

In the example https://deeplearning4j.konduit.ai/v/en-1.0.0-beta7/getting-started/tutorials/sea-temperature-convolutional-lstm, without detailed information on how the data was processed and transformed into a suitable input format for the neural network model, the example is less useful for those who want to adapt it to a different dataset. This can make it difficult to understand and implement the model for other use cases. It would be useful to have more detailed information on the data preprocessing and transformation process so that we can better understand how the input data should be formatted for the neural network model.

agibsonccc · March 19, 2023, 9:25pm

@LeandroZanatta Could you clarify what you’re looking for?

Before I start, I want you to know that M2.1 is the latest version and beta7 is 2 years old at this point. We’ve removed a lot of those examples from newer versions for that exact reason.

For example, if there were things like normalizations it would be specified.

Are you talking about the raw csv data itself and how it gets converted to a neural network?

If you’re looking for an overview of each class or something look on the class itself:

github.com

deeplearning4j/deeplearning4j/blob/4766032444de8e0c2c3389270576bb6e7c466211/deeplearning4j/deeplearning4j-data/deeplearning4j-datavec-iterators/src/main/java/org/deeplearning4j/datasets/datavec/SequenceRecordReaderDataSetIterator.java#L47


      
          import org.nd4j.linalg.indexing.INDArrayIndex;
          import org.nd4j.linalg.indexing.NDArrayIndex;
          
          
import java.io.IOException;
          import java.io.Serializable;
          import java.util.*;
          
          
public class SequenceRecordReaderDataSetIterator implements DataSetIterator {
              /**Alignment mode for dealing with input/labels of differing lengths (for example, one-to-many and many-to-one type situations).
               * For example, might have 10 time steps total but only one label at end for sequence classification.<br>
               * Currently supported modes:<br>
               * <b>EQUAL_LENGTH</b>: Default. Assume that label and input time series are of equal length, and all examples are of
               * the same length<br>
               * <b>ALIGN_START</b>: Align the label/input time series at the first time step, and zero pad either the labels or
               * the input at the end<br>
               * <b>ALIGN_END</b>: Align the label/input at the last time step, zero padding either the input or the labels as required<br>
               *
               * Note 1: When the time series for each example are of different lengths, the shorter time series will be padded to
               * the length of the longest time series.<br>
               * Note 2: When ALIGN_START or ALIGN_END are used, the DataSet masking functionality is used. Thus, the returned DataSets
               * will have the input and mask arrays set. These mask arrays identify whether an input/label is actually present,

If you’re curious about say: the record reader you (or any other class) a good trick is to also check the tests:

Happy to help please do narrow down what you’re looking for.

LeandroZanatta · March 20, 2023, 1:52pm

My doubt is how the raw data was converted into the csv files.
In the case itself, my question is more related to how the data was organized into 52 inputs. It only states that data from 8 seas was used and organized into 2 dimensions. But what do the 2 dimensions refer to? Seas and characteristics of each sea? Time and characteristics of each period? I’m more interested in knowing why it was done this way than how it was done. I had this doubt when reading this tutorial, and I believe that this could contribute to a deeper understanding of the example.

agibsonccc · March 20, 2023, 10:09pm

@LeandroZanatta it it helps the dataset itself is fairly generic. This just forecasts future values of the given inputs. This is setup as a regression problem. The original tutorial is here:

I’d argue that the given tutorial is fairly generic and does set you up for a regression problem.

Just know that each time step as a row in the CSV file, with the time column as the first column, feature columns as input columns.

If we were to rewrite this (again this is a fairly old tutorial but still works)

a simpler problem would probably be more suitable (eg: stocks)

Would that be ok? The issue with regression problems is you can either try to forecast a completely independent variable or the future state of the inputs.

Topic		Replies	Views
Questions for Time Series LSTM DL4J	20	2027	February 26, 2020
How to prepare time series data for LSTM? DL4J	1	515	May 11, 2022
DL4J Need help with my input data DL4J	5	460	December 25, 2020
LSTM Regression Example DL4J	11	1158	January 13, 2022
Clinical Time Series LSTM	2	438	June 5, 2021

How to adapt the LSTM neural network example with sea temperature data to a different dataset?

Related topics