How to use NDArray like NumPy?

yjay · October 16, 2020, 2:37am

I’m new to using DL4J and I’m trying to figure out how to use NDArray
What I’m trying to do is similar to the NumPy python np.array

Basically what I’m trying to do is add a dataframe of one column into the array.

For ex. if I have a dataframe like:

 Height
    t
    t
    t
    t
    s
    s
    s

Using NumPy I could do:

heightDataframe = ...
heightArray = np.array(heightDataframe)

and I get back an array like:

[['t'] 
['t']
['t']
['t']
['s']
['s']
['s']]

So I’m wondering how I could do something similar in ND4J?
I tried reading through the docs but I was a bit confused about how to get it started.

Any ideas would be great!
Thanks so much!

agibsonccc · October 16, 2020, 3:13am

@yjay nd4j itself is just a “numpy”. In order to use it with a specific input pipeline, you normally use datavec with it.

We have examples of that here for csv data:

Nd4j can also ready numpy arrays directly. If you want to use python to do your ETL then just save numpy arrays we load that can work as well:

github.com

eclipse/deeplearning4j/blob/master/nd4j/nd4j-backends/nd4j-api-parent/nd4j-api/src/main/java/org/nd4j/linalg/factory/Nd4j.java#L5654


      
              return INSTANCE.createFromNpyFile(file);

          }

          

          public static Map<String, INDArray> createFromNpzFile(File file) throws Exception{

              return INSTANCE.createFromNpzFile(file);

          }

          

          /**

           * Create a numpy array based on the passed in input stream

           * @param is the input stream to read

           * @return the loaded ndarray

           */

          @SuppressWarnings("unused")

          public static INDArray createNpyFromInputStream(@NonNull InputStream is) throws IOException {

              byte[] content = IOUtils.toByteArray(is);

              return createNpyFromByteArray(content);

          }

          

          

          /**

           * Create an {@link INDArray} from the given numpy input.<br>

Unfortunately, we don’t have as much magic around auto data type conversion and do not support objects as input types. We do allow strings though. If you can tell me more specifically what you’re trying to do I can make a better recommendation.

yjay · October 16, 2020, 4:56pm

Thanks so much for this useful information!

Basically what I’m trying to do is make an array that can be used for data science predictions. More specifically at the moment I’m looking to use if for k-nearest neighbour. In knn.fit, I want to use a flattened array containing numpy.array for the values in knn.fit.

I also want to be able to use if later on for things like numpy.maximum and minimum.

agibsonccc · October 18, 2020, 11:41pm

@yjay then for columnar data, we typically use csv record readers and datavec. Like I mentioned, if you prefer python and pandas you can save your dataset as numpy arrays and we can directly load them.
For max and min you’ll want:

github.com

eclipse/deeplearning4j/blob/88d3c4867fb87ec760b445c6b9459ecf353cec47/nd4j/nd4j-backends/nd4j-api-parent/nd4j-api/src/main/java/org/nd4j/linalg/ops/transforms/Transforms.java#L870


      
          
          
/**
           * Element wise maximum function between 2 INDArrays
           *
           * @param first
           * @param second
           * @param dup
           * @return
           */
          public static INDArray max(INDArray first, INDArray second, boolean dup) {
              long[] outShape = broadcastResultShape(first, second);   //Also validates
              Preconditions.checkState(dup || Arrays.equals(outShape, first.shape()), "Cannot do inplace max operation when first input is not equal to result shape (%ndShape vs. result %s)",
                      first, outShape);
              INDArray out = dup ? Nd4j.create(first.dataType(), outShape) : first;
              return Nd4j.exec(new org.nd4j.linalg.api.ops.impl.transforms.custom.Max(first, second, out))[0];
          }
          
          
/**
           * Element wise maximum function between 2 INDArrays
           *
           * @param first

github.com

eclipse/deeplearning4j/blob/88d3c4867fb87ec760b445c6b9459ecf353cec47/nd4j/nd4j-backends/nd4j-api-parent/nd4j-api/src/main/java/org/nd4j/linalg/ops/transforms/Transforms.java#L920


      
          
          
/**
           * Element wise minimum function between 2 INDArrays
           *
           * @param first
           * @param second
           * @param dup
           * @return
           */
          public static INDArray min(INDArray first, INDArray second, boolean dup) {
              long[] outShape = broadcastResultShape(first, second);   //Also validates
              Preconditions.checkState(dup || Arrays.equals(outShape, first.shape()), "Cannot do inplace min operation when first input is not equal to result shape (%ndShape vs. result %s)",
                      first, outShape);
              INDArray out = dup ? Nd4j.create(first.dataType(), outShape) : first;
              return Nd4j.exec(new org.nd4j.linalg.api.ops.impl.transforms.custom.Min(first, second, out))[0];
          }
          
          
/**
           * Element wise minimum function between 2 INDArrays
           *
           * @param first

Topic		Replies	Views
Basic deeplearning4j classification example DL4J	4	996	February 3, 2020
Working with arrays of data DL4J	1	32	December 12, 2024
Want to use gather - how? ND4J	3	789	June 3, 2020
Fit data into ND4J to get prediction of the model ND4J	8	315	November 16, 2022
Recommended way to create INDArray for prediction? DL4J	5	987	May 29, 2020

How to use NDArray like NumPy?

Related topics