@adonnini so let’s agree you need labels.
Pardon me for assuming, but based on your description you don’t know what those are and need help with that. Let’s reword your problem to that.
I’ve said numerous times to pick a column to use as a label and I haven’t seen you say “this one?”.
Regarding this:
FYI, I removed the dependent variable (label) from the input dataset,
and I corrected the 3d error
Put the dependent variable back. You need that as your label.
Regarding:
next location problem. It is a tool to help get there. at least, that’s
how I currently use it in my application. Please also note that a
clustering method like DBSCAN is probably better than K-Means, and
clustering should also have a temporal component.
Kmeans only requires you to define the number of clusters you want. DBScan is similar. Those are both unsupervised methods only working off the data.
Neural networks (and any machine learning algorithm that does regression/classification) aka supervised learning needs those dependent variables.
A neural network will learn the pattern based on the variables that you tell it to. It’ll say “map these input columns to correlate with this output column”.
Based on the paper:
Table 3. Sample Input
Input
[0.5, 0.8660254, 0.43388374, -0.90096887],
[0.5, 0.8660254, 0.43388374, -0.90096887,]
Output
[39., -77.00935364]
In this case you’re forecasting coordinates. Therefore you’ll want the final destination as the output variable for regression. Those are the variables you want to predict.
For the given application any row of your CSV would thus contain:
[0.5, 0.8660254, 0.43388374, -0.90096887, 39.0, -77.00523376],
[0.5, 0.8660254, 0.43388374, -0.90096887, 39.0, -77.00798035]
From the looks of it these would be 2 timesteps.
Pardon me but I’ve only skimmed the paper looking for a proper problem definition. This gave me what I was looking for.
Note that for the loss function they used. RMSE. IN dl4j that will be RMSE_XENT.
Sorry I know this was frustrating but literally all I was looking for was:
I’m using LSTMs to predict an x and y coordinate/ or lat/long and a paper I’m referencing appears to use a timestep of 2. How do I set that up?
In your case, clustering actually should work for that at least somewhat well since the problem is actually very much a nearest neighbors problem. I could see where you’d want to use LSTMs for that though. Just note that an LSTM in order to learn the pattern will need some sort of labels.