There are a few things wrong here:
- Why are you using reflection?
- Your dependencies look weird, why do you have an explicit dependency on nd4j-buffer? Why are you mixing cuda versions?
- Where are you reading the data from? What kind of storage is it?
- Even though your pom.xml says beta6, are you entirely sure you aren’t somehow on beta7 (tried to downgrade after seeing something like this: Beta 7 - Glove Word Vector - #4 by treo, but Eclipse didn’t properly pick up on that)
Typically loading a 1GB binary takes about as long as it takes to read the file - 2 hours obviously is way too long.
If you can, running your application with a profiler should also shed some light into why it takes so long to load it.