Loading model takes over two hours?!

treo · June 18, 2020, 8:13pm

There are a few things wrong here:

Why are you using reflection?
Your dependencies look weird, why do you have an explicit dependency on nd4j-buffer? Why are you mixing cuda versions?
Where are you reading the data from? What kind of storage is it?
Even though your pom.xml says beta6, are you entirely sure you aren’t somehow on beta7 (tried to downgrade after seeing something like this: Beta 7 - Glove Word Vector - #4 by treo, but Eclipse didn’t properly pick up on that)

Typically loading a 1GB binary takes about as long as it takes to read the file - 2 hours obviously is way too long.

If you can, running your application with a profiler should also shed some light into why it takes so long to load it.

Topic		Replies	Views
WordVectorSerializer.readWord2VecModel throws an exception: "Unable to guess input file format" DL4J	0	370	November 20, 2020
Read model as InputStream DL4J	17	778	May 12, 2020
Use dl4j is slow to load pre-trained model is slow SameDiff	2	220	September 15, 2023
Beta 7 - Glove Word Vector DL4J	3	1010	May 19, 2020
Error when loading 'wiki.en.bin' pretrained FastText model DL4J	4	485	March 19, 2021