Dl4j cuda 11.2 running out of memory on evaluation on ubuntu 20.04

Hi guys,

To be a little more clear on the steps that we did, here are the details below:

We are using these memory settings:

When we enable the automatic gc, we don’t get error in the first epoch evaluation

And if we use this custom class, the train do 8 epochs without problems

But it is failing here

For now we are catching this error and save the model, but we are testing the same code on Windows with cuda 10.2 and video card GTX 1080 ti and we dont have this problem. Also you suggested:

You can also try setting the gc period to longer.

are you able to explain more on how to achieve this?
Thank you for your continued and timely assistance