GPU Issue, memory not freed

aschwanden · January 13, 2020, 11:54am

Hi everyone,

GPU issue:
After training a model with DL4j and using GPU memory to do so, the allocated GPU memory is not freed when the training is done. (The Java app which is doing the training stays alive after training. Only after this application is terminated, the GPU memory is freed). Any ideas why this could be the case? Do we need to do something manually here in order to free this memory? (ps: we are not using UIServer)

Thanks in advance & cheers
Reto

AlexBlack · January 20, 2020, 2:10am

Hi Reto
There’s a few different types of GPU memory that are released at different times.

a) Native libraries - code for various operations, including all the nd4j/libnd4j ops, and libraries like cuBLAS and cuDNN - these are only released when the process is terminated

b) Network parameters - memory here is only released once the network is garbage collected - i.e., there are no references to the network remaining in your code, and the java garbage collector runs.
myNetwork.params().close() is also an option to do it manually.

c) Workspaces memory (for activations, gradients, etc) - released when a thread is GC’d. For the main thread, you can do Nd4j.getWorkspaceManager().destroyAllWorkspacesForCurrentThread();

aschwanden · January 20, 2020, 5:37am

Hi Alex, thanks for the lengthy response! Cheers Reto

arnaud22 · May 7, 2020, 12:13pm

Hi @AlexBlack,

Does this fonctionment works for CPU ?
Backend used: [CPU]

Cheers.

Topic		Replies	Views
Destroy Neural Network completely (Memory Leak) DL4J	2	425	November 27, 2020
Should I call close()? ND4J	4	201	October 23, 2023
Running out of GPU Memory Despite Setting Parameters DL4J	8	618	July 29, 2021
How do I fix this possible memory leak? ND4J	3	747	April 20, 2020
Debugging Memory Issues in Java Application DL4J	7	781	March 5, 2020

GPU Issue, memory not freed

Related topics