Parallel training on multicore single CPU

I would like to understand how parallel training on a multicore single CPU works. I plan to write my own EvaluativeListener so that I can control when to stop training. When I looked at the original implementation here, I noticed that the iteration counter uses ThreadLocal. My question is: what kind of parallel training approach is used by default? I am working in a multicore, single-CPU environment. Does each thread use a portion of the training data to run the optimizer and update the weights synchronously? Or does the parallel training happen in some other way? Thanks for the info.
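For context on the ThreadLocal detail mentioned above: a ThreadLocal counter gives each thread its own independent value, so two threads calling the same listener do not interfere with each other's iteration counts. This is a minimal standalone sketch of that pattern (the class and method names here are made up for illustration, not the actual listener code):

```java
import java.util.concurrent.*;

public class ThreadLocalCounterDemo {
    // Each thread sees its own copy of the counter, which is why a
    // listener can use ThreadLocal safely from multiple training threads.
    private static final ThreadLocal<Integer> iterCount =
            ThreadLocal.withInitial(() -> 0);

    static int step() {
        int next = iterCount.get() + 1;
        iterCount.set(next);
        return next;
    }

    public static void main(String[] args) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(2);
        Callable<Integer> worker = () -> {
            int last = 0;
            for (int i = 0; i < 5; i++) last = step();
            return last; // each thread counts to 5 independently
        };
        Future<Integer> a = pool.submit(worker);
        Future<Integer> b = pool.submit(worker);
        System.out.println(a.get() + " " + b.get()); // prints "5 5"
        pool.shutdown();
    }
}
```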

That is what you would typically use the early stopping functionality for.
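The core of early stopping is simple enough to sketch in a few lines: track the best validation score seen so far and stop once it has failed to improve for a given number of epochs ("patience"). This is a generic illustration of the idea, not the library's early stopping API; the hard-coded loss values stand in for scores your listener would compute on a held-out set:

```java
import java.util.List;

public class EarlyStopSketch {
    // Hypothetical per-epoch validation losses; in real training these
    // would come from evaluating the model after each epoch.
    static final List<Double> valLoss = List.of(0.9, 0.7, 0.6, 0.61, 0.62, 0.63);

    public static int trainWithEarlyStopping(int patience) {
        double best = Double.MAX_VALUE;
        int badEpochs = 0, epoch = 0;
        for (double loss : valLoss) {
            epoch++;
            if (loss < best) {
                best = loss;       // improvement: reset the patience counter
                badEpochs = 0;
            } else if (++badEpochs >= patience) {
                break;             // no improvement for `patience` epochs: stop
            }
        }
        return epoch; // epoch at which training stopped
    }

    public static void main(String[] args) {
        System.out.println(trainWithEarlyStopping(2)); // prints "5"
    }
}
```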

That is because the listener can be used in a multithreaded environment, e.g. when multiple GPUs or CPUs are used. When training in a single-CPU multicore environment, usually only the actual math is parallelized, as that can still be done efficiently enough.
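To make "only the math is parallelized" concrete: the training loop itself runs on one thread, but an individual operation such as a matrix-vector product can be split across cores. A rough stdlib sketch of that intra-op parallelism (not how the library actually implements its math kernels):

```java
import java.util.stream.IntStream;

public class ParallelMathDemo {
    // One logical operation (matrix-vector product) whose rows are
    // computed in parallel on the common fork-join pool, while the
    // caller (the "training loop") remains single-threaded.
    static double[] matVec(double[][] m, double[] v) {
        return IntStream.range(0, m.length)
                .parallel()
                .mapToDouble(r -> {
                    double sum = 0;
                    for (int c = 0; c < v.length; c++) {
                        sum += m[r][c] * v[c];
                    }
                    return sum;
                }).toArray();
    }

    public static void main(String[] args) {
        double[][] m = {{1, 2}, {3, 4}};
        double[] v = {1, 1};
        double[] out = matVec(m, v);
        System.out.println(out[0] + " " + out[1]); // prints "3.0 7.0"
    }
}
```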

Makes sense. Thanks.