Why only 1 CPU on multi-socket systems

diane · January 6, 2020, 8:19am

Why do we prefer to use only 1 CPU on multi-socket systems?

AlexBlack · January 20, 2020, 1:54am

This comes down to an architectural property of multi-socket systems called NUMA (non-uniform memory access).
In a NUMA system (for example, one with multiple physical CPUs/sockets), how long a memory access takes depends on which CPU is accessing it: memory attached to a CPU's own socket is faster to reach than memory attached to another socket.

ND4J (and hence DL4J/SameDiff) is not NUMA-aware at present (it’s on our roadmap), so performance may not scale well on multi-socket systems, due to lots of potentially costly data transfers between CPUs.
Standard single-socket multi-core systems don’t have this issue. That covers almost all consumer/workstation setups and a sizable fraction of servers, since they aren’t NUMA.
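
If you're not sure whether your machine is affected, here is a rough sketch of a check. It is not part of ND4J: the class and method names (`NumaCheck`, `countNumaNodes`, `isNodeDir`) are made up for illustration, and it assumes Linux, where the kernel normally exposes one `nodeN` directory per NUMA node under `/sys/devices/system/node`. On other operating systems it just reports 0.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.stream.Stream;

// Hypothetical helper, not an ND4J API: counts NUMA nodes via Linux sysfs.
public class NumaCheck {

    // Assumption: Linux sysfs layout with one directory per node (node0, node1, ...).
    public static final Path SYSFS_NODE_DIR = Paths.get("/sys/devices/system/node");

    /** Returns true for entry names of the form "node" followed by digits. */
    public static boolean isNodeDir(String name) {
        return name.matches("node\\d+");
    }

    /** Counts node directories under nodeDir; returns 0 if it is missing or unreadable. */
    public static int countNumaNodes(Path nodeDir) {
        if (!Files.isDirectory(nodeDir)) {
            return 0; // not Linux, or sysfs is not available
        }
        try (Stream<Path> entries = Files.list(nodeDir)) {
            return (int) entries
                    .filter(p -> isNodeDir(p.getFileName().toString()))
                    .count();
        } catch (IOException | UncheckedIOException e) {
            return 0; // treat read errors as "unknown"
        }
    }

    public static void main(String[] args) {
        int nodes = countNumaNodes(SYSFS_NODE_DIR);
        if (nodes > 1) {
            System.out.println(nodes + " NUMA nodes found: cross-node memory access may hurt scaling.");
        } else if (nodes == 1) {
            System.out.println("1 NUMA node found: memory access should be uniform.");
        } else {
            System.out.println("Could not read NUMA topology (non-Linux system or sysfs unavailable).");
        }
    }
}
```

A count above 1 only suggests a NUMA setup; the node count does not always match the socket count. If it does report several nodes, one common way to keep a JVM on a single node on Linux is to launch it through `numactl` with `--cpunodebind=0 --membind=0`, assuming that tool is installed.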
