Libcublas.so.10: cannot open shared object file: No such file or directory

Hi, everybody, I have a problem when I execute this command: java -jar xxxxx.jar

The problem is:

Warning: Versions of org.bytedeco:javacpp:1.5.3 and org.bytedeco:cuda:10.1-7.6-1.5.2 do not match.
2020-10-14 15:27:03,165 - [WARN] o.n.l.f.Nd4jBackend - Skipped [JCublasBackend] backend (unavailable): java.lang.UnsatisfiedLinkError: /data/yuhao/.javacpp/cache/cuda-10.1-7.6-1.5.2-linux-x86_64.jar/org/bytedeco/cuda/linux-x86_64/libjnicublas.so: libcublas.so.10: cannot open shared object file: No such file or directory
Exception in thread "main" java.lang.ExceptionInInitializerError
	at org.deeplearning4j.models.embeddings.loader.WordVectorSerializer.readWord2VecModel(WordVectorSerializer.java:2392)
	at org.deeplearning4j.models.embeddings.loader.WordVectorSerializer.readWord2VecModel(WordVectorSerializer.java:2372)
	at org.deeplearning4j.models.embeddings.loader.WordVectorSerializer.readWord2VecModel(WordVectorSerializer.java:2341)
	at com.sohu.modelupdate.CalculateCurrentModelPerformance.main(CalculateCurrentModelPerformance.java:43)
Caused by: java.lang.RuntimeException: org.nd4j.linalg.factory.Nd4jBackend$NoAvailableBackendException: Please ensure that you have an nd4j backend on your classpath. Please see: https://deeplearning4j.konduit.ai/nd4j/backend
	at org.nd4j.linalg.factory.Nd4j.initContext(Nd4j.java:5094)
	at org.nd4j.linalg.factory.Nd4j.<clinit>(Nd4j.java:270)
	... 4 more
Caused by: org.nd4j.linalg.factory.Nd4jBackend$NoAvailableBackendException: Please ensure that you have an nd4j backend on your classpath. Please see: https://deeplearning4j.konduit.ai/nd4j/backend
	at org.nd4j.linalg.factory.Nd4jBackend.load(Nd4jBackend.java:221)
	at org.nd4j.linalg.factory.Nd4j.initContext(Nd4j.java:5091)
	... 5 more

My pom.xml configuration is:

...
<properties>
  <nd4j.version>1.0.0-beta7</nd4j.version>
  <dl4j.version>1.0.0-beta7</dl4j.version>
  <nd4j.backend>nd4j-cuda-10.1-platform</nd4j.backend>
</properties>
...
<dependencies>
  <dependency>
    <groupId>org.nd4j</groupId>
    <artifactId>${nd4j.backend}</artifactId>
    <version>${nd4j.version}</version>
  </dependency>
</dependencies>

and my machine’s GPU information is:

$ nvidia-smi
Wed Oct 14 15:38:21 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.39       Driver Version: 418.39       CUDA Version: 10.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla P40           Off  | 00000000:02:00.0 Off |                    0 |
| N/A   20C    P0    48W / 250W |      0MiB / 22919MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  Tesla P40           Off  | 00000000:04:00.0 Off |                    0 |
| N/A   20C    P0    49W / 250W |      0MiB / 22919MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   2  Tesla P40           Off  | 00000000:83:00.0 Off |                    0 |
| N/A   21C    P0    48W / 250W |      0MiB / 22919MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   3  Tesla P40           Off  | 00000000:84:00.0 Off |                    0 |
| N/A   20C    P0    48W / 250W |      0MiB / 22919MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

can anyone explain how to solve this problem? thanks!

@saudet ^ any ideas? It looks like a pretty normal configuration for cuda.

Well, the error message is pretty clear. If libcublas.so.10 isn’t anywhere on your machine, reinstall CUDA.

Thanks @saudet so just a cuda install issue. I guess the redist artifacts are also valid here?

1 Like

reference to this page: https://deeplearning4j.konduit.ai/config/backends/config-cudnn, it works!