Compiling deeplearning4j with -Pcuda does not produce cuda artifacts

chris2 · July 7, 2022, 2:55pm

Hello,

I am working on compiling deeplearning4j for my specific architecture. In the top level directory I am running the command:

mvn clean install -Pcuda -DskipTests -PskipTestResourceEnforcement=true -Djavacpp.platform=linux-x86_64

But the following dependencies that I thought should exist are not produced:

org.nd4j:nd4j-cuda-11.2-platform:jar:chriscustom1.0
org.deeplearning4j:deeplearning4j-cuda-11.2:jar:chriscustom1.0

Other artifacts are produced, but not this one. I am branched off the 1.0.0-M1.1 tag.

What am I doing wrong?

I can successfully produce a -Pcpu build, FYI, and run on the CPU with it.

agibsonccc · July 8, 2022, 2:27am

@chris2 you can just download cuda 11.2 support from maven central now. This was fixed last week. Note that going forward we’ll mainly support cuda 11.6 and 11.7 though.

chris2 · July 8, 2022, 1:05pm

Amazing! Can I get cuda 11.6 support from maven central or your baseline or something? I don’t see it yet: Maven Central Repository Search I’m actually only compiling because I must use 11.6. I was trying wit h 11.2 just to learn how to build it from a simpler test case. @agibsonccc

chris2 · July 8, 2022, 5:07pm

Also, part of why I have had to recompile is that the linux_x86 *.so files in the nd4j native packages didn’t work on my older operating system/architecture. They worked after I recompiled them. I have run into the same problem with your new cuda artifacts. Do you have some instructions floating around for compiling the -Pcuda project?

chris2 · July 8, 2022, 9:26pm

Hello! I have discovered a couple of critical facts that I think have taken me 90% of the way to a solution here. For the record:

I discovered the shell script change-cuda-versions.sh and have updated it for the proper versions of cuda/cudann/etc
You gave me a clue when you said you were all changing the build to build the nd4j backends from the root. When I moved to the nd4j-backends directory and started building there, it started creating the artifacts.

It will be some time before the build is complete, but I think this is going to work.

agibsonccc · July 11, 2022, 8:20am

@chris2 yes that’s all you need to do is change the base versions. Basically cuda version support just consists of changing the versions in the pom and ensuring that libnd4j c++ compilation matches that. Cuda 11.6 is on the way with the next release.

Topic		Replies	Views
Running ND4J against CUDA 11.6 ND4J	8	473	August 23, 2022
Nd4j-cuda-11.2-platform not available for m2 ND4J	9	503	July 4, 2022
Cuda error during run the example DL4J	46	3074	June 2, 2021
Mvn commamd built nd4j-native but not nd4j-cuda, what is the cause? ND4J	5	467	February 27, 2021
Using JavaCPP Presets for CUDA with M2.1 for cuda 11.6 DL4J	19	496	December 18, 2023

Compiling deeplearning4j with -Pcuda does not produce cuda artifacts

Related topics