Importing a pytorch model

mdebeer · November 8, 2021, 7:12am

@agibsonccc Hi Adam, how has it gone with adding the onnx ops?

agibsonccc · November 22, 2021, 11:26pm

@mdebeer was checking through forum posts, realized I typed a response and never submitted it. Sorry about that.
I didn’t get the detectron2 op done yet but wanted to take a look at that soon. A few more PRs have been merged that improve coverage quite a bit. I still want to get to the detectron 2 op but I wanted to make sure that we covered standard ops first. We’re getting there though!

mdebeer · January 25, 2022, 8:35am

Hey Adam, it’s been a while! Wishing you a prosperous new year.

We’ve left importing detectron2 to dl4j on the back-burner, but it’s growing into an important use case for migrating our python code base to Scala.

Could you please update me on the latest progress? Will it be possible for us to start testing the onnx model import again?

treo · January 25, 2022, 8:45am

There have been quite a lot of improvements on the import side of things, you can probably try the current snapshots and see if it maybe just runs out of the box

agibsonccc · January 25, 2022, 8:48am

@treo @mdebeer I never got around to this specific op. We did however add new model import infra including a new model zoo with pre converted models. It is much easier to add new ops now. I can give that a shot and see. Thanks for checking in.

saudet · January 25, 2022, 10:19am

BTW, we can use the full C++ API of PyTorch from Java quite easily, you might want to try that instead:

mdebeer · January 25, 2022, 1:09pm

@agibsonccc @treo @saudet Thank you for the prompt feedback!
Glad to hear there’s new improvements + a model zoo to check out. I’ll dive into it throughout the week and will let you know of any findings or questions. Cheers!

agibsonccc · January 25, 2022, 1:14pm

@mdebeer I took another look at this since release QA is about wrapped up. I implemented 2 of the necessary ops. There’s only 1 more to do which I can wrap up tomorrow.

I just had to introduce a few new op overrides and re express the generate bounding box proposals op used by pytorch in samediff. Like I thought all the stuff was already there to do it. I’ll put up a PR this week and let you know.

In the mean time take a look at:

This contains all the models already pre converted. I’ll do the same for the detectron model. Then you can just use the new improvements for finetuing your model. You’ll find gpt2 among other graphs already in there.

mdebeer · January 25, 2022, 1:15pm

That’s brilliant, thank you!

mdebeer · January 28, 2022, 1:51pm

By the way, I tried to pull the latest snapshot again (I presume it’s 1.0.0-SNAPSHOT), and I’m again getting the XML parsing error for the "org.nd4j" % "samediff-import-onnx" package (as described earlier, and fixed by @treo in this post: Importing a pytorch model - #34 by treo).

Again, it’s only for this package, and only for the snapshot. Could you please verify if the previous XML spacing error has crept in again? SBT / coursier presumably has stricter parsing rules than other tools.

agibsonccc · January 28, 2022, 1:59pm

@mdebeer I’ll flag you when a PR is merged that solves this. Otherwise for now don’t worry about it. I found out there were a few more ops that needed to be implemented there. That graph is huge and I missed some ops.

I’m currently in the middle of doing M2. After M2 I’ll just convert the model and publish it on our zoo. The PR I’d say is about 60% done for that.

Unfortunately during release time that also means dealing with long compile times for cuda (usually cpu/windows) which takes 5-6 hours and can sometimes fail for odd reasons (eg: network failures)

Edit: Here’s the branch in case you’re curious. GitHub - eclipse/deeplearning4j at ag_detectron_2

One thing I need to finish implementing was a new masking array function. That will allow me to implement 1 more op. It turns out detectron2 is full of proprietary pytorch ops not just one.

Thanks for flagging though!

mdebeer · February 18, 2022, 8:10am

Hey Adam - how is the progress to M2, and the detectron2 imports?

mdebeer · February 28, 2022, 1:23pm

Now that M2 is released, I’ve tested the ONNX import of a Detectron 2 model as follows (using the following as a guide):

import org.junit.Assert.assertNotNull
import org.nd4j.autodiff.samediff.SameDiff
import org.nd4j.samediff.frameworkimport.onnx.importer.OnnxFrameworkImporter

import java.io.File
import java.util.Collections

object ImportOnnxModel extends App {
  //create the framework importer
  val onnxImport = new OnnxFrameworkImporter()

  val onnxFile: File  = new File("model.onnx")
  val graph: SameDiff = onnxImport.runImport(onnxFile.getAbsolutePath, Collections.emptyMap(), true)

  assertNotNull(graph)
}

As found previously, the above code errors because the AliasWithName op is not defined. Hopefully it’s a trivial operation (seems like a node rename), so is it something that I can work on in a merge request? Please just direct me to relevant implementation.

Here’s a Netron view of the ONNX model beginning:

agibsonccc · February 28, 2022, 1:29pm

@mdebeer sorry I guess I wasn’t clear. Detectron2 didn’t make it in to M2. I still have to finish up the PR for it. Other things took priority but I’m working on the next wave of onnx import now that that’s all done.

mdebeer · February 28, 2022, 1:58pm

Okay – please keep me posted if a snapshot becomes available for testing. Best wishes with the work.

Topic		Replies	Views
What does it take to import CLIP into Java? SameDiff	5	441	June 15, 2023
Import Pytorch model in ONNX format SameDiff	6	298	September 21, 2023
ONNX import fails SameDiff	9	447	October 13, 2021
Importing GRU onnx model failed DL4J	7	573	February 20, 2023
1.0.0-M1 docs up	4	500	July 5, 2021

Importing a pytorch model

Related topics