Finetune onnx model

Booker · August 2, 2023, 5:05pm

Can I retrain some layers in onnx model by SameDiff? Is there an example?

agibsonccc · August 3, 2023, 11:54am

@Booker yes we do. You would need to add an updater and loss function. Most onnx models will only have the feedforward. You can load it as is and just add these things when you’re done. Eg: loss + updater could be:

SDVariable loss = sd.loss.logLoss("loss", label, out);

                //Also set the training configuration:
                sd.setTrainingConfig(TrainingConfig.builder()
                        .updater(new Adam(0.01))
                        .weightDecay(1e-3, true)
                        .dataSetFeatureMapping("in")            //features[0] -> "in" placeholder
                        .dataSetLabelMapping("label")           //labels[0]   -> "label" placeholder
                        .build());

sd would be a samediff instance you imported using the OnnxFrameworkImporter.

Booker · August 4, 2023, 5:36am

Thanks. I should retrain entire model or one layer to get better result for my little data? how to frozen entire model and set specific layer to train?

agibsonccc · August 4, 2023, 8:31am

@Booker there aren’t really “layers” in samediff. It’s just ops. There are just variables. In terms of specific variables, you can just do:

SameDIff sd = …;
sd = sd.freeze(true);

That will freeze all variables.

For leaving certain variables frozen, just set those to variables. You can see the code for that here:

github.com

deeplearning4j/deeplearning4j/blob/5edff97bd65ca836b3f725d14d2fd49a0c975d87/nd4j/nd4j-backends/nd4j-api-parent/nd4j-api/src/main/java/org/nd4j/autodiff/samediff/SameDiff.java#L6725


      
           * Returns either a copy or this instance of the model with frozen variables.
           * A frozen model is not trainable with variables converted to constants.
           * @return
           */
          public SameDiff freeze(boolean inPlace) {
              SameDiff clone = inPlace ? this : dup();
              for(Map.Entry<String,Variable> varEntry : clone.variables.entrySet()) {
                  Variable varMetaData = varEntry.getValue();
                  SDVariable currVar = varMetaData.getVariable();
                  switch(currVar.getVariableType()) {
                      case VARIABLE:
                          currVar.setVariableType(VariableType.CONSTANT);
                          break;
                      case CONSTANT:
                      case ARRAY:
                      case PLACEHOLDER:
                          break;
                  }
              }

By default everything should be constants since onnx imports as feedforward only.

Booker · August 4, 2023, 8:51am

Powerfull! The next release is waiting for a year, when will release it?

agibsonccc · August 4, 2023, 9:31am

I had paid work that held that up for quite a while. Now that that’s done I’m wrapping up the cuda testing now. I’ve been cleaning up technical debt along the way. Don’t worry that’s the main item I’m working on atm.

Booker · August 6, 2023, 1:53am

Unable to resolve attribute for name auto_pad for node Conv for op type Conv
Unable to resolve attribute for name dilations for node MaxPool for op type MaxPool
Skipping input B on node /model.22/dfl/conv/Conv

These console logs exist when I import yolov8x.onnx, is it normal and can be ignore? Thanks.

agibsonccc · August 6, 2023, 3:58am

@Booker can you file an issue and link to the model so I can reproduce this and verify this will work in the current upcoming release?Thanks!

Booker · August 6, 2023, 7:33am

done.

Topic		Replies	Views
RNN with simple dense function SameDiff	15	397	January 23, 2023
Training imported model SameDiff	0	245	March 7, 2022
Using gradient as an intermediate SDVariable SameDiff	11	392	June 14, 2022
Params change in layers despite FrozenLayerWithBackprop DL4J	3	176	December 14, 2022
Variable Length Output Layer for FeedForward Network DL4J	2	467	March 5, 2020

Finetune onnx model

Related topics