
[onert] Introduce fine tuning feature for training #12951

Draft · wants to merge 12 commits into base: master
Conversation

@mbencer (Contributor) commented on Apr 30, 2024:

This PR adds a fine-tuning feature to onert. New methods were added to the nnfw API, and the onert_train tool was also extended to support this feature via the --frozen_ops_idx argument.
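For illustration, here is a minimal sketch of how the --frozen_ops_idx value could be turned into operation indices on the tool side. The comma-separated format and the parseFrozenOpsIdx helper are assumptions for this sketch, not the actual onert_train implementation:

// Illustrative only: assumes --frozen_ops_idx is given as a comma-separated
// list of operation indices, e.g. "0,1,5".
#include <cstdint>
#include <sstream>
#include <string>
#include <vector>

std::vector<uint32_t> parseFrozenOpsIdx(const std::string &arg)
{
  std::vector<uint32_t> indices;
  std::stringstream ss{arg};
  std::string token;
  while (std::getline(ss, token, ','))
  {
    if (!token.empty())
      indices.push_back(static_cast<uint32_t>(std::stoul(token)));
  }
  return indices;
}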

TEST ENVIRONMENT:

The test methodology is described here and the results are collected in the tests report.

Issue: #12386

@@ -1695,7 +1742,7 @@ NNFW_STATUS nnfw_session::train_export_circle(const char *path)

   auto subg = subgs->Get(0); // Get 1st subgraph
   if (!idx.valid() || idx.value() >= subg->tensors()->size())
-    throw std::runtime_error("Trainable tensor index is out of range");
+    return;
@mbencer (Contributor, Author) commented:
It's just a workaround for a problem reproduced even on master, to enable exporting a trained circle to a file (used in my test scripts to compare weights before and after training).
My assumption is that the trained graph can have more tensors because of the tensors consumed by a loss function, but I'm not sure how it worked before.

BTW:

auto model = ::circle::GetModel(mmapfile.buf());
if (!model)
  throw std::runtime_error("Failed to get model from circle");
auto subgs = model->subgraphs();
if (!subgs || subgs->size() != 1)
  throw std::runtime_error("Circle does not has valid subgraph or has multiple subgraphs");

auto subg = subgs->Get(0); // Get 1st subgraph

can be moved outside iterateTrainableTensors, but I'll try to deal with it in a separate PR.
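A rough sketch of what hoisting that block out of iterateTrainableTensors could look like; the helper name and exact signature are assumptions:

// Hypothetical helper; names are illustrative only.
// Assumes the generated circle schema header and <stdexcept> are already included.
const ::circle::SubGraph *getSingleSubgraph(const void *buf)
{
  auto model = ::circle::GetModel(buf);
  if (!model)
    throw std::runtime_error("Failed to get model from circle");
  auto subgs = model->subgraphs();
  if (!subgs || subgs->size() != 1)
    throw std::runtime_error("Circle does not have a valid subgraph or has multiple subgraphs");
  return subgs->Get(0); // Get 1st subgraph
}

The callback-based iteration itself would stay unchanged; only the model/subgraph lookup would move to the caller.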

}

const auto ir_op_index = onert::ir::OperationIndex{op_index};
auto &options = _coptions[0];
@jyoungyun (Contributor) commented on May 8, 2024:

How about using TrainingInfo instead of _coptions?
@mbencer (Contributor, Author) replied:

@jyoungyun I like this idea, introduced ;)
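For context, a purely hypothetical sketch of what carrying per-operation trainability in a TrainingInfo-style structure could look like; the member and method names are invented for illustration and are not the actual onert API:

#include <cstdint>
#include <set>
#include <utility>

// Hypothetical structure, not the real TrainingInfo class in onert.
class TrainingInfoSketch
{
public:
  void setTrainableOps(std::set<uint32_t> indices) { _trainable_ops = std::move(indices); }
  const std::set<uint32_t> &trainableOps() const { return _trainable_ops; }

private:
  std::set<uint32_t> _trainable_ops; // empty could mean "train all operations"
};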

This commit extends the TrainableOperation API with methods to disable/enable the weights update of particular trainable operations.
The trainable operations also store status about their contribution to the backward propagation phase.

ONE-DCO-1.0-Signed-off-by: Mateusz Bencer <m.bencer@partner.samsung.com>
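Conceptually, the extension described in the commit message could look roughly like the sketch below; the actual method and member names in onert may differ:

// Sketch of the described extension; not the verbatim onert interface.
class ITrainableOperationSketch
{
public:
  virtual ~ITrainableOperationSketch() = default;

  void enableWeightsUpdate() { _weights_update_enabled = true; }
  void disableWeightsUpdate() { _weights_update_enabled = false; }
  bool isWeightsUpdateEnabled() const { return _weights_update_enabled; }

  // Whether this operation still takes part in the backward propagation phase
  // (a frozen op may skip its weight gradients but still propagate gradients back).
  void enableBackward() { _required_for_backward = true; }
  void disableBackward() { _required_for_backward = false; }
  bool isRequiredForBackward() const { return _required_for_backward; }

private:
  bool _weights_update_enabled = true;
  bool _required_for_backward = true;
};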