
[DRAFT] Support add training #12417

Draft · Aeren1564 wants to merge 1 commit into master from draft_add
Conversation

Aeren1564 (Contributor)

Draft PR for supporting Add training

@Aeren1564 mentioned this pull request on Jan 8, 2024
@Aeren1564 force-pushed the draft_add branch 6 times, most recently from 12ccdba to 14612d8 on January 11, 2024 13:22
@Aeren1564 force-pushed the draft_add branch 7 times, most recently from 978e12a to 3dc789c on January 17, 2024 05:17
@Aeren1564 force-pushed the draft_add branch 2 times, most recently from a8a00ea to 1a16092 on January 24, 2024 01:12
Comment on lines 50 to 81
case ArithmeticType::kSub:
case ArithmeticType::kMul:
case ArithmeticType::kDiv:
Contributor

Are other operations always required to broadcast?

Contributor Author

I was planning to work with broadcast data (which, to my understanding, is simple copying) for the other OPs.
Is there a better alternative?

Contributor

Well, if there were a way to calculate both the broadcast and the gradient at the same time, that would be great. But it's not related to this PR, so let's think about it later. :)
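
For context, the standard backward pass for a broadcast is a reduce-sum of the incoming gradient over the broadcast axes; a minimal NumPy sketch of that relationship (illustration only, not this PR's kernels):

    import numpy as np

    def unbroadcast(grad, shape):
        # Broadcasting copies values in the forward pass, so the backward
        # pass sums the gradient over every axis that was added or expanded.
        while grad.ndim > len(shape):
            grad = grad.sum(axis=0)  # axes prepended by broadcasting
        for axis, dim in enumerate(shape):
            if dim == 1 and grad.shape[axis] != 1:
                grad = grad.sum(axis=axis, keepdims=True)  # size-1 axes that expanded
        return grad

    # Add: out = lhs + rhs, so each operand's gradient is the incoming
    # gradient reduced back to that operand's shape.
    lhs = np.random.rand(20, 10).astype(np.float32)
    rhs = np.random.rand(10).astype(np.float32)   # broadcast over the batch axis
    grad_out = np.ones((20, 10), dtype=np.float32)
    lhs_grad = unbroadcast(grad_out, lhs.shape)   # shape (20, 10)
    rhs_grad = unbroadcast(grad_out, rhs.shape)   # shape (10,), summed over batch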

@Aeren1564 force-pushed the draft_add branch 4 times, most recently from 8ecfd6b to ca069c0 on January 26, 2024 03:40
@Aeren1564 (Contributor Author) commented Jan 26, 2024

Model with Subtract

    import tensorflow as tf

    # Two Dense branches combined by a Subtract layer.
    input_lhs = tf.keras.layers.Input(shape=(10,))
    input_rhs = tf.keras.layers.Input(shape=(10,))
    lhs = tf.keras.layers.Dense(10)(input_lhs)
    rhs = tf.keras.layers.Dense(10)(input_rhs)
    res_sub = tf.keras.layers.Subtract()([lhs, rhs])
    output = tf.keras.layers.Dense(10)(res_sub)
    model = tf.keras.models.Model(inputs=[input_lhs, input_rhs], outputs=output, name="subtract_training")

Data

    import numpy as np

    # Random inputs and a linear target built from fixed coefficient matrices.
    np.random.seed(123)
    data_lhs = np.random.rand(3000, 10).astype(np.float32) * 100
    data_rhs = np.random.rand(3000, 10).astype(np.float32) * 100
    coef_lhs, coef_rhs = np.random.rand(10, 10).astype(np.float32), np.random.rand(10, 10).astype(np.float32)
    data_res = np.array([(np.matmul(coef_lhs, x[0]) + np.matmul(coef_rhs, x[1])) for x in zip(data_lhs, data_rhs)], dtype=np.float32)
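
The compile/fit calls aren't shown here; a sketch of settings consistent with the logged parameters below (Adam, learning rate 0.001, MSE loss, batch size 20, 5 epochs, so 3000 / 20 = 150 steps per epoch):

    # Assumed from the logs below, not shown in the original comment.
    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
        loss="mse",
        metrics=["mae"],
    )
    model.fit([data_lhs, data_rhs], data_res, batch_size=20, epochs=5)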

Tensorflow

Epoch 1/5
150/150 [==============================] - 0s 578us/step - loss: 647.6946 - mae: 20.2884
Epoch 2/5
150/150 [==============================] - 0s 585us/step - loss: 484.7763 - mae: 17.5722
Epoch 3/5
150/150 [==============================] - 0s 581us/step - loss: 382.2013 - mae: 15.5862
Epoch 4/5
150/150 [==============================] - 0s 592us/step - loss: 305.6406 - mae: 13.9046
Epoch 5/5
150/150 [==============================] - 0s 629us/step - loss: 241.3343 - mae: 12.2882

ONERT-Train

/home/aeren/Repos/ONE/Product/x86_64-linux.debug/out/bin/onert_train --modelfile /home/aeren/Repos/Scripts/_Product/circle+/result/model20240126_1244/model.circle --load_input:raw /home/aeren/Repos/Scripts/_Product/circle+/data/input.bin --load_expected:raw /home/aeren/Repos/Scripts/_Product/circle+/data/res.bin --epoch 5 --batch_size 20 --learning_rate 0.001 --loss 1 --loss_reduction_type 1 --optimizer 2 
Model Expected Filename /home/aeren/Repos/Scripts/_Product/circle+/data/res.bin
Model Input Filename /home/aeren/Repos/Scripts/_Product/circle+/data/input.bin
Model Filename /home/aeren/Repos/Scripts/_Product/circle+/result/model20240126_1244/model.circle
== training parameter ==
- learning_rate   = 0.001
- batch_size      = 20
- loss_info       = {loss = mean squared error, reduction = sum over batch size}
- optimizer       = adam
========================
Epoch 1/5 - time: 0.327ms/step - loss: [0] 647.6941
Epoch 2/5 - time: 0.318ms/step - loss: [0] 484.7753
Epoch 3/5 - time: 0.302ms/step - loss: [0] 382.2000
Epoch 4/5 - time: 0.317ms/step - loss: [0] 305.6393
Epoch 5/5 - time: 0.303ms/step - loss: [0] 241.3329

@Aeren1564 (Contributor Author)

Model with Multiply

    import tensorflow as tf

    # Two Dense branches combined by a Multiply layer.
    input_lhs = tf.keras.layers.Input(shape=(10,))
    input_rhs = tf.keras.layers.Input(shape=(10,))
    lhs = tf.keras.layers.Dense(10)(input_lhs)
    rhs = tf.keras.layers.Dense(10)(input_rhs)
    res_mul = tf.keras.layers.Multiply()([lhs, rhs])
    output = tf.keras.layers.Dense(10)(res_mul)
    model = tf.keras.models.Model(inputs=[input_lhs, input_rhs], outputs=output, name="multiply_training")
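
For reference, the backward pass that elementwise Multiply needs: each operand's gradient is the incoming gradient scaled by the other operand. A minimal NumPy sketch (illustration only, not the PR's kernel):

    import numpy as np

    # out = lhs * rhs  =>  d(out)/d(lhs) = rhs, d(out)/d(rhs) = lhs,
    # each multiplied elementwise by the incoming gradient.
    lhs = np.random.rand(20, 10).astype(np.float32)
    rhs = np.random.rand(20, 10).astype(np.float32)
    grad_out = np.random.rand(20, 10).astype(np.float32)

    lhs_grad = grad_out * rhs
    rhs_grad = grad_out * lhs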

Data

    import numpy as np

    # Same data recipe as the Subtract experiment above.
    np.random.seed(123)
    data_lhs = np.random.rand(3000, 10).astype(np.float32) * 100
    data_rhs = np.random.rand(3000, 10).astype(np.float32) * 100
    coef_lhs, coef_rhs = np.random.rand(10, 10).astype(np.float32), np.random.rand(10, 10).astype(np.float32)
    data_res = np.array([(np.matmul(coef_lhs, x[0]) + np.matmul(coef_rhs, x[1])) for x in zip(data_lhs, data_rhs)], dtype=np.float32)

Tensorflow

Epoch 1/5
150/150 [==============================] - 0s 688us/step - loss: 6594.2104 - mae: 64.8041
Epoch 2/5
150/150 [==============================] - 0s 676us/step - loss: 5605.8013 - mae: 59.9134
Epoch 3/5
150/150 [==============================] - 0s 573us/step - loss: 5306.5811 - mae: 58.3457
Epoch 4/5
150/150 [==============================] - 0s 548us/step - loss: 5146.8296 - mae: 57.5273
Epoch 5/5
150/150 [==============================] - 0s 547us/step - loss: 5031.8623 - mae: 56.9231

ONERT-Train

/home/aeren/Repos/ONE/Product/x86_64-linux.debug/out/bin/onert_train --modelfile /home/aeren/Repos/Scripts/_Product/circle+/result/model20240126_1329/model.circle --load_input:raw /home/aeren/Repos/Scripts/_Product/circle+/data/input.bin --load_expected:raw /home/aeren/Repos/Scripts/_Product/circle+/data/res.bin --epoch 5 --batch_size 20 --learning_rate 0.001 --loss 1 --loss_reduction_type 1 --optimizer 2 
Model Expected Filename /home/aeren/Repos/Scripts/_Product/circle+/data/res.bin
Model Input Filename /home/aeren/Repos/Scripts/_Product/circle+/data/input.bin
Model Filename /home/aeren/Repos/Scripts/_Product/circle+/result/model20240126_1329/model.circle
== training parameter ==
- learning_rate   = 0.001
- batch_size      = 20
- loss_info       = {loss = mean squared error, reduction = sum over batch size}
- optimizer       = adam
========================
Epoch 1/5 - time: 0.300ms/step - loss: [0] 6594.2061
Epoch 2/5 - time: 0.288ms/step - loss: [0] 5605.7979
Epoch 3/5 - time: 0.289ms/step - loss: [0] 5306.5791
Epoch 4/5 - time: 0.291ms/step - loss: [0] 5146.8281
Epoch 5/5 - time: 0.291ms/step - loss: [0] 5031.8633

@Aeren1564 force-pushed the draft_add branch 4 times, most recently from 3225baf to 4606d2c on January 26, 2024 04:46
Comment on lines 85 to 86
lhs_grad_map = in_map.array() / rhs_map.array();
rhs_grad_map = in_map.array() * -lhs_map.array() / rhs_map.array() / rhs_map.array();
@Aeren1564 (Contributor Author) commented Jan 26, 2024

I'm seeing weird outputs :/
Is there something wrong with the following?

$L$: LHS
$R$: RHS
$O$: Output (of elementwise division)
$X$: Output (of entire model)

$L / R = O$

$\frac{\partial X}{\partial L} = \frac{\partial O}{\partial L} \cdot \frac{\partial X}{\partial O} = \frac{1}{R} \cdot \frac{\partial X}{\partial O}$
$\frac{\partial X}{\partial R} = \frac{\partial O}{\partial R} \cdot \frac{\partial X}{\partial O} = -\frac{L}{R^2} \cdot \frac{\partial X}{\partial O}$
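
The formulas themselves can be sanity-checked against central finite differences; a minimal NumPy sketch (illustration only):

    import numpy as np

    np.random.seed(0)
    L = np.random.rand(4, 5) + 1.0   # keep values away from zero
    R = np.random.rand(4, 5) + 1.0
    G = np.random.rand(4, 5)         # incoming dX/dO

    # Analytic gradients from the formulas above.
    lhs_grad = G / R
    rhs_grad = -G * L / (R * R)

    # Treat X = sum(G * (L / R)) and differentiate numerically.
    def X(L_, R_):
        return np.sum(G * (L_ / R_))

    eps = 1e-6
    num_lhs = np.zeros_like(L)
    num_rhs = np.zeros_like(R)
    for idx in np.ndindex(L.shape):
        dL = np.zeros_like(L); dL[idx] = eps
        dR = np.zeros_like(R); dR[idx] = eps
        num_lhs[idx] = (X(L + dL, R) - X(L - dL, R)) / (2 * eps)
        num_rhs[idx] = (X(L, R + dR) - X(L, R - dR)) / (2 * eps)

    print(np.max(np.abs(num_lhs - lhs_grad)))  # expect ~1e-9 or smaller
    print(np.max(np.abs(num_rhs - rhs_grad)))  # expect ~1e-9 or smaller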

@Aeren1564 (Contributor Author)

@nnfw-bot test tizen-gbs

@Aeren1564 (Contributor Author)

@nnfw-bot test onert-cross-debug

@Aeren1564 (Contributor Author)

@nnfw-bot test onert-cross-release

@Aeren1564 (Contributor Author)

TODO: try other optimizers for division


Signed-off-by: YongHyun An <yonghyunz.an@samsung.com>
@Aeren1564 (Contributor Author)

I've tried the following optimizers for division, but all of them showed loss values differing from TensorFlow's :/

SGD, RMSProp, Adam, Adadelta
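
One way to rule the optimizer out is to compare the analytic Div gradients against tf.GradientTape on a single division, independent of any training loop; a minimal sketch:

    import numpy as np
    import tensorflow as tf

    np.random.seed(0)
    L = tf.constant(np.random.rand(4, 5).astype(np.float32) + 1.0)
    R = tf.constant(np.random.rand(4, 5).astype(np.float32) + 1.0)
    G = tf.constant(np.random.rand(4, 5).astype(np.float32))

    with tf.GradientTape() as tape:
        tape.watch([L, R])
        O = L / R
        X = tf.reduce_sum(G * O)  # stand-in for the rest of the model

    tf_lhs_grad, tf_rhs_grad = tape.gradient(X, [L, R])
    print(np.max(np.abs(tf_lhs_grad - G / R)))            # expect ~0
    print(np.max(np.abs(tf_rhs_grad + G * L / (R * R))))  # expect ~0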

@Aeren1564 (Contributor Author)

@nnfw-bot test onert-cross-debug

@Aeren1564 (Contributor Author)

@nnfw-bot test onert-cross-release
