Improve intermediate layer extraction explanation #1338

palonso · 2023-05-26T08:51:29Z

TensorToVectorReal converts tensors to 2D arrays by flattening all axis but the last one into the first dimension.
model-specific prediction algorithms (e.g., TensorflowPredictVGGish) use this algorithm to return 2D arrays since they are primarily intended for time-wise predictions or embeddings. However, it is possible to use these algorithms to extract intermediate layers of the models that may have more than two dimensions. In this case, all dimensions but the last one will be flattened. To address this:

TensorToVectorReal throws a warning in case it flattens a dimension.
We added notes explaining this behavior to the algorithms potentially affected.

Note that it is also possible to retrieve intermediate layers with their original shape using TensorflowPredict as discussed here.

dbogdanov

This looks good! I've left a proposal to improve the description of the algorithms' output in the DOC string.

dbogdanov · 2023-05-29T13:34:32Z

src/algorithms/machinelearning/tensorflowpredicteffnetdiscogs.cpp

+  "Note: The output of this algorithm is 2D, which is suitable for extracting embeddings or "
+  "class activations (the output shape is, e.g., [time, number of classes]). If the output "
+  "parameter is set to an intermediate layer with more dimensions, the output will be "
+  "flattened to 2D.\n"


Rephrased version (trying to simplify):

Note: The algorithm outputs a time series of class activations or embedding vectors, with a 2D shape [time, feature vector]. Feature vector values will be flattened if the output parameter is set to extract an intermediate layer with multiple dimensions.

dbogdanov · 2023-05-29T13:36:07Z

src/algorithms/machinelearning/tensorflowpredictfsdsinet.cpp

+  "class activations (the output shape is, e.g., [time, number of classes]). If the output "
+  "parameter is set to an intermediate layer with more dimensions, the output will be "
+  "flattened to 2D.\n"
+  "\n"


Same comments as for TensorflowPredictEffnetDiscogs

dbogdanov · 2023-05-29T13:36:21Z

src/algorithms/machinelearning/tensorflowpredictmusicnn.cpp

+  "Note: The output of this algorithm is 2D, which is suitable for extracting embeddings or "
+  "class activations (the output shape is, e.g., [time, number of classes]). If the output "
+  "parameter is set to an intermediate layer with more dimensions, the output will be "
+  "flattened to 2D.\n"


Same comment as for TensorflowPredictEffnetDiscogs

dbogdanov · 2023-05-29T13:36:31Z

src/algorithms/machinelearning/tensorflowpredictvggish.cpp

+  "Note: The output of this algorithm is 2D, which is suitable for extracting embeddings or "
+  "class activations (the output shape is, e.g., [time, number of classes]). If the output "
+  "parameter is set to an intermediate layer with more dimensions, the output will be "
+  "flattened to 2D.\n"


Same comment as for TensorflowPredictEffnetDiscogs

dbogdanov · 2023-05-29T13:37:22Z

src/algorithms/standard/tensortovectorreal.cpp

@@ -66,6 +68,11 @@ AlgorithmStatus TensorToVectorReal::process() {
    _timeStamps = tensor.dimension(2);
    _featsSize = tensor.dimension(3);

+    if (_channels != 1 && !_warned) {
+        E_WARNING("TensorToVectorReal: The channel axis (dimension 1) of the input tensor has size larger than 1, but the output of this algorithm is 2D. The batch, channel, and time axes (dimensions 0, 1, 2) will be flattened to the first dimension of the output matrix.");


We output a vector of vector of reals, so the "matrix" terminology may be misleading.

palonso added 3 commits May 26, 2023 10:23

Warn if channels>1 when converting tensor to frame

0500f9a

Add note explaining intermediate layer extraction

4af184c

Fix references

3d5cf82

palonso requested a review from dbogdanov May 26, 2023 08:51

dbogdanov requested changes May 29, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve intermediate layer extraction explanation #1338

Improve intermediate layer extraction explanation #1338

palonso commented May 26, 2023

dbogdanov left a comment

dbogdanov May 29, 2023

dbogdanov May 29, 2023

dbogdanov May 29, 2023

dbogdanov May 29, 2023

dbogdanov May 29, 2023

Improve intermediate layer extraction explanation #1338

Are you sure you want to change the base?

Improve intermediate layer extraction explanation #1338

Conversation

palonso commented May 26, 2023

dbogdanov left a comment

Choose a reason for hiding this comment

dbogdanov May 29, 2023

Choose a reason for hiding this comment

dbogdanov May 29, 2023

Choose a reason for hiding this comment

dbogdanov May 29, 2023

Choose a reason for hiding this comment

dbogdanov May 29, 2023

Choose a reason for hiding this comment

dbogdanov May 29, 2023

Choose a reason for hiding this comment