
float16 quantization runs out of memory for LSTM model #1091

Open
Black3rror opened this issue Aug 30, 2023 · 3 comments
Labels: bug (Something isn't working)

Comments

@Black3rror

No matter the size of the LSTM model, converting it with float16 optimization runs out of memory.

Code to reproduce the issue
The following snippet reproduces the issue on Google Colab:

import numpy as np
import tensorflow as tf
import tensorflow_model_optimization as tfmot

def create_model():
  model = tf.keras.models.Sequential()

  # For the model to be convertible later, batch_size and sequence_length must be fixed.
  # E.g., batch_input_shape=[None, 1] will throw an error.
  # This limitation applies only to RNNs; for FC or CNN layers, batch_size can be None.
  model.add(tf.keras.layers.Embedding(
    input_dim=5,
    output_dim=1,
    batch_input_shape=[1, 1]
  ))

  model.add(tf.keras.layers.LSTM(
    units=1,
    return_sequences=False,
    stateful=False
  ))

  model.add(tf.keras.layers.Dense(5))

  return model

model = create_model()
model.summary()

model.save("/content/model/")

representative_data = np.random.randint(0, 5, (200, 1)).astype(np.float32)

def representative_dataset():
  for sample in representative_data:
    sample = np.expand_dims(sample, axis=0)     # batch_size = 1
    yield [sample]                              # set sample as first (and only) input of the model
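
# Note: this representative_dataset is defined but never attached to the converter
# below. Float16 post-training quantization does not require one; a representative
# dataset is only needed for full-integer quantization calibration.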

# float16 quantization
converter = tf.lite.TFLiteConverter.from_saved_model("/content/model/")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_types = [tf.float16]
# kernel runs out of memory and crashes in the following line
tflite_quant_model = converter.convert()
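
For comparison, the following sketch runs the same conversion with dynamic-range quantization only (no float16 target spec); variable names are illustrative. This can help isolate whether the crash is specific to the float16 path.

# Dynamic-range quantization only: same saved model, no supported_types override
converter_dr = tf.lite.TFLiteConverter.from_saved_model("/content/model/")
converter_dr.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_dr_model = converter_dr.convert()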
Black3rror added the bug label on Aug 30, 2023

cdh4696 commented Aug 31, 2023

@yyoon Could you please check? Thanks!


malloyca commented Sep 14, 2023

I have also encountered this problem using TensorFlow 2.12.1 on my system. Non-optimized conversion works fine with LSTM, but float16 optimization is causing my kernel to crash repeatedly.
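
For reference, a minimal sketch of the non-optimized conversion path described above, assuming the same saved model directory as in the original snippet:

import tensorflow as tf

# Plain float32 conversion, no optimizations set
converter = tf.lite.TFLiteConverter.from_saved_model("/content/model/")
tflite_model = converter.convert()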

@barrypitman

Same problem here.
