
Refactor keras.dtype_policies #19711

Merged
merged 9 commits into keras-team:master on May 15, 2024

Conversation

@james77777778 (Contributor) commented May 13, 2024

EDITED:
Please refer to #19711 (comment) for the new updates.


I think it would be beneficial to give QuantizedDTypePolicy some flexibility with respect to the global dtype policy, keras.config.dtype_policy().

Additionally, there is a new property in DTypePolicy, is_quantized, which should be useful for quantization-related methods.
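As a rough illustration of what such a property buys, here is a standalone sketch (not the actual Keras implementation; the class bodies here are simplified stand-ins):

```python
# Illustrative sketch only -- simplified stand-ins, not Keras source.
# The idea: expose `is_quantized` on the policy hierarchy so that
# quantization-related code can branch on a property instead of
# inspecting the policy name string.
class DTypePolicy:
    def __init__(self, name):
        self._name = name

    @property
    def name(self):
        return self._name

    @property
    def is_quantized(self):
        # Base (float) policies are never quantized.
        return False


class QuantizedDTypePolicy(DTypePolicy):
    @property
    def is_quantized(self):
        return True


print(DTypePolicy("float32").is_quantized)                     # False
print(QuantizedDTypePolicy("int8_from_float32").is_quantized)  # True
```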

With this PR, we can do the following:

import keras
from keras import dtype_policies
from keras import layers
from keras import models


@keras.saving.register_keras_serializable("MyPackage")
class MySubclass(layers.Layer):
    def __init__(self, **kwargs):
        dtypes = kwargs.pop("dtypes", {})
        super().__init__(**kwargs)
        self.layer = layers.Dense(8, dtype=dtypes.pop("layer", None))

    def call(self, inputs, training=None):
        return self.layer(inputs)

    def get_config(self):
        config = super().get_config()
        config.pop("dtype")
        if self.layer.dtype_policy.is_quantized:
            _config = dtype_policies.serialize(self.layer.dtype_policy)
            _config["config"]["source_name"] = None
            config.update({"dtypes": {"layer": _config}})
        return config


inputs = layers.Input(shape=[None, 4])
outputs = MySubclass()(inputs)
model = models.Model(inputs, outputs)

"""global dtype policy (float32)"""

model.quantize("int8")
for layer in model._flatten_layers(include_self=False, recursive=True):
    print(layer.name, layer.dtype_policy)
model.save("model.keras")

"""global dtype policy (bfloat16)"""

keras.config.set_dtype_policy("bfloat16")
new_model = models.load_model("model.keras")
for layer in new_model._flatten_layers(include_self=False, recursive=True):
    print(layer.name, layer.dtype_policy)

Outputs:

# During saving (global dtype policy: float32)
input_layer <FloatDTypePolicy "float32">
my_subclass <FloatDTypePolicy "float32">
dense <QuantizedDTypePolicy "int8_from_float32">

# During loading (global dtype policy: bfloat16)
input_layer <FloatDTypePolicy "bfloat16">
my_subclass <FloatDTypePolicy "bfloat16">
dense_1 <QuantizedDTypePolicy "int8_from_bfloat16">

@mattdangerw has pointed out that the dtype policies of quantized saved models are currently immutable with respect to the global dtype policy. keras-team/keras-nlp#1612 (comment)
With this PR, a slight modification in get_config is enough to support that feature.
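The mechanism behind the example above can be sketched in isolation: once the serialized config's source name is cleared, the loader can re-derive the source dtype from the global policy at load time. A minimal standalone sketch (all names here are assumptions for illustration, not the Keras API):

```python
# Illustrative sketch only (not Keras source): how clearing the stored
# source dtype lets a quantized policy follow the global dtype policy
# at deserialization time.
_GLOBAL_DTYPE_POLICY = "float32"


def set_global_policy(name):
    """Stand-in for keras.config.set_dtype_policy (hypothetical)."""
    global _GLOBAL_DTYPE_POLICY
    _GLOBAL_DTYPE_POLICY = name


def resolve_quantized_policy_name(mode, source_name=None):
    # A `None` source falls back to whatever the global policy is
    # when the config is deserialized, e.g. mode "int8" plus a global
    # policy of "bfloat16" resolves to "int8_from_bfloat16".
    source = source_name if source_name is not None else _GLOBAL_DTYPE_POLICY
    return f"{mode}_from_{source}"


print(resolve_quantized_policy_name("int8"))   # int8_from_float32
set_global_policy("bfloat16")
print(resolve_quantized_policy_name("int8"))   # int8_from_bfloat16
```

A stored (non-None) source name pins the policy instead, reproducing the old immutable behavior.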

@codecov-commenter commented May 13, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 78.53%. Comparing base (310c275) to head (ecf2523).
Report is 1 commit behind head on master.

Additional details and impacted files
@@           Coverage Diff           @@
##           master   #19711   +/-   ##
=======================================
  Coverage   78.52%   78.53%           
=======================================
  Files         498      498           
  Lines       45769    45756   -13     
  Branches     8456     8454    -2     
=======================================
- Hits        35942    35936    -6     
+ Misses       8091     8087    -4     
+ Partials     1736     1733    -3     
Flag Coverage Δ
keras 78.38% <100.00%> (+<0.01%) ⬆️
keras-jax 61.95% <100.00%> (+<0.01%) ⬆️
keras-numpy 56.29% <87.93%> (-0.01%) ⬇️
keras-tensorflow 63.41% <100.00%> (-0.01%) ⬇️
keras-torch 61.99% <100.00%> (-0.01%) ⬇️

Flags with carried forward coverage won't be shown.

@fchollet (Member) left a comment

Thanks for the PR!

keras/src/dtype_policies/dtype_policy.py (outdated review thread, resolved)
@@ -202,6 +202,10 @@ def __repr__(self):
return f'<FloatDTypePolicy "{self._name}">'


GLOBAL_DEFAULT_PLACEHOLDER = "global_default"
@fchollet (Member):

Please use a more explicit name, e.g. "DEFAULT_DTYPE_POLICY". Why use this string as the initial value, instead of e.g. None?

@james77777778 (Contributor, author):

Why use this string as the initial value, instead of e.g. None?

Currently, DTypePolicy and its subclasses rely on string values for parsing.
It is not clear to me how we can pass None in combination with the quantization mode.

Should we refactor QuantizedDTypePolicy to support a signature for both the quantization mode and the source dtype policy?

Ex:

policy = QuantizedDTypePolicy(mode="int8", source_dtype_policy="mixed_bfloat16")

@fchollet (Member):

Currently, DTypePolicy and its subclasses rely on string value for parsing.
It is not clear for me how we can pass None in combination with the quantization mode.

We could just modify DTypePolicy to support None, meaning "default".

Should we refactor QuantizedDTypePolicy to support a signature for both the quantization mode and the source dtype policy?

Yes, that's a great idea!

@james77777778 james77777778 changed the title Add flexibility to QuantizedDTypePolicy Refactor keras.dtype_policies May 15, 2024
@james77777778 (Contributor, author) commented May 15, 2024

I've significantly refactored keras.dtype_policies.

Some notes:

  • Replicate all methods from FloatDTypePolicy to DTypePolicy so that FloatDTypePolicy becomes an alias for DTypePolicy. The reason is that the overridden __new__ in DTypePolicy caused numerous issues, and addressing them would introduce unnecessary complexity.
  • Introduce a new signature for QuantizedDTypePolicy and QuantizedFloat8DTypePolicy.
  • Utilize dtype_policies.serialize in get_config of keras.layers.Layer. This is required because we now use different signatures for different dtype policies.
  • Update the tests.

Incompatible change warning:

  • A string like "int8_from_float32" is still accepted by keras.dtype_policies.get, but it can no longer be passed directly to QuantizedDTypePolicy or QuantizedFloat8DTypePolicy.
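For reference, a legacy name like "int8_from_float32" decomposes into a quantization mode and a source policy name. A rough sketch of that parsing (a hypothetical helper for illustration, not the actual keras.dtype_policies.get implementation):

```python
# Hypothetical parsing helper, sketching how a legacy policy name such as
# "int8_from_float32" splits into a quantization mode and a source
# dtype-policy name. Not the actual Keras implementation; the mode list
# is an assumption.
QUANTIZATION_MODES = ("int8", "float8")


def split_legacy_policy_name(name):
    for mode in QUANTIZATION_MODES:
        prefix = f"{mode}_from_"
        if name.startswith(prefix):
            return mode, name[len(prefix):]
    return None, name  # not a quantized-policy name


print(split_legacy_policy_name("int8_from_float32"))  # ('int8', 'float32')
print(split_legacy_policy_name("mixed_bfloat16"))     # (None, 'mixed_bfloat16')
```

Under the new signature, the two parts would instead be passed as separate constructor arguments.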

To add flexibility to quantized dtype policy:

import keras
from keras import dtype_policies
from keras import layers
from keras import models


@keras.saving.register_keras_serializable("MyPackage")
class MySubclass(layers.Layer):
    def __init__(self, **kwargs):
        dtypes = kwargs.pop("dtypes", {})
        super().__init__(**kwargs)
        self.layer = layers.Dense(8, dtype=dtypes.pop("layer", None))

    def call(self, inputs, training=None):
        return self.layer(inputs)

    def get_config(self):
        config = super().get_config()
        config.pop("dtype")
        if self.layer.dtype_policy.is_quantized:
            _config = dtype_policies.serialize(self.layer.dtype_policy)
            _config["config"]["source_name"] = None
            config.update({"dtypes": {"layer": _config}})
        return config


inputs = layers.Input(shape=[None, 4])
outputs = MySubclass()(inputs)
model = models.Model(inputs, outputs)

"""global dtype policy (float32)"""

model.quantize("int8")
for layer in model._flatten_layers(include_self=False, recursive=True):
    print(layer.name, layer.dtype_policy)
model.save("model.keras")

"""global dtype policy (bfloat16)"""

keras.config.set_dtype_policy("bfloat16")
new_model = models.load_model("model.keras")
for layer in new_model._flatten_layers(include_self=False, recursive=True):
    print(layer.name, layer.dtype_policy)

The outputs:

# global dtype policy: float32
input_layer <FloatDTypePolicy "float32">
my_subclass <FloatDTypePolicy "float32">
dense <QuantizedDTypePolicy "int8_from_float32">

# global dtype policy: bfloat16
input_layer <FloatDTypePolicy "bfloat16">
my_subclass <FloatDTypePolicy "bfloat16">
dense_1 <QuantizedDTypePolicy "int8_from_bfloat16">

@gbaned gbaned added this to Assigned Reviewer in PR Queue via automation May 15, 2024
@fchollet (Member) left a comment
Nice work -- it's definitely cleaner this way! LGTM

PR Queue automation moved this from Assigned Reviewer to Approved by Reviewer May 15, 2024
@fchollet fchollet merged commit 3105247 into keras-team:master May 15, 2024
6 checks passed
PR Queue automation moved this from Approved by Reviewer to Merged May 15, 2024
@james77777778 james77777778 deleted the flexible-quantized-dtype branch May 16, 2024 00:33
mloc added a commit to mloc/tensorboard that referenced this pull request May 21, 2024
Keras' output format was slightly changed in keras-team/keras#19711; in
some cases dtypes will now be exported as a config map instead of just a
string.
This fixes test breakages when using ToT keras.
mloc added a commit to mloc/tensorboard that referenced this pull request May 21, 2024
Keras' output format was slightly changed in keras-team/keras#19711; for
non-input layers dtypes will now be exported as a config map instead of just a
string.
This fixes test breakages when using ToT keras.
arcra pushed a commit to tensorflow/tensorboard that referenced this pull request May 21, 2024
Keras' output format was slightly changed in
keras-team/keras#19711; for non-input layers
dtypes will now be exported as a config map instead of just a string.
This fixes test breakages when using ToT keras.

Alternative to #6855