Implementation of mean in categorical #1718
base: main
Conversation
Implementation of the mean method in the Categorical distribution.
Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). View this failed invocation of the CLA check for more information. For the most up to date status, view the checks section at the bottom of the pull request.
Thanks for this; a few comments at the change site.
@@ -333,6 +333,10 @@ def _entropy(self):
        _mul_exp(log_probs, log_probs),
        axis=-1)

  def _mean(self):
    probs = self.probs_parameter()
    return tf.reduce_sum(tf.range(self._num_categories(probs),dtype=probs.dtype) * probs, axis=-1) / tf.reduce_sum(probs, axis=-1)
- we should not divide by the sum of probs here: if they don't sum to one and validate_args is true, this is an error; validate_args is false by default to prevent spending unnecessary compute.
- a more numerically stable implementation would use logits along with tfp.math.reduce_logmeanexp with the weights arg. the current one is ok, but suboptimal.
- some unit tests should be added.
sorry, meant weighted_logsumexp
Should we always expect that the provided probs sum up to 1 for this method, and thus add an assertion that this is the case before computing the mean? Because other methods in the Categorical work without the sum of the probs necessarily being 1 (with validate_args=False).
Also, the previous implementation is definitely possible with logits and the reduce_weighted_logsumexp function:
logits = self.logits_parameter()
return tf.math.exp(reduce_weighted_logsumexp(logits, w=tf.range(self._num_categories(logits), dtype=logits.dtype), axis=-1))
I tested it and it produces the same results, so I can replace my code with this (more stable) implementation. The only issue is that I'm still waiting for approval from a maintainer to run the workflow tests.
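As a hedged, standalone sanity check of the equivalence mentioned above (the tensor values are illustrative, and it assumes the logits are normalized log-probabilities, as logits_parameter() returns when the distribution is built from probs):

import tensorflow as tf
from tensorflow_probability.python.math.generic import reduce_weighted_logsumexp

probs = tf.constant([0.1, 0.6, 0.3])
logits = tf.math.log(probs)          # normalized log-probabilities (assumption)
w = tf.range(3, dtype=probs.dtype)   # category values 0, 1, 2

direct = tf.reduce_sum(w * probs, axis=-1)                                 # 0*0.1 + 1*0.6 + 2*0.3 = 1.2
via_logits = tf.math.exp(reduce_weighted_logsumexp(logits, w=w, axis=-1))  # exp(log(sum_i w_i * p_i)) = 1.2

tf.debugging.assert_near(direct, via_logits)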
Should we always expect that the provided probs sum up to 1 for this method, and thus add an assertion that this is the case before computing the mean? Because other methods in the Categorical work without the sum of the probs necessarily being 1 (with validate_args=False).

There are already such assertions (when validate_args=True) in the execution path of all these methods. Look at _parameter_control_dependencies in this file, as well as most other Distribution subclasses in TFP, to see which ones there are. These are triggered by the base Distribution class when any public API point is invoked (e.g., dist.log_prob, dist.sample, dist.mean, etc.; again, only if validate_args is True 🙂).
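A hedged illustration of that behavior (the values are made up; a tf.Variable is used so the check is deferred to the public method call rather than run at construction):

import tensorflow as tf
import tensorflow_probability as tfp

probs = tf.Variable([0.2, 0.3, 0.1])  # deliberately does not sum to 1
dist = tfp.distributions.Categorical(probs=probs, validate_args=True)
dist.mean()  # raises InvalidArgumentError (mean() assumes this PR); dist.log_prob(0) or dist.sample() trigger the same check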
The computation of the mean is done with logits (instead of probs) to make the implementation more stable numerically.
@@ -30,6 +30,7 @@
from tensorflow_probability.python.internal import samplers
from tensorflow_probability.python.internal import tensor_util
from tensorflow_probability.python.internal import tensorshape_util
from tensorflow_probability.python.math import reduce_weighted_logsumexp
Please import as
from tensorflow_probability.python.math.generic import reduce_weighted_logsumexp
and add this to the "categorical" deps list in the adjacent BUILD file:
"//tensorflow_probability/python/math:generic"
@@ -333,6 +334,13 @@ def _entropy(self):
        _mul_exp(log_probs, log_probs),
        axis=-1)

  def _mean(self):
    #probs = self.probs_parameter()
new impl looks good, thanks! please
- remove the commented out lines
- add a test in categorical_test.py. you can look at other categorical tests and maybe normal_test.py for a hint of how these should look. think of edge cases, like some zero prob categories, etc.
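Picking up the last point, a hedged sketch (not part of the PR) of what an edge-case test with a zero-probability category might look like in categorical_test.py; the tfd alias and test method name are assumptions:

def testMeanWithZeroProbCategory(self):
  # Hypothetical test: the middle category has zero probability,
  # so E[X] = 0*0.5 + 1*0.0 + 2*0.5 = 1.0.
  dist = tfd.Categorical(probs=[0.5, 0.0, 0.5])
  self.assertAllClose(1.0, self.evaluate(dist.mean()))

A zero probability becomes a -inf logit, which is exactly the kind of corner the logit-based implementation needs to handle gracefully.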
    #return tf.reduce_sum(tf.range(self._num_categories(probs),dtype=probs.dtype) * probs, axis=-1) / tf.reduce_sum(probs, axis=-1)
    # Implement with logits to improve numerical stability
    logits = self.logits_parameter()
    return tf.math.exp(reduce_weighted_logsumexp(logits,w=tf.range(self._num_categories(logits),dtype=logits.dtype),axis=-1))
please format as
return tf.math.exp(
reduce_weighted_logsumexp(
logits,
w=tf.range(self._num_categories(logits), dtype=logits.dtype),
axis=-1))
Thanks for these, my last commit now includes all the requested changes.
Removed commented lines and reformatted.
self.assertAllEqual((1,), dist.mean().shape)
# Expected mean will be the same as in a Multinomial with n = 1
expected_means = stats.multinomial.mean(n=1, p=p).argmax(axis=-1)
self.assertAllClose(expected_means, self.evaluate(binom.mean()))
change binom to categorical :)
Sorry, corrected it...
Defined a _mean method for implementing the mean of the Categorical distribution, following a previous PR (#1411).
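For reference, a minimal usage sketch once the _mean above is in place (the probabilities are illustrative):

import tensorflow_probability as tfp

dist = tfp.distributions.Categorical(probs=[0.1, 0.6, 0.3])
dist.mean()  # 0*0.1 + 1*0.6 + 2*0.3 = 1.2, assuming this PR's _mean implementation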