Add method to MovingWindow that waits for a number of samples #387

matthias-wende-frequenz · 2023-05-15T11:58:34Z

Up to now there was no clean way to wait until the MovingWindow got updated with a certain number of samples.
In this PR we introduce a wait_for_samples method that finishes once the number of samples arrived.

matthias-wende-frequenz · 2023-05-15T14:44:06Z

FYI @idlir-shkurti-frequenz

src/frequenz/sdk/timeseries/_moving_window.py

leandro-lucarella-frequenz

I also feel something smells here, I don't like having some class variable to only track a counter that is used by one method. My instinct says that there should be a way to split this functionality to a different class that uses, can be plugged to or proxies to the MovingWindow, but I don't have anything in particular to propose right. At least nothing that's not a lot of extra work (and probably worsen the usability of this too), so I'm fine to leave it as is, but just mentioning it in case anyone suddenly gets a brilliant idea 💡

leandro-lucarella-frequenz · 2023-05-16T07:22:18Z

src/frequenz/sdk/timeseries/_moving_window.py

+        self.count_samples = 0
+        """The number of samples that have been received."""


I think this one should probably be a read-only public attribute. Also for attributes it's better to use nouns instead of verbs (and it's missing the type), for example:

Suggested change

self.count_samples = 0

"""The number of samples that have been received."""

self._received_samples_count: int = 0

"""The number of samples that have been received."""

Plus

@property def received_samples_count(self) -> int return self._received_samples_count

leandro-lucarella-frequenz · 2023-05-16T07:27:37Z

src/frequenz/sdk/timeseries/_moving_window.py

+        self.wait_for_num_samples = 0
+        """The number of samples to wait for before
+        the wait_for_samples method triggers."""
+        self.wait_for_samples_event = asyncio.Event()


These two really look like they should be private. Same about names (and missing types):

Suggested change

self.wait_for_num_samples = 0

"""The number of samples to wait for before

the wait_for_samples method triggers."""

self.wait_for_samples_event = asyncio.Event()

self._expected_samples_count: int = 0

"""The number of samples to wait for before `wait_for_samples()` triggers."""

self._wait_for_samples_event: asyncio.Event = asyncio.Event()

"""The event to signal `wait_for_samples()` that the wait is over."""

src/frequenz/sdk/timeseries/_moving_window.py

leandro-lucarella-frequenz · 2023-05-16T07:38:50Z

Oh, I also noticed in your last PRs that it seems you have your editor configured to a maximum line length of maybe 50 chars? Ideally it should be 88 to match black's config.

src/frequenz/sdk/timeseries/_moving_window.py

tests/timeseries/test_moving_window.py

Marenz · 2023-05-16T08:13:03Z

src/frequenz/sdk/timeseries/_moving_window.py

+            return
+
+        self.wait_for_num_samples = num_samples
+        await self.wait_for_samples_event.wait()


If you never call self.wait_for_samples_event.clear() it will always fire immediately in .wait()

src/frequenz/sdk/timeseries/_moving_window.py

Up to now there was no clean way to wait until the MovingWindow got updated with a certain number of samples. In this commit we introduce a `wait_for_samples` method that finishes once the number of samples arrived. Signed-off-by: Matthias Wende <matthias.wende@frequenz.com>

matthias-wende-frequenz · 2023-05-17T15:56:01Z

I've reworked the implementation. Now the user can get a channel that sends a None when the a certain number of samples arrived. That has the advantage that we can plug it into our channels select mechanism.

leandro-lucarella-frequenz

Besides a few comments on improving consistency, the API and the tests, the commit message needs to be updated because it mentions a wait_for_samples method.

leandro-lucarella-frequenz · 2023-05-22T07:32:39Z

src/frequenz/sdk/timeseries/_moving_window.py

+        self._wait_for_num_samples: int = 0
+        """The number of samples to wait for before the wait_for_num_samples channels
+        sends out an event."""


I recommend using nouns for non-boolean attributes:

Suggested change

self._wait_for_num_samples: int = 0

"""The number of samples to wait for before the wait_for_num_samples channels

sends out an event."""

self._num_samples_to_wait_for: int = 0

"""The number of samples to wait for before triggering an event through the channel."""

leandro-lucarella-frequenz · 2023-05-22T07:34:34Z

src/frequenz/sdk/timeseries/_moving_window.py

+        self._wait_for_samples_channel = Broadcast[None](
+            "Wait for number of samples channel."
+        )


So the text here is for debugging purposes only, right @shsms?

I'd suggest using a shorter string for that and using the long string as documentation for the variable:

Suggested change

self._wait_for_samples_channel = Broadcast[None](

"Wait for number of samples channel."

)

self._wait_for_samples_channel = Broadcast[None]("wait-for-samples")

"""Channel to send events to when wait for number of samples is triggered."""

leandro-lucarella-frequenz · 2023-05-22T07:36:54Z

src/frequenz/sdk/timeseries/_moving_window.py

@@ -169,6 +176,9 @@ async def _run_impl(self) -> None:
        Raises:
            asyncio.CancelledError: if the MovingWindow task is cancelled.
        """
+        received_samples_count = 0
+        wait_for_samples_sender = self._wait_for_samples_channel.new_sender()


I guess once a MW was stopped there is no way to start it again, right? Otherwise the sender should probably be created in the constructor instead to avoid leaking sender objects.

leandro-lucarella-frequenz · 2023-05-22T07:39:20Z

src/frequenz/sdk/timeseries/_moving_window.py

+                        received_samples_count = 0
+                        await wait_for_samples_sender.send(None)


Since it's free and even when the user should know it beforehand anyway, we could also send the number of samples received instead of None, maybe it could become a handy shortcut.

Suggested change

received_samples_count = 0

await wait_for_samples_sender.send(None)

await wait_for_samples_sender.send(received_samples_count)

received_samples_count = 0

leandro-lucarella-frequenz · 2023-05-22T07:47:32Z

src/frequenz/sdk/timeseries/_moving_window.py

+    def set_sample_counter(self, num_samples: int) -> None:
+        """Set the number of samples to wait for until the sample counter triggers.
+
+        Args:
+            num_samples: The number of samples to wait for.
+
+        Raises:
+            ValueError: if the number of samples is less than or equal to zero.
+        """
+        if num_samples <= 0:
+            raise ValueError(
+                "The number of samples to wait for should be greater than zero."
+            )
+        self._wait_for_num_samples = num_samples


If I understand correctly, this indirectly enables sending the events if num_samples > 0, right? If so I would rename this method to something that makes it easier to realize that happens. What about:

@property def is_wait_for_samples_event_enabled(self) -> bool: # Returns `self._wait_for_num_samples != 0` def enable_wait_for_samples_event(self, num_samples: int) -> bool: # Same as above but check for `num_samples > 0`, return whether it was enabled before def disable_wait_for_samples_event(self) -> bool: # Set `num_samples = 0`, return whether it was enabled before

leandro-lucarella-frequenz · 2023-05-22T07:49:54Z

src/frequenz/sdk/timeseries/_moving_window.py

+            )
+        self._wait_for_num_samples = num_samples
+
+    def new_sample_count_receiver(self) -> Receiver[None]:


If the above suggestion is applied, I would rename this for consistency:

Suggested change

def new_sample_count_receiver(self) -> Receiver[None]:

def new_wait_for_samples_event_receiver(self) -> Receiver[None]:

We could go with something different than wait_for_samples_event as it is a bit long, but the 4 methods should use the same term IMHO.

leandro-lucarella-frequenz · 2023-05-22T07:51:42Z

tests/timeseries/test_moving_window.py

+
+    window.set_sample_counter(samples_to_wait_for)
+
+    # asyncio.create_task(push_data_delayed())


Suggested change

# asyncio.create_task(push_data_delayed())

?

leandro-lucarella-frequenz · 2023-05-22T07:55:26Z

tests/timeseries/test_moving_window.py

+    for i in range(0, samples_to_wait_for):
+        await sender.send(
+            Sample(datetime.now(tz=timezone.utc) + timedelta(seconds=i), 1.0)
+        )
+    await sample_count_recv.receive()


There are some failures that this will not detect or detect wrongly. For example if you trigger before the samples_to_wait_for this will succeed too. If the event doesn't trigger, this test will hang forever, which might be quite annoying.

Maybe could check for the internal counter to verify it is incremented? If you also add my suggestion to return the number of samples received you could add the check here too.

Maybe you can also move the await sample_count_recv.receive() to a different task so you can check it didn't fire before you sent enough samples and that it was fired after you sent all the expected samples but without blocking the main thread if it fails?

Maybe you could also push different sample values and then check that the state of the moving window is what it would be expected after receiving all that samples, so if the even triggered before that check should also fail.

cwasicki · 2023-08-22T17:50:57Z

Will this not be part of v1 milestone?

matthias-wende-frequenz · 2023-08-23T11:19:03Z

Will this not be part of v1 milestone?

Let's see. This change is not breaking so it can be v1.1.x too. But let's try to get it in.

matthias-wende-frequenz · 2024-06-05T14:43:43Z

This is quite old. It we need it we can reopen the PR or start from scratch.

cwasicki · 2024-06-05T20:50:29Z

Is there another way to know if the moving window was updated?

matthias-wende-frequenz · 2024-06-06T11:34:06Z

Is there another way to know if the moving window was updated?

No there isn't.
This pr was closed, not because we don't want to support it, but rather since nobody is working nor anyone was asking for this feature. So as it appears there is no need for this feature.

shsms · 2024-06-06T12:22:03Z

It turns out this is required in the Forecast actor. Only yesterday evening, Christoph showed me the place where he's using a while loop that could be replaced with this, just a little bit after we were talking about closing this issue. But I think not top priority, so we can pick this up again soon.

matthias-wende-frequenz requested a review from a team as a code owner May 15, 2023 11:58

matthias-wende-frequenz requested a review from Marenz May 15, 2023 11:58

github-actions bot added part:data-pipeline Affects the data pipeline part:tests Affects the unit, integration and performance (benchmarks) tests labels May 15, 2023

matthias-wende-frequenz removed the part:tests Affects the unit, integration and performance (benchmarks) tests label May 15, 2023

matthias-wende-frequenz added this to the v0.21.0 milestone May 15, 2023

shsms reviewed May 15, 2023

View reviewed changes

src/frequenz/sdk/timeseries/_moving_window.py Outdated Show resolved Hide resolved

src/frequenz/sdk/timeseries/_moving_window.py Outdated Show resolved Hide resolved

shsms assigned matthias-wende-frequenz May 15, 2023

matthias-wende-frequenz force-pushed the moving_trigger branch from c136a73 to e1e58b6 Compare May 15, 2023 15:48

github-actions bot added the part:tests Affects the unit, integration and performance (benchmarks) tests label May 15, 2023

leandro-lucarella-frequenz reviewed May 16, 2023

View reviewed changes

Marenz reviewed May 16, 2023

View reviewed changes

src/frequenz/sdk/timeseries/_moving_window.py Outdated Show resolved Hide resolved

Marenz reviewed May 16, 2023

View reviewed changes

tests/timeseries/test_moving_window.py Outdated Show resolved Hide resolved

Marenz reviewed May 16, 2023

View reviewed changes

src/frequenz/sdk/timeseries/_moving_window.py Outdated Show resolved Hide resolved

matthias-wende-frequenz force-pushed the moving_trigger branch from e1e58b6 to 2566927 Compare May 17, 2023 14:59

matthias-wende-frequenz force-pushed the moving_trigger branch from 2566927 to f80b565 Compare May 17, 2023 15:48

leandro-lucarella-frequenz reviewed May 22, 2023

View reviewed changes

leandro-lucarella-frequenz modified the milestones: v0.21.0, v0.22.0 Jun 5, 2023

llucax removed this from the v0.22.0 milestone Jun 27, 2023

matthias-wende-frequenz added this to the post-v1.0 milestone Aug 23, 2023

thomas-nicolai-frequenz added priority:high Address this as soon as possible type:enhancement New feature or enhancement visitble to users labels Aug 23, 2023

This comment was marked as off-topic.

Sign in to view

llucax changed the base branch from v0.x.x to v1.x.x October 11, 2023 07:21

llucax modified the milestones: post-v1.0, v1.0.0-rc5, v1.0.0-rc6 Jan 29, 2024

llucax modified the milestones: v1.0.0-rc6, v1.0.0-rc7 Mar 26, 2024

matthias-wende-frequenz closed this Jun 5, 2024

llucax modified the milestones: v1.0.0-rc800, Dropped Jun 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add method to MovingWindow that waits for a number of samples #387

Add method to MovingWindow that waits for a number of samples #387

matthias-wende-frequenz commented May 15, 2023

matthias-wende-frequenz commented May 15, 2023

leandro-lucarella-frequenz left a comment

leandro-lucarella-frequenz May 16, 2023

leandro-lucarella-frequenz May 16, 2023

leandro-lucarella-frequenz commented May 16, 2023

Marenz May 16, 2023

matthias-wende-frequenz commented May 17, 2023

leandro-lucarella-frequenz left a comment

leandro-lucarella-frequenz May 22, 2023

leandro-lucarella-frequenz May 22, 2023

leandro-lucarella-frequenz May 22, 2023

leandro-lucarella-frequenz May 22, 2023

leandro-lucarella-frequenz May 22, 2023

leandro-lucarella-frequenz May 22, 2023

leandro-lucarella-frequenz May 22, 2023

leandro-lucarella-frequenz May 22, 2023

cwasicki commented Aug 22, 2023

matthias-wende-frequenz commented Aug 23, 2023

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

matthias-wende-frequenz commented Jun 5, 2024

cwasicki commented Jun 5, 2024

matthias-wende-frequenz commented Jun 6, 2024

shsms commented Jun 6, 2024

		self.count_samples = 0
		"""The number of samples that have been received."""

		received_samples_count = 0
		await wait_for_samples_sender.send(None)

	def new_sample_count_receiver(self) -> Receiver[None]:
	def new_wait_for_samples_event_receiver(self) -> Receiver[None]:


		window.set_sample_counter(samples_to_wait_for)

		# asyncio.create_task(push_data_delayed())

Add method to MovingWindow that waits for a number of samples #387

Add method to MovingWindow that waits for a number of samples #387

Conversation

matthias-wende-frequenz commented May 15, 2023

matthias-wende-frequenz commented May 15, 2023

leandro-lucarella-frequenz left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

leandro-lucarella-frequenz commented May 16, 2023

Choose a reason for hiding this comment

matthias-wende-frequenz commented May 17, 2023

leandro-lucarella-frequenz left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cwasicki commented Aug 22, 2023

matthias-wende-frequenz commented Aug 23, 2023

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

matthias-wende-frequenz commented Jun 5, 2024

cwasicki commented Jun 5, 2024

matthias-wende-frequenz commented Jun 6, 2024

shsms commented Jun 6, 2024