
Add test for sequence state after cancellation #7167

Merged
merged 5 commits into main from jacky-seq-slot on May 7, 2024

Conversation

kthui
Contributor

@kthui kthui commented Apr 26, 2024

Should be coupled with the PR in the core repository that adds the fix: triton-inference-server/core#341

Reproduction steps:

  1. Start sequence 1 and cancel it - this releases the slot.
  2. Wait until the sequence times out - this releases the slot again.
  3. Start two new sequences - they will be executed concurrently on the double-released slot.

Since sequence state is identified by the sequence slot, the two new sequences executing concurrently will share the same sequence state, corrupting that state. A sketch of the reproduction follows the reference below.

Ref: #7117 (comment)
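
A minimal sketch of the reproduction, assuming a sequence-batching model with an idle timeout configured. The model name, timeout value, and `make_inputs()` helper are placeholders standing in for the test's actual model and its `self._get_inputs(delay_itrs=...)` builder:

```python
import time

import numpy as np
import tritonclient.grpc as grpcclient

MODEL = "sequence_model"  # placeholder for the test's model name
TIMEOUT_SECS = 10         # placeholder for the configured idle timeout


def make_inputs():
    # Hypothetical single-tensor input; the real test builds inputs with
    # self._get_inputs(delay_itrs=...) to control per-request execution time.
    inp = grpcclient.InferInput("INPUT0", [1], "INT32")
    inp.set_data_from_numpy(np.array([0], dtype=np.int32))
    return [inp]


def callback(result, error):
    # Responses are irrelevant here; the bug is in server-side slot bookkeeping.
    pass


# Step 1: start sequence 1 and cancel it - this releases its slot.
with grpcclient.InferenceServerClient("localhost:8001") as client:
    client.start_stream(callback)
    client.async_stream_infer(
        MODEL, make_inputs(), sequence_id=1, sequence_start=True
    )
    client.stop_stream(cancel_requests=True)

# Step 2: wait past the sequence idle timeout - the slot is released again.
time.sleep(TIMEOUT_SECS + 1)

# Step 3: start sequences 2 and 3; on a broken server both can land on the
# double-released slot and share (and corrupt) a single sequence state.
with grpcclient.InferenceServerClient("localhost:8001") as client:
    client.start_stream(callback)
    for seq_id in (2, 3):
        client.async_stream_infer(
            MODEL, make_inputs(), sequence_id=seq_id, sequence_start=True
        )
    client.stop_stream(cancel_requests=False)
```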

@kthui kthui marked this pull request as ready for review April 26, 2024 19:39
                sequence_id=1,
                sequence_start=True,
            )
            seq_start = False
Member

question: why do we set this value to False when it is overwritten on line 84 and not used before that?

Contributor Author

This variable should be used for sequence_start; as you caught in another comment, it was always being set to True. The issue is fixed.

                model_name,
                self._get_inputs(delay_itrs=5000000),
                sequence_id=1,
                sequence_start=True,
Member

Suggested change
sequence_start=True,
sequence_start=seq_start,

Contributor Author

Good catch!

Comment on lines 81 to 96
with grpcclient.InferenceServerClient("localhost:8001") as client:
    client.start_stream(callback)
    seq_start = True
    num_reqs = 4
    seq_ids = [2, 3]
    for req_id in range(num_reqs):
        for seq_id in seq_ids:
            client.async_stream_infer(
                model_name,
                self._get_inputs(delay_itrs=0),
                sequence_id=seq_id,
                sequence_start=seq_start,
            )
            time.sleep(0.1)
        seq_start = False
    client.stop_stream(cancel_requests=False)
Member

Could we perhaps refactor this into a common function? It looks like a repetition of the block above it.

Contributor Author

Good idea! Updated: Regroup infer calls
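
A hypothetical shape for the regrouped helper (the actual refactor lives in the linked "Regroup infer calls" commit, which is not shown here); the name `_stream_infer` and its parameter list are assumptions:

```python
import time


def _stream_infer(self, client, model_name, seq_ids, num_reqs, delay_itrs):
    # Hypothetical helper covering both repeated blocks: each sequence in
    # seq_ids sends num_reqs requests over the open stream, and only the
    # first round marks sequence_start=True.
    seq_start = True
    for _ in range(num_reqs):
        for seq_id in seq_ids:
            client.async_stream_infer(
                model_name,
                self._get_inputs(delay_itrs=delay_itrs),
                sequence_id=seq_id,
                sequence_start=seq_start,
            )
            time.sleep(0.1)
        seq_start = False
```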

@rmccorm4 rmccorm4 self-assigned this Apr 29, 2024
@kthui kthui requested a review from Tabrizian May 6, 2024 20:47
@tanmayv25
Contributor

Looks good to me. Please rebase the branch.

@kthui kthui merged commit d21685b into main May 7, 2024
3 checks passed
@kthui kthui deleted the jacky-seq-slot branch May 7, 2024 23:27