
Performances #30

Open
Gsantomaggio opened this issue Jan 31, 2023 · 11 comments

Comments

@Gsantomaggio
Collaborator

Test the performance and evaluate a way to aggregate messages before sending them.

@DanielePalaia
Collaborator

DanielePalaia commented Feb 3, 2023

I ran a few tests, also running the Pyroscope profiler on top of them.

The test was quite simple:

    MESSAGES = 100
    list_messages = []
    t0 = time.process_time()
    for j in range(10000):
        for i in range(MESSAGES):
            # sending basic messages
            # amqp_message = AMQPMessage(body='hello')
            amqp_message = b'hello'
            list_messages.append(amqp_message)
        await producer.publish_batch('mixing', list_messages, False)
        list_messages.clear()
    t3 = time.process_time()
    print("time elapsed after publish_batch: " + str(t3 - t0))

Sending 1 million messages with publish_batch.
The first attempt, creating an AMQPMessage per message, was very slow: around 11 seconds.
Just sending in binary format instead (amqp_message = b'hello'), it seems we gain at least 5-6 seconds, as the computation time goes down to 6 seconds.

From the Pyroscope profiler it seems that the functions taking the most time are indeed __bytes__ at amqp.py:21 (when running the first version of the test) and encode_struct at encoding.py:98, which takes 3.5 seconds in total (cumulative).
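
To isolate the serialization cost from the publishing itself, a minimal sketch along the same lines (assuming AMQPMessage is importable from rstream, as in the commented part of the test above) that times only the bytes conversion:

    import time
    from rstream import AMQPMessage  # assumed import path

    t0 = time.process_time()
    for _ in range(1_000_000):
        # bytes() goes through AMQPMessage.__bytes__ (amqp.py:21),
        # the hot spot visible in the profile
        data = bytes(AMQPMessage(body='hello'))
    print("serialization time: " + str(time.process_time() - t0))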

@qweeze
Owner

qweeze commented Feb 3, 2023

Hi @DanielePalaia, have you tried the py-spy profiler? It can output nice flamegraph visualizations of stack traces by the time they take.

Also, it would be great if you could post a flamegraph report to this thread.

@DanielePalaia
Collaborator

Hi @qweeze
This link is when I send 1 million AMQPMessage objects (the commented part):
https://flamegraph.com/share/a44afb24-a3d2-11ed-b6df-aec0fc79c33f
This second link is when I send 1 million binary messages (amqp_message = b'hello'):
https://flamegraph.com/share/6fb388e7-a3d2-11ed-b6df-aec0fc79c33f

This is the output from Pyroscope.
I can also try py-spy if you think it's better.

@Rperry2174

Rperry2174 commented Feb 3, 2023

Hi @DanielePalaia, someone in the Pyroscope Slack channel let us know about this issue -- just wanted to say a few things FYI:

  1. Excited to see use of flamegraph.com :)
  2. We use py-spy under the hood for our Python integration (with some added functionality)
  3. You can also use flamegraph.com and its API to get a diff between two flamegraphs:
curl 'https://flamegraph.com/api/render/v1/a44afb24-a3d2-11ed-b6df-aec0fc79c33f' > f1.json
curl 'https://flamegraph.com/api/render/v1/6fb388e7-a3d2-11ed-b6df-aec0fc79c33f' > f2.json

and then upload them to flamegraph.com to view the diff:

Hope this helps! Feel free to let us know if there's any functionality that you want/need that doesn't exist

@qweeze
Owner

qweeze commented Feb 4, 2023

Didn't know about https://flamegraph.com, very cool 😅

Thanks @DanielePalaia, it seems that uamqp and encoding indeed take quite some time.

Also, I made a couple of quick and dirty optimizations to the encoding module in this branch. I got a ~50% speedup on a test like this:

from rstream import schema
from rstream.encoding import encode_frame  # assumed import paths

frame = schema.Publish(publisher_id=1, messages=[schema.Message(publishing_id=123, data=b"hello")])
for _ in range(100_000):
    encode_frame(frame)

I'm not an expert on performance, but I doubt that it's possible to radically improve performance using only pure Python for low-level stuff like binary protocol encoding/decoding. If speed is a concern, maybe we should consider alternatives such as Cython or a C/Rust extension.
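
To illustrate the kind of pure-python tuning that is possible before reaching for an extension: one common trick is precompiling struct.Struct objects instead of passing format strings on every call. A minimal sketch (the names below are illustrative, not the actual rstream internals):

    import struct

    # Precompiled Struct objects avoid per-call format-string parsing/lookup
    _UINT16 = struct.Struct(">H")
    _UINT32 = struct.Struct(">I")

    def encode_uint32(value: int) -> bytes:
        return _UINT32.pack(value)

    def encode_string(value: bytes) -> bytes:
        # a length-prefixed string field, a common shape in binary protocols
        return _UINT16.pack(len(value)) + value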

@DanielePalaia
Collaborator

@qweeze Thank you for your improvements, I will have a look!

@DanielePalaia
Collaborator

Using the test-encoding-optimization branch with the previous example generates an exception:

Exception in callback BaseClient.start_task.<locals>.on_task_done(<Task finishe...sResponse'>")>) at /Users/dpalaia/projects/rabbitmq-stream-mixing/python/venv/lib/python3.10/site-packages/rstream/client.py:93
handle: <Handle BaseClient.start_task.<locals>.on_task_done(<Task finishe...sResponse'>")>) at /Users/dpalaia/projects/rabbitmq-stream-mixing/python/venv/lib/python3.10/site-packages/rstream/client.py:93>
Traceback (most recent call last):
  File "/Users/dpalaia/projects/rabbitmq-stream-mixing/python/venv/lib/python3.10/site-packages/rstream/encoding.py", line 179, in decode_frame
    frame = _decode_struct(buf, cls)
  File "/Users/dpalaia/projects/rabbitmq-stream-mixing/python/venv/lib/python3.10/site-packages/rstream/encoding.py", line 162, in _decode_struct
    data[f.name] = _decode_field(buf, fld_tp)
  File "/Users/dpalaia/projects/rabbitmq-stream-mixing/python/venv/lib/python3.10/site-packages/rstream/encoding.py", line 149, in _decode_field
    raise NotImplementedError(f"Unexpected type {tp}")
NotImplementedError: Unexpected type typing.ClassVar[rstream.constants.Key]

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/local/Cellar/python@3.10/3.10.9/Frameworks/Python.framework/Versions/3.10/lib/python3.10/asyncio/events.py", line 80, in _run
    self._context.run(self._callback, *self._args)
  File "/Users/dpalaia/projects/rabbitmq-stream-mixing/python/venv/lib/python3.10/site-packages/rstream/client.py", line 95, in on_task_done
    task.result()
  File "/Users/dpalaia/projects/rabbitmq-stream-mixing/python/venv/lib/python3.10/site-packages/rstream/client.py", line 164, in _listener
    frame = await self._conn.read_frame()
  File "/Users/dpalaia/projects/rabbitmq-stream-mixing/python/venv/lib/python3.10/site-packages/rstream/connection.py", line 73, in read_frame
    return decode_frame(await self._read_frame_raw())
  File "/Users/dpalaia/projects/rabbitmq-stream-mixing/python/venv/lib/python3.10/site-packages/rstream/encoding.py", line 182, in decode_frame
    raise ValueError(f"Could not decode {cls!r}") from e
ValueError: Could not decode <class 'rstream.schema.PeerPropertiesResponse'>
^CTraceback (most recent call last):
  File "/Users/dpalaia/projects/rabbitmq-stream-mixing/python/python_rstream/producer.py", line 44, in <module>
    asyncio.run(publish())
  File "/usr/local/Cellar/python@3.10/3.10.9/Frameworks/Python.framework/Versions/3.10/lib/python3.10/asyncio/runners.py", line 44, in run
    return loop.run_until_complete(main)
  File "/usr/local/Cellar/python@3.10/3.10.9/Frameworks/Python.framework/Versions/3.10/lib/python3.10/asyncio/base_events.py", line 636, in run_until_complete
    self.run_forever()
  File "/usr/local/Cellar/python@3.10/3.10.9/Frameworks/Python.framework/Versions/3.10/lib/python3.10/asyncio/base_events.py", line 603, in run_forever
    self._run_once()
  File "/usr/local/Cellar/python@3.10/3.10.9/Frameworks/Python.framework/Versions/3.10/lib/python3.10/asyncio/base_events.py", line 1868, in _run_once
    event_list = self._selector.select(timeout)
  File "/usr/local/Cellar/python@3.10/3.10.9/Frameworks/Python.framework/Versions/3.10/lib/python3.10/selectors.py", line 562, in select
    kev_list = self._selector.control(None, max_ev, timeout)

Will try to have a look!

@qweeze
Owner

qweeze commented Feb 8, 2023

@DanielePalaia sorry I accidentally broke decoding in this branch, now fixed in 28dd87d
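
For context, the traceback shows the decoder tripping over a ClassVar pseudo-field (the frame key constant), which lives on the class and carries no bytes on the wire. A sketch of how such annotations can be detected and skipped during decoding (illustrative; not necessarily what 28dd87d actually does):

    import typing

    def is_classvar(tp) -> bool:
        # matches annotations like typing.ClassVar[rstream.constants.Key], which
        # a field decoder should skip rather than try to read from the buffer
        return typing.get_origin(tp) is typing.ClassVar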

@DanielePalaia
Collaborator

Hi @qweeze, I made a few tests on these new improvements.
I was able to gain a few seconds (roughly a 15%-20% improvement).

On 3 million messages sent in binary:
from 15 seconds to 12 seconds with publish_batch.

And sent in AMQP format:
from 55 seconds to 48 seconds.

I think it's a nice improvement anyway. Would it be possible to make an official PR that we can review and maybe merge?

@Gsantomaggio
Collaborator Author

PR #100 increases the performance a bit.

@DanielePalaia
Collaborator

To check #127, #132, #133 for possible improvements.
