Python AudioDataStream.read_data should not modify immutable bytes object #2337

msehnout · 2024-04-11T12:48:33Z

Hello

This sample can lead to subtle bugs:

cognitive-services-speech-sdk/samples/python/console/speech_synthesis_sample.py

Line 386 in 1b2d61b

            audio_buffer = bytes(16000)
            total_size = 0
            filled_size = audio_data_stream.read_data(audio_buffer)
            while filled_size > 0:
                print("{} bytes received.".format(filled_size))
                total_size += filled_size
                filled_size = audio_data_stream.read_data(audio_buffer)

The problem is that the bytes type is immutable in Python, but the Speech SDK calls into a native C library that modifies the bytes object in place: https://docs.python.org/3/library/stdtypes.html#bytes-objects
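To make the claim concrete, here is a minimal sketch of how native code can overwrite a bytes object. It does not use the Speech SDK: `native_fill` is a hypothetical stand-in that uses `ctypes.memmove` to imitate a C library writing through the object's internal pointer, and the behaviour shown is what I would expect on CPython.

```python
import ctypes


def native_fill(buffer: bytes, payload: bytes) -> int:
    """Hypothetical stand-in for a native read: overwrite buffer's memory in place."""
    count = min(len(buffer), len(payload))
    # ctypes passes a pointer to the bytes object's internal storage,
    # so this write bypasses Python's immutability guarantee.
    ctypes.memmove(buffer, payload, count)
    return count


audio_buffer = bytes(8)    # eight zero bytes, nominally immutable
snapshot = audio_buffer    # a second name for the same object, not a copy
filled = native_fill(audio_buffer, b"RIFFdata")
print(filled, snapshot)    # snapshot reflects the write made through audio_buffer
```

Any code that kept a reference to the buffer, assuming it could never change, sees the new contents.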

I stumbled upon the bug when I tried to accumulate the buffer in a separate function like this:

buffer = b""
for chunk in stream_from_azure:
   buffer += chunk

where the iterator was implemented like this:

            audio_buffer = bytes(16000)
            filled_size = audio_data_stream.read_data(audio_buffer)
            while filled_size > 0:
                yield audio_buffer[:filled_size]
                filled_size = audio_data_stream.read_data(audio_buffer)

And the result was corrupted.
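A likely explanation for the corruption, sketched below under stated assumptions: on CPython, slicing a bytes object over its full length can return the very same object, and `b"" + chunk` can return `chunk` itself, so the accumulated buffer ends up aliasing `audio_buffer` and is overwritten by the next read. `FakeAudioDataStream` is a hypothetical stand-in for the SDK stream (it imitates the in-place write with `ctypes.memmove`), and `chunks_copied` shows one possible workaround that forces a copy of every chunk.

```python
import ctypes


class FakeAudioDataStream:
    """Hypothetical stand-in for AudioDataStream, for illustration only."""

    def __init__(self, data: bytes):
        self._data = data
        self._pos = 0

    def read_data(self, audio_buffer: bytes) -> int:
        chunk = self._data[self._pos:self._pos + len(audio_buffer)]
        if chunk:
            # Imitates the native library writing into the caller's bytes object.
            ctypes.memmove(audio_buffer, chunk, len(chunk))
        self._pos += len(chunk)
        return len(chunk)


def chunks_aliased(stream, size):
    """Iterator as in the report: may yield audio_buffer itself."""
    audio_buffer = bytes(size)
    filled_size = stream.read_data(audio_buffer)
    while filled_size > 0:
        yield audio_buffer[:filled_size]  # a full-length slice may be the same object
        filled_size = stream.read_data(audio_buffer)


def chunks_copied(stream, size):
    """Workaround: always yield an independent copy of the valid bytes."""
    audio_buffer = bytes(size)
    filled_size = stream.read_data(audio_buffer)
    while filled_size > 0:
        yield bytes(bytearray(audio_buffer[:filled_size]))  # forces a fresh object
        filled_size = stream.read_data(audio_buffer)


def accumulate(chunks):
    buffer = b""
    for chunk in chunks:
        buffer += chunk
    return buffer


expected = b"AAAABBBBCCCC"
broken = accumulate(chunks_aliased(FakeAudioDataStream(expected), 4))
fixed = accumulate(chunks_copied(FakeAudioDataStream(expected), 4))
print(broken == expected, fixed == expected)
```

The aliasing depends on interpreter optimizations, so the exact corrupted output may differ between Python implementations. The copying variant does not rely on them.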

The SDK should probably take a bytearray instead, because it is the mutable counterpart to bytes objects: https://docs.python.org/3/library/stdtypes.html#bytearray-objects
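For illustration, a bytearray-based read could look roughly like the sketch below. `read_into` is a hypothetical function, not part of the Speech SDK. It relies on `from_buffer` from ctypes, which only accepts writable buffers, so a bytearray is accepted while a bytes object is rejected with a `TypeError`.

```python
import ctypes


def read_into(dest: bytearray, payload: bytes) -> int:
    """Hypothetical reader: copy payload into a caller-owned mutable buffer."""
    count = min(len(dest), len(payload))
    # from_buffer requires a writable buffer, so bytearray works and bytes does not.
    target = (ctypes.c_char * len(dest)).from_buffer(dest)
    ctypes.memmove(target, payload, count)
    return count


audio_buffer = bytearray(8)
filled = read_into(audio_buffer, b"RIFF")
chunk = bytes(audio_buffer[:filled])  # explicit, independent copy of the valid bytes

try:
    (ctypes.c_char * 8).from_buffer(bytes(8))
    rejected = False
except TypeError:
    rejected = True  # immutable bytes cannot be exposed as a writable buffer

print(filled, chunk, rejected)
```

With this shape the mutation is explicit in the type the caller passes, and slicing a bytearray always produces a new object, so yielded chunks cannot alias the read buffer.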

I could not find a better place to report this issue. Please let me know if I can submit it somewhere else.

yulin-li · 2024-04-14T04:30:07Z

Thanks for this issue. I agree with you that we should not modify this immutable object. We will discuss internally how to update this, as we don't want to introduce breaking changes at this time.

yulin-li · 2024-04-14T04:34:58Z

And this is the right place to report this issue; this is the official Speech SDK repo.

github-actions · 2024-05-06T02:11:25Z

This item has been open without activity for 19 days. Provide a comment on status and remove "update needed" label.

yulin-li self-assigned this Apr 14, 2024

yulin-li added the enhancement and text-to-speech labels Apr 14, 2024

github-actions bot added the update needed label May 6, 2024
