azure-cognitiveservices-speech 1.37.0 won't work for PA task on Mac OSX #2347

dyustc · 2024-04-19T08:58:21Z

IN ORDER TO ASSIST YOU, PLEASE PROVIDE THE FOLLOWING:

Speech SDK log taken from a run that exhibits the reported issue.
See instructions on how to take logs.
A stripped down, simplified version of your source code that exhibits the issue. Or, preferably, try to reproduce the problem with one of the public samples in this repository (or a minimally modified version of it), and share the code.
If relevant, a WAV file of your input audio.
Additional information as shown below

Describe the bug
I use Mac OSX 14.4.1 (23E224), M2 chipset, under Python 3.9.18, I install the latest azure package. azure-cognitiveservices-speech 1.37.0. Also, I use the sample code, I copied from the Azure speech studio, except that I use my own subscription key, and region.

And it won't work. It crashes here in line 64. I make sure the wav file and text is also correct.

To Reproduce

Steps to reproduce the behavior:

...
...

Expected behavior

A clear and concise description of what you expected to happen.

Version of the Cognitive Services Speech SDK

Which version of the SDK are you using.

Platform, Operating System, and Programming Language

OS: [e.g. Windows, Linux, Android, iOS, ...] - please be specific
Hardware - x64, x86, ARM, ...
Programming language: C#, C++, Java, JavaScript, Objective-C, Python
Browser [e.g. Chrome, Safari] (if applicable) - please be specific

Additional context

here is the error msg,
~/work/ramp/CTC-Attention-Mispronunciation/egs/qa (41*) » python microsoft_pronunciation_assessment.py daiyi@bogon
Traceback (most recent call last):
File "/Users/daiyi/work/ramp/CTC-Attention-Mispronunciation/egs/qa/microsoft_pronunciation_assessment.py", line 178, in
pronunciation_assessment_continuous_from_file(audio, txt)
File "/Users/daiyi/work/ramp/CTC-Attention-Mispronunciation/egs/qa/microsoft_pronunciation_assessment.py", line 62, in pronunciation_assessment_continuous_from_file
pronunciation_config.enable_prosody_assessment()
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/speech.py", line 3077, in enable_prosody_assessment
self.__properties.set_property(PropertyId.PronunciationAssessment_EnableProsodyAssessment, "true")
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/properties.py", line 29, in set_property
_call_hr_fn(fn=_sdk_lib.property_bag_set_string, *[self._handle, ctypes.c_int(property_id.value), None, c_value])
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/interop.py", line 62, in _call_hr_fn
_raise_if_failed(hr)
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/interop.py", line 55, in _raise_if_failed
__try_get_error(_spx_handle(hr))
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/interop.py", line 50, in __try_get_error
raise RuntimeError(message)
RuntimeError: Exception with error code:
[CALL STACK BEGIN]

3 libMicrosoft.CognitiveServices.Spee 0x0000000101c9cb34 property_bag_set_string + 408
4 libffi.8.dylib 0x000000010159804c ffi_call_SYSV + 76
5 libffi.8.dylib 0x000000010159574c ffi_call_int + 1208
6 _ctypes.cpython-39-darwin.so 0x00000001015785a0 _ctypes_callproc + 1260
7 _ctypes.cpython-39-darwin.so 0x00000001015729c8 PyCFuncPtr_call + 1148
8 python3.9 0x0000000100ecc1d0 _PyObject_Call + 164
9 python3.9 0x0000000100fb9ea4 _PyEval_EvalFrameDefault + 27244
10 python3.9 0x0000000100fb2de0 _PyEval_EvalCode + 2908
11 python3.9 0x0000000100ecc3e4 _PyFunction_Vectorcall + 220
12 python3.9 0x0000000100ecbff0 PyVectorcall_Call + 156
13 python3.9 0x0000000100fb9ea4 _PyEval_EvalFrameDefault + 27244
14 python3.9 0x0000000100ecc4a0 function_code_fastcall + 116
15 python3.9 0x0000000100fbd480 call_function + 516
16 python3.9 0x0000000100fb9b5c _PyEval_EvalFrameDefault + 26404
17 python3.9 0x0000000100ecc4a0 function_code_fastcall + 116
18 python3.9 0x0000000100fbd480 call_function + 516
19 python3.9 0x0000000100fb9b5c _PyEval_EvalFrameDefault + 26404
[CALL STACK END]

Exception with an error code: 0x5 (SPXERR_INVALID_ARG)

Any additional information.

ralph-msft · 2024-04-22T15:51:11Z

Please provide the SDK logs:
https://docs.microsoft.com/azure/cognitive-services/speech-service/how-to-use-logging

dyustc · 2024-04-24T02:55:18Z

Please provide the SDK logs: https://docs.microsoft.com/azure/cognitive-services/speech-service/how-to-use-logging

I added this line, as the log suggests
speech_config.set_property(speechsdk.PropertyId.Speech_LogFilename, "/Users/daiyi/work/ramp/CTC-Attention-Mispronunciation/egs/qa/azure.log")

but the log is empty. Only the err message as my original post
`
Traceback (most recent call last):
File "/Users/daiyi/work/ramp/CTC-Attention-Mispronunciation/egs/qa/microsoft_pronunciation_assessment.py", line 180, in
pronunciation_assessment_continuous_from_file(audio, txt)
File "/Users/daiyi/work/ramp/CTC-Attention-Mispronunciation/egs/qa/microsoft_pronunciation_assessment.py", line 64, in pronunciation_assessment_continuous_from_file
pronunciation_config.enable_prosody_assessment()
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/speech.py", line 3077, in enable_prosody_assessment
self.__properties.set_property(PropertyId.PronunciationAssessment_EnableProsodyAssessment, "true")
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/properties.py", line 29, in set_property
_call_hr_fn(fn=_sdk_lib.property_bag_set_string, *[self._handle, ctypes.c_int(property_id.value), None, c_value])
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/interop.py", line 62, in _call_hr_fn
_raise_if_failed(hr)
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/interop.py", line 55, in _raise_if_failed
__try_get_error(_spx_handle(hr))
File "/Users/daiyi/miniconda3/envs/py39/lib/python3.9/site-packages/azure/cognitiveservices/speech/interop.py", line 50, in __try_get_error
raise RuntimeError(message)
RuntimeError: Exception with error code:
[CALL STACK BEGIN]

3 libMicrosoft.CognitiveServices.Spee 0x0000000105d0cb34 property_bag_set_string + 408
4 libffi.8.dylib 0x0000000104c7c04c ffi_call_SYSV + 76
5 libffi.8.dylib 0x0000000104c7974c ffi_call_int + 1208
6 _ctypes.cpython-39-darwin.so 0x0000000104c5c5a0 _ctypes_callproc + 1260
7 _ctypes.cpython-39-darwin.so 0x0000000104c569c8 PyCFuncPtr_call + 1148
8 python3.9 0x00000001045b01d0 _PyObject_Call + 164
9 python3.9 0x000000010469dea4 _PyEval_EvalFrameDefault + 27244
10 python3.9 0x0000000104696de0 _PyEval_EvalCode + 2908
11 python3.9 0x00000001045b03e4 _PyFunction_Vectorcall + 220
12 python3.9 0x00000001045afff0 PyVectorcall_Call + 156
13 python3.9 0x000000010469dea4 _PyEval_EvalFrameDefault + 27244
14 python3.9 0x00000001045b04a0 function_code_fastcall + 116
15 python3.9 0x00000001046a1480 call_function + 516
16 python3.9 0x000000010469db5c _PyEval_EvalFrameDefault + 26404
17 python3.9 0x00000001045b04a0 function_code_fastcall + 116
18 python3.9 0x00000001046a1480 call_function + 516
19 python3.9 0x000000010469db5c _PyEval_EvalFrameDefault + 26404
[CALL STACK END]

Exception with an error code: 0x5 (SPXERR_INVALID_ARG)
`

dyustc · 2024-04-24T10:36:30Z

Please provide the SDK logs: https://docs.microsoft.com/azure/cognitive-services/speech-service/how-to-use-logging

it seems like a library problem itself, not how I called the library. And also it crashes in the speech sdk setup stage, before calling any speech service, maybe before verfification also. so maybe this is why no log is generated.

I am running on M1 pro, sonoma 14.4.1, python 3.9, the azure-cognitiveservices-speech version is 1.37.0

ralph-msft · 2024-04-26T18:15:22Z

Have you tried using the Python sample code on your machine to rule out any potential issues in your code?

Could you also please check what the architecture of the speech shared library is? You can use e.g.

file libMicrosoft.CognitiveServices.Speech.core.dylib

or

lipo -info libMicrosoft.CognitiveServices.Speech.core.dylib

dyustc · 2024-04-28T03:31:46Z

Have you tried using the Python sample code on your machine to rule out any potential issues in your code?

Readme: https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/README.md

Python code: https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/console/speech_sample.py

Could you also please check what the architecture of the speech shared library is? You can use e.g.
file libMicrosoft.CognitiveServices.Speech.core.dylib
or
lipo -info libMicrosoft.CognitiveServices.Speech.core.dylib

I run the sample code you provided, it hint the same crash as I provided. and the output of 2 cmds is
Mach-O 64-bit dynamically linked shared library arm64
or
Non-fat file: libMicrosoft.CognitiveServices.Speech.core.dylib is architecture: arm64

dyustc · 2024-05-09T11:54:36Z

@ralph-msft Hi, is there anything extra I could provide? I still have this issue on my mac.

github-actions · 2024-05-29T02:13:14Z

This item has been open without activity for 19 days. Provide a comment on status and remove "update needed" label.

ralph-msft self-assigned this Apr 22, 2024

ralph-msft added the in-review In review label Apr 26, 2024

github-actions bot added the update needed For items that are in progress but have not been updated label May 29, 2024

ForrestGumb added the pronunciation assessment label May 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

azure-cognitiveservices-speech 1.37.0 won't work for PA task on Mac OSX #2347

azure-cognitiveservices-speech 1.37.0 won't work for PA task on Mac OSX #2347

dyustc commented Apr 19, 2024

ralph-msft commented Apr 22, 2024

dyustc commented Apr 24, 2024

dyustc commented Apr 24, 2024

ralph-msft commented Apr 26, 2024

dyustc commented Apr 28, 2024

dyustc commented May 9, 2024

github-actions bot commented May 29, 2024

azure-cognitiveservices-speech 1.37.0 won't work for PA task on Mac OSX #2347

azure-cognitiveservices-speech 1.37.0 won't work for PA task on Mac OSX #2347

Comments

dyustc commented Apr 19, 2024

ralph-msft commented Apr 22, 2024

dyustc commented Apr 24, 2024

dyustc commented Apr 24, 2024

ralph-msft commented Apr 26, 2024

dyustc commented Apr 28, 2024

dyustc commented May 9, 2024

github-actions bot commented May 29, 2024