Error while fine-tuning hyperparameters in the Pyannote.audio 2.1 Speaker Diarization Pipeline #1317

nalli-hu · 2023-04-05T09:25:52Z

I followed the notebook "Adapting pyannote.audio 2.1 pretrained speaker diarization pipeline to your own data" to adapt the Pyannote.audio 2.1 Speaker Diarization Pipeline to my own data.

I am using the pyannote.database structure to load my data, and the audio recordings are short snippets, with the shortest ones being one second in length.

However, when I try to fine-tune the hyperparameters using the following code:
iterations = optimizer.tune_iter(dev_set, show_progress=False)
best_loss = 1.0
for i, iteration in enumerate(iterations):
    print(f"Best segmentation threshold so far: {iteration['params']['segmentation']['threshold']}")
    if i > 20:
        break  # 50 iterations should give slightly better results

I encounter the following error:
Traceback (most recent call last):
  File "/opt/miniconda3/envs/Asr2/lib/python3.9/site-packages/optuna/study/_optimize.py", line 200, in _run_trial
    value_or_values = func(trial)
  File "path/to/optimizer.py", line 210, in objective
    output = pipeline(input)
  File "/opt/miniconda3/envs/Asr2/lib/python3.9/site-packages/pyannote/audio/core/pipeline.py", line 238, in __call__
    return self.apply(file, **kwargs)
  File "/opt/miniconda3/envs/Asr2/lib/python3.9/site-packages/pyannote/audio/pipelines/speaker_diarization.py", line 494, in apply
    hard_clusters, _ = self.clustering(
  File "/opt/miniconda3/envs/Asr2/lib/python3.9/site-packages/pyannote/audio/pipelines/clustering.py", line 612, in __call__
    oracle_segmentations = oracle_segmentation(file, window, frames=frames)
  File "/opt/miniconda3/envs/Asr2/lib/python3.9/site-packages/pyannote/audio/pipelines/utils/oracle.py", line 105, in oracle_segmentation
    return SlidingWindowFeature(np.float32(np.stack(segmentations)), window)
  File "<__array_function__ internals>", line 180, in stack
  File "/opt/miniconda3/envs/Asr2/lib/python3.9/site-packages/numpy/core/shape_base.py", line 422, in stack
    raise ValueError('need at least one array to stack')
ValueError: need at least one array to stack

Any suggestions on how to fix this error would be greatly appreciated.
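
The traceback ends in np.stack being called on an empty list inside oracle_segmentation. One possible cause, not confirmed anywhere in this thread, is that a recording is shorter than the sliding window, so no window fits inside it and the list of per-window segmentations stays empty. The sketch below only illustrates that mechanism with plain NumPy. The helper names count_full_windows and stack_windows are hypothetical, and the 5.0 s window and 0.5 s step are assumed values, not settings read from the pipeline.

```python
import numpy as np


def count_full_windows(file_duration, window_duration=5.0, step=0.5):
    """Count sliding windows that fit entirely inside a file.

    window_duration and step are assumed values for illustration only.
    """
    if file_duration < window_duration:
        return 0
    return int((file_duration - window_duration) // step) + 1


def stack_windows(file_duration, num_frames=10, num_speakers=2):
    """Stack one dummy (num_frames, num_speakers) array per window.

    The zero arrays stand in for per-window segmentations; they are not
    real pyannote output.
    """
    chunks = [
        np.zeros((num_frames, num_speakers), dtype=np.float32)
        for _ in range(count_full_windows(file_duration))
    ]
    # np.stack raises ValueError when `chunks` is empty.
    return np.stack(chunks)


if __name__ == "__main__":
    print(stack_windows(6.0).shape)  # three windows fit in a 6 s file
    try:
        stack_windows(1.0)  # no 5 s window fits in a 1 s file
    except ValueError as error:
        print(f"ValueError: {error}")
```

If this is indeed the cause, checking the duration of every file in dev_set against the window length before tuning would show which recordings trigger the error.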

github-actions · 2023-04-05T09:26:13Z

We found the following entry in the FAQ which you may find helpful:

Does pyannote support streaming speaker diarization?

Feel free to close this issue if you found an answer in the FAQ. Otherwise, please give us a little time to review.

This is an automated reply, generated by FAQtory

hbredin · 2023-04-06T07:23:15Z

To maximise the probability of someone answering your question:

if your issue is a bug report, please provide a minimum reproducible example, e.g. a link to a self-contained Google Colab notebook (i.e. containing everything needed to reproduce the bug: installation of pyannote.audio, downloads of models or test data, etc.)
if your issue is a feature request, please read this first and update your request accordingly.

stale · 2023-10-03T07:30:20Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

hbredin added the cannot_reproduce label Apr 6, 2023

stale bot added the wontfix label Oct 3, 2023

stale bot closed this as completed Nov 2, 2023
