How should I extract the features of noisy speech after mixing? #1202

huhuqwaszxedc · 2023-11-01T09:18:02Z

Hello, my CutSet was created with CutSet.mix, so every cut in it is a MixedCut. When I use the compute_and_store_features_batch function, the features in the output CutSet contain only the features of the first track (i.e. the source audio). How should I extract the features of the noisy speech after mixing?
Thank you very much for your work!

desh2608 · 2023-11-01T13:15:07Z

By default, it should already extract features for the "mixed" speech, not just the first track. compute_and_store_features_batch calls load_audio internally, which has mixed=True set by default (see def load_audio at line 1027 of lhotse/lhotse/cut/mixed.py, commit c5f26af). If this is not the case for you, you may need to use pdb to step through the code and see where it is failing.
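Before reaching for pdb, a quick sanity check may help narrow things down. The sketch below compares the default (mixed) audio of a cut with its first unmixed track. The helper name max_abs_diff_from_first_track is hypothetical and not part of Lhotse. It assumes load_audio() returns an array shaped (num_channels, num_samples), that load_audio(mixed=False) returns one row per track with the source audio first, and that both outputs have the same length.

```python
def max_abs_diff_from_first_track(cut):
    """Largest sample-wise gap between the mixed audio and the first track.

    Hypothetical helper, not a Lhotse function. Assumes:
      - cut.load_audio() returns the mix, shaped (num_channels, num_samples)
      - cut.load_audio(mixed=False) returns one row per track, source first
    """
    mixed = cut.load_audio()              # mixed=True is the default per the reply above
    tracks = cut.load_audio(mixed=False)  # individual tracks, not summed
    mix_channel = mixed[0]                # first (mono) channel of the mix
    first_track = tracks[0]               # assumed to be the source speech
    # Plain iteration works for NumPy arrays as well as nested lists.
    return max(abs(float(m) - float(t)) for m, t in zip(mix_channel, first_track))


# Hypothetical usage with a real manifest (the path is a placeholder):
# from lhotse import CutSet
# cuts = CutSet.from_file("cuts_mixed.jsonl.gz")
# print(max_abs_diff_from_first_track(next(iter(cuts))))
```

A value near zero would suggest the noise track is not reaching the loaded audio (or is silent), while a clearly non-zero value would suggest the mix is loaded correctly and the problem lies later, in feature extraction or storage.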
