Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adapt "step" automagically #206

Open
hbredin opened this issue Nov 10, 2023 · 2 comments
Open

Adapt "step" automagically #206

hbredin opened this issue Nov 10, 2023 · 2 comments
Labels
feature New feature or request

Comments

@hbredin
Copy link
Collaborator

hbredin commented Nov 10, 2023

step controls the minimum algorithmic latency of the speaker diarization pipeline.

Targetting real-time processing, one needs to make sure that the processing latency (i.e. the time it takes to process one step) is smaller than this algorithmic latency.

Said differently: the lower bound on the algorithmic latency is the processing latency, which in turns, depends on the computing power of the machine the pipeline runs on (e.g. GPU is usually faster than CPU).

Would be nice to provide an API to automatically estimate this lower bound by running a few steps when pipeline is instantiated and measuring the time it takes so that step can be set automatically to processing latency + a little safety net.

@hbredin hbredin added the feature New feature or request label Nov 10, 2023
@juanmc2005
Copy link
Owner

I like it, that way we can automatically set the lowest possible latency. This could be implemented as --step auto, but also somewhere in the python API

@juanmc2005
Copy link
Owner

Another idea: Implement this as a diart.profile recording.wav that also runs a quick grid search on that file to suggest hyper-parameter values without running a costly tuning.

This would be useful for people that don't have much data but have a "typical" conversation that the system will encounter. Then diart would quickly suggest a config to get started.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants