Fast loading of cut audio in dataloader #955

Answered by pzelasko
desh2608 asked this question in Q&A

Cuts are already implemented this way, i.e. they load only the relevant subset of audio data from disk, not the full recording*. But that's often not nearly enough on slow clusters with magnetic disks and slow interconnects. Usually you end up bottlenecked by random-access reads, which can be as much as 100x slower than sequential reads, because the recordings and other data are scattered all over a magnetic disk and each read has to wait for the disk to physically seek to them.
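To make "load only the relevant subset" concrete, here is a minimal sketch using only Python's standard `wave` module. It is not Lhotse's implementation; the helper names `write_test_wav` and `read_segment` are hypothetical and exist only for this illustration. It seeks to the start of the segment and reads just the frames it needs.

```python
import io
import wave


def write_test_wav(fileobj, num_frames, sample_rate=16000):
    """Write a mono 16-bit WAV whose i-th sample equals i % 32768 (hypothetical helper)."""
    with wave.open(fileobj, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(sample_rate)
        w.writeframes(
            b"".join(
                (i % 32768).to_bytes(2, "little", signed=True)
                for i in range(num_frames)
            )
        )


def read_segment(fileobj, start_sec, duration_sec):
    """Read only [start_sec, start_sec + duration_sec) instead of the whole file."""
    with wave.open(fileobj, "rb") as w:
        sample_rate = w.getframerate()
        w.setpos(int(round(start_sec * sample_rate)))  # seek, do not read, up to the start
        return w.readframes(int(round(duration_sec * sample_rate)))


# A 3-second "recording" held in memory; a path on disk works the same way.
recording = io.BytesIO()
write_test_wav(recording, num_frames=3 * 16000)
recording.seek(0)

# Load only half a second starting at 1.0 s.
segment = read_segment(recording, start_sec=1.0, duration_sec=0.5)
```

Even with partial reads like this, every cut still costs at least one seek into a different file or offset, which is the random-access pattern described above.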

Lhotse Shar is definitely an answer to that, but I currently can't find a spare moment to write up the tutorial. In many cases, though, it will be sufficient to use WebDataset, which offers pretty much the same I/O speed-up advantages. Ple…
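As a rough illustration of where that I/O speed-up comes from: WebDataset stores samples in tar shards that are read front to back, so the disk sees one sequential stream instead of many random seeks. The sketch below uses only the standard `tarfile` module, not the WebDataset or Lhotse APIs; `write_shard`, `iter_shard`, and the sample names are hypothetical.

```python
import io
import tarfile


def write_shard(fileobj, samples):
    """Pack {name: bytes} items back to back into a single tar shard."""
    with tarfile.open(fileobj=fileobj, mode="w") as tar:
        for name, payload in samples.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))


def iter_shard(fileobj):
    """Yield (name, bytes) in storage order using a forward-only stream."""
    # Mode "r|" treats the input as a stream: no seeking backwards is possible,
    # so the access pattern is strictly sequential.
    with tarfile.open(fileobj=fileobj, mode="r|") as tar:
        for member in tar:
            if member.isfile():
                yield member.name, tar.extractfile(member).read()


# Stand-in payloads; in practice these would be the audio bytes of each cut.
shard = io.BytesIO()
write_shard(shard, {"cut-0.wav": b"\x00\x01" * 4, "cut-1.wav": b"\x02\x03" * 4})
shard.seek(0)

loaded = list(iter_shard(shard))
```

The trade-off is that items come out in the order they were written, so shuffling has to happen across shards or in a buffer rather than by random access.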

Replies: 2 comments 3 replies

desh2608
Jan 25, 2023
Collaborator Author

@desh2608
desh2608 Jan 26, 2023
Collaborator Author

@pzelasko

@pzelasko

Answer selected by desh2608
Category
Q&A
2 participants