Hierarchy measures: speed vs memory? #340

bmcfee · 2021-07-29T15:44:45Z

Most of the hierarchy measures are defined in terms of a meet matrix. This is a square matrix where each row and column corresponds to a frame (default, sampled at 10Hz per the segment metrics), and the value is a small integer corresponding to the maximum depth within the hierarchy at which frames i,j receive the same label. We do this twice, once for the reference and once for the estimate.

For a typical ~4min track, this object is of size 2400x2400, so not huge, but not tiny either. In most cases, most of the matrix is empty, so we store it as a CSR (compressed sparse rows) matrix. However, we don't know the sparsity pattern in advance, so it is first built as a LIL matrix and then converted to CSR, as seen here (mir_eval.hierarchy._meet):

mir_eval/mir_eval/hierarchy.py

Lines 215 to 236 in 576aad4

    
           # Initialize the meet matrix 
        
           meet_matrix = scipy.sparse.lil_matrix((n, n), dtype=np.uint8) 
        
           for level, (intervals, labels) in enumerate(zip(intervals_hier, 
        
                                                           labels_hier), 1): 
        
               # Encode the labels at this level 
        
               lab_enc = util.index_labels(labels)[0] 
        
               # Find unique agreements 
        
               int_agree = np.triu(np.equal.outer(lab_enc, lab_enc)) 
        
               # Map intervals to frame indices 
        
               int_frames = (_round(intervals, frame_size) / frame_size).astype(int) 
        
               # For each intervals i, j where labels agree, update the meet matrix 
        
               for (seg_i, seg_j) in zip(*np.where(int_agree)): 
        
                   idx_i = slice(*list(int_frames[seg_i])) 
        
                   idx_j = slice(*list(int_frames[seg_j])) 
        
                   meet_matrix[idx_i, idx_j] = level 
        
                   if seg_i != seg_j: 
        
                       meet_matrix[idx_j, idx_i] = level

Rather than iterating over rows or columns, we use slice notation to assign blocks at a time. It turns out that this is horrendously slow in LIL matrices, and comprises the major bottleneck in the evaluation. The subsequent processing mainly involves sorting and inversion counting, and I believe is about as fast as can be (cf #201 and #225).

If we're willing to sacrifice memory for speed, it's simple enough to replace the initial LIL matrix by an ndarray. (I've done this in a local branch, and it is indeed significantly faster and sufficient for my current needs.) This hardly seems like an ideal solution though, as it completely defeats the purpose of constraining memory by going through a sparse representation. I expect that with a bit of cleverness, we could probably bypass this initial lil/dense issue and go directly to CSR format.

The text was updated successfully, but these errors were encountered:

craffel · 2021-07-29T17:24:29Z

Thanks Brian. Maybe as a preliminary step we could add an arg that specifies whether to use a LIL matrix or an ndarray?

bmcfee · 2021-07-29T17:25:56Z

we could add an arg that specifies whether to use a LIL matrix or an ndarray?

I thought about that. It's doable, but kind of ugly from an API perspective since it would have to thread through the entire module.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hierarchy measures: speed vs memory? #340

Hierarchy measures: speed vs memory? #340

bmcfee commented Jul 29, 2021

craffel commented Jul 29, 2021

bmcfee commented Jul 29, 2021

Hierarchy measures: speed vs memory? #340

Hierarchy measures: speed vs memory? #340

Comments

bmcfee commented Jul 29, 2021

craffel commented Jul 29, 2021

bmcfee commented Jul 29, 2021