
Token Classification Memory Issue #117

Open
mattdeeperinsights opened this issue Nov 23, 2022 · 0 comments


mattdeeperinsights commented Nov 23, 2022

Hi there, I am using TokenClassificationExplainer, which works great with small text inputs. However, for a large 512-token input my machine runs out of memory and crashes.
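
A minimal reproduction sketch, assuming the usual transformers-interpret setup (the dslim/bert-base-NER checkpoint and the repeated sentence are just placeholders to get near the 512-subtoken limit):

```python
from transformers import AutoModelForTokenClassification, AutoTokenizer
from transformers_interpret import TokenClassificationExplainer

# Any token classification checkpoint should reproduce this;
# dslim/bert-base-NER is just an example.
model_name = "dslim/bert-base-NER"
model = AutoModelForTokenClassification.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

explainer = TokenClassificationExplainer(model, tokenizer)

# Works fine for short inputs...
word_attributions = explainer("We visited Paris last summer.")

# ...but an input near the 512-subtoken limit computes attributions for
# every subtoken with respect to every subtoken, and memory blows up.
long_text = "We visited Paris last summer. " * 70  # roughly 512 subtokens
word_attributions = explainer(long_text)
```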

This is probably because of the [unavoidable?] quadratic memory cost of creating attributions for all 512 subtokens for each subtoken, i.e. 512 × 512 = 262,144 attribution pairs. Since the local context is most likely to contribute to the classification of each token, it could be useful to reduce the number of pairs we make for each token. For example, the 512th token is less related to the 10th token than tokens 1–20 are.

Would it be possible to add a context_window_size: Optional[int] = None somewhere (possibly here) to reduce the number of pairs, so we only calculate attributions for the context_window_size tokens to the left and right of each token?
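
As a hypothetical sketch of the effect (attribution_pairs is an illustrative helper, not a function in the library), the windowing would cut the pair count from quadratic to roughly linear in the sequence length:

```python
from typing import List, Optional, Tuple

def attribution_pairs(
    n_tokens: int, context_window_size: Optional[int] = None
) -> List[Tuple[int, int]]:
    """Indices (target_token, attributed_token) to compute attributions for.

    context_window_size=None keeps today's behaviour: every token is
    attributed against every token, i.e. n_tokens ** 2 pairs. A window
    of w cuts that to at most n_tokens * (2 * w + 1) pairs.
    """
    pairs: List[Tuple[int, int]] = []
    for i in range(n_tokens):
        if context_window_size is None:
            lo, hi = 0, n_tokens
        else:
            lo = max(0, i - context_window_size)
            hi = min(n_tokens, i + context_window_size + 1)
        pairs.extend((i, j) for j in range(lo, hi))
    return pairs

# 512 tokens, full attribution: 262,144 pairs.
assert len(attribution_pairs(512)) == 512 * 512
# With a +/-20 token window: at most 512 * 41 = 20,992 pairs.
assert len(attribution_pairs(512, context_window_size=20)) <= 512 * 41
```

With a ±20 window on a 512-token input, that is roughly a 12× reduction in pairs, and leaving the parameter as None preserves the current behaviour.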

I have tried to dig into the code to add this functionality in a PR, but I don't understand all of the functions, as they are missing docstrings and comments (in here).
