Add Masked NLL interface #466

ilkerkesen · 2019-06-17T10:26:11Z

This PR adds masked NLL functionality with two different interfaces. Currently, tests fail because of a multiple-dispatch method-matching issue.

ekinakyurek · 2019-06-17T18:41:02Z

@ilkerkesen I had a different implementation for masked-nll over here. It is a bit hacky, but the advantage is that it doesn't require a separate mask array, and it is probably slightly faster because it has no extra indexing operation. The hack works by requiring the user to put 0 into the answer array wherever the answer should be masked. Then findindices returns locations only for the non-masked indices.
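A minimal sketch of the zero-skipping idea, in plain Julia and independent of Knet. The names `logsoftmax_cols` and `masked_nll_sketch` are hypothetical and are not the library's implementation; the sketch assumes a (classes, instances) score matrix and an answer vector where 0 marks a masked position.

```julia
# Hypothetical helpers for illustration; not the Knet implementation.
# Column-wise log-softmax of a (classes, instances) score matrix.
function logsoftmax_cols(scores::AbstractMatrix)
    m = maximum(scores; dims=1)                      # subtract the max for numerical stability
    return scores .- m .- log.(sum(exp.(scores .- m); dims=1))
end

# Average negative log likelihood over the positions whose answer is nonzero.
function masked_nll_sketch(scores::AbstractMatrix, answers::AbstractVector{<:Integer})
    lp = logsoftmax_cols(scores)
    total = 0.0
    n = 0
    for (j, a) in enumerate(answers)
        a == 0 && continue                           # a zero answer means "masked", so skip it
        total -= lp[a, j]
        n += 1
    end
    return total / n
end

scores = zeros(3, 4)                                 # uniform scores over 3 classes
answers = [1, 0, 3, 0]                               # positions 2 and 4 are masked
loss = masked_nll_sketch(scores, answers)
```

With uniform scores every unmasked position contributes `log(3)`, so the masked positions leave the average unchanged.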

ilkerkesen · 2019-06-17T19:24:47Z

I like yours more. I'm just wondering whether we ever need positive mask/pad indices in the output array, just for ease of use.

ekinakyurek · 2019-06-17T19:55:56Z

Thanks @ilkerkesen. I think there is also a way to support both interfaces on top of my version of findindices, by adding something like the function below:

function nll(y, a::AbstractArray{<:Integer}, mask; dims=1, average=true)
    a[mask] .= 0 # No AD overhead in the masking since `a` is not a `Param` object
    return nll(y, a; dims=dims, average=average)
end

For example, a user may want to mask different tokens in each iteration.

I am not sure I understand your question correctly:
Is the output array the array of predicted scores, i.e. the input to nll?

ilkerkesen · 2019-06-17T23:06:41Z

The a[mask] .= 0 trick is not feasible because it makes nll mutate its argument. I think your initial solution is much simpler and more elegant. That said, I agree with you: an explicit interface where masking doesn't depend on the gold output values would be a nice feature to have, since it is convenient for the evaluation phase.
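The mutation concern can be illustrated with a small sketch. The function names `mask_in_place!` and `mask_copy` are hypothetical; here `true` in the mask marks a position to ignore, matching the `a[mask] .= 0` form.

```julia
# Hypothetical function names, for illustration only.
mask_in_place!(a, mask) = (a[mask] .= 0; a)     # overwrites the caller's array
mask_copy(a, mask) = ifelse.(mask, 0, a)        # builds a new array, leaves `a` alone

gold = [3, 1, 2, 4]
mask = [false, true, false, true]               # true marks positions to ignore

masked = mask_copy(gold, mask)                  # gold still holds the original labels here
gold_after_copy = copy(gold)

mask_in_place!(gold, mask)                      # gold itself now has zeros at masked positions
```

After the in-place call the original labels at the masked positions are gone, which matters if the same answer array is reused with a different mask in a later iteration.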

ekinakyurek · 2019-06-18T17:33:56Z

What do you suggest then? Should I make another PR?

This should work:

function nll(y, a::AbstractArray{<:Integer}, mask; dims=1, average=true)
    # mask is a boolean or integer array where 0s or falses mark the masked positions
    return nll(y, a .* mask; dims=dims, average=average)
end
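A short sketch of what the elementwise product does to the answer key, with made-up values. Note that the mask polarity here is the opposite of the `a[mask] .= 0` version: 0 or `false` marks a masked position, and the original array is left untouched.

```julia
answers = [3, 1, 2, 4]
keep_bool = [true, false, true, false]    # false marks a masked position
keep_int = [1, 0, 1, 0]                   # 0 marks a masked position

key_bool = answers .* keep_bool           # Int times Bool promotes to Int
key_int = answers .* keep_int             # both keys have zeros at positions 2 and 4
```

Either resulting key can then be passed to a zero-skipping nll, and `answers` can be reused with a different mask in the next iteration.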

denizyuret · 2019-08-18T12:00:36Z

The least intrusive (backward compatible) change is to implement Ekin's suggestion of skipping zeros in the answer key, implemented in 2f33926.

This does not break any existing code because zero is currently not a valid answer.

Users will most likely want to use a positive number for padding rather than zero; otherwise the decoder input will be invalid, for example in an s2s model. The convention is to prepare, before calling nll/accuracy, an answer key in which padding is marked with zeros. I believe this is the method used in Keras.
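As a rough sketch of that convention: the padding id `pad` and the token values below are made up, and the sketch assumes the padding id is distinct from every real token, including any end-of-sequence token.

```julia
pad = 1                                    # hypothetical padding id; a valid positive index
tokens = [5, 7, 9, pad, pad]               # one padded target sequence

decoder_input = tokens[1:end-1]            # keeps the positive pad id, so index lookups stay valid
answer_key = [t == pad ? 0 : t for t in tokens[2:end]]   # padding becomes 0 in the answer key
```

The decoder input keeps the positive padding id, while the answer key carries zeros at the padded positions so that nll/accuracy can skip them.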

Eventually it would be best to implement complete MaskedSequence, PaddedSequence, and PackedSequence types, as in PyTorch. The current solution should be seen as a low-level solution. Check out the commit and close the PR if it is satisfactory.

P.S. I had to change the behavior of average=false to return a pair (total,count) rather than just total, as this was needed everywhere. Please check and fix if average=false is used in your code.
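One reason a (total, count) pair is useful: with masking, batches can contain different numbers of unmasked answers, so a corpus-level average has to be weighted by count. The sketch below uses a hypothetical stand-in, `nll_total_count`, that takes log probabilities directly; it is not the Knet function.

```julia
# Hypothetical stand-in for a loss that returns (total, count); not the Knet function.
# logprobs is a (classes, instances) matrix of log probabilities; 0 answers are skipped.
function nll_total_count(logprobs::AbstractMatrix, answers::AbstractVector{<:Integer})
    total = 0.0
    n = 0
    for (j, a) in enumerate(answers)
        a == 0 && continue
        total -= logprobs[a, j]
        n += 1
    end
    return (total, n)
end

lp1 = fill(log(0.5), 2, 3)                  # 2 classes, each with probability 0.5
lp2 = fill(log(0.25), 4, 3)                 # 4 classes, each with probability 0.25
batches = [(lp1, [1, 2, 0]), (lp2, [0, 0, 1])]

results = [nll_total_count(lp, a) for (lp, a) in batches]
grand_total = sum(first, results)
n_answers = sum(last, results)
corpus_average = grand_total / n_answers    # weighted by the number of unmasked answers
mean_of_batch_means = sum(t / n for (t, n) in results) / length(results)
```

Here the first batch has two unmasked answers and the second has one, so `corpus_average` and `mean_of_batch_means` differ; only the former weights every answer equally.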

ilkerkesen added 6 commits June 16, 2019 21:01

revisit nll tests, add masking nll tests

8b217f0

add tests for nll over data

700b497

add more tests

d3eaa62

add masked nll interface with docstrings

65afe92

revisit nll tests, add @test_throws cases

7e02066

change mask/ignore mechanism, document it, make tests pass

e45652b
