Learning Randomly Perturbed Structured Predictors for Direct Loss Minimization

This repository is the official implementation of the bipartite matching experiment from the paper "Learning Randomly Perturbed Structured Predictors for Direct Loss Minimization" (ICML 2021). In this work we learn the variance as well as the mean of randomized structured predictors, and show that doing so strikes a better balance between the learned score function and the randomized noise.

Architecture

The expected loss over Gumbel noise is differentiated directly with respect to the parameters w of the signal μ and the parameters v of the variance controller σ. The network μ has a first fully connected layer that maps the sets of samples to an intermediate representation (32 neurons), and a second fully connected layer that turns those representations into batches of latent permutation matrices, each of dimension d by d. The network σ has a single layer that connects the input sample sequences to a single output, which is then passed through a softplus activation. We chose this activation to enforce a positive σ value.
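The forward pass of the two networks described above can be sketched as follows. This is a minimal illustration in plain Python, not the code in this repository: the helper names (`init_layer`, `dense`, `mu_network`, `sigma_network`), the ReLU on the hidden layer, the weight initialization, and the choice to feed each element of the sequence through μ independently are all assumptions made for the example.

```python
import math
import random

HIDDEN = 32  # size of the intermediate representation stated above


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) for a fully connected layer (hypothetical init)."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_out)] for _ in range(n_in)]
    return weights, [0.0] * n_out


def dense(x, layer):
    """Apply a fully connected layer to one input vector."""
    weights, biases = layer
    return [
        sum(x[i] * weights[i][j] for i in range(len(x))) + biases[j]
        for j in range(len(biases))
    ]


def softplus(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def mu_network(sequence, layer1, layer2):
    """Map a length-d sequence to a d x d score matrix, one row per element."""
    rows = []
    for value in sequence:
        # Hidden activation is an assumption; the text does not name one.
        hidden = [max(h, 0.0) for h in dense([value], layer1)]
        rows.append(dense(hidden, layer2))
    return rows


def sigma_network(sequence, layer):
    """Map the whole sequence to a single noise scale via softplus."""
    return softplus(dense(sequence, layer)[0])


rng = random.Random(0)
d = 5  # sequence length, i.e. 'n_numbers'
sequence = [rng.random() for _ in range(d)]
mu = mu_network(sequence, init_layer(1, HIDDEN, rng), init_layer(HIDDEN, d, rng))
sigma = sigma_network(sequence, init_layer(d, 1, rng))
```

Here `mu` is a d by d list of scores and `sigma` is a single positive float for the sequence, matching the output shapes described above.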

How to run this code

Settings to consider:

'n_numbers' controls the sequence length (d).

'batch_size' controls the number of sequences used in training.

'test_set_size' controls the number of sequences to evaluate in the test set.

Hyper-parameters to consider:

'samples_per_num_train' controls how many noise perturbations are drawn for each permutation representation. We explored values of one and five in our experiments. Five is usually more beneficial as the sequence length increases. The results in the paper use five noise perturbations for each permutation representation.
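To illustrate what drawing several perturbations per permutation representation means, the sketch below adds independent Gumbel noise, scaled by a noise level, to a toy score matrix. The function names `sample_gumbel` and `perturb_scores`, the toy matrix, and the fixed `sigma=0.5` are hypothetical; in the actual model the scale would come from the σ network.

```python
import math
import random


def sample_gumbel(rng):
    """Draw one standard Gumbel(0, 1) sample by inverse transform sampling."""
    u = rng.random()
    while u == 0.0:  # guard against log(0)
        u = rng.random()
    return -math.log(-math.log(u))


def perturb_scores(mu, sigma, samples_per_num_train, rng):
    """Return `samples_per_num_train` noisy copies of the score matrix `mu`."""
    return [
        [[score + sigma * sample_gumbel(rng) for score in row] for row in mu]
        for _ in range(samples_per_num_train)
    ]


rng = random.Random(0)
mu = [[0.2, 1.5, -0.3], [1.1, 0.0, 0.4], [-0.7, 0.9, 2.0]]  # toy 3 x 3 scores
perturbed = perturb_scores(mu, sigma=0.5, samples_per_num_train=5, rng=rng)
```

With `samples_per_num_train=5`, `perturbed` holds five independently perturbed 3 by 3 matrices for the same underlying scores; setting it to 1 yields a single noisy copy.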

The trained model is evaluated on a test set, and the following metrics are written to the log file:

Prop. wrong: the proportion of errors in sorting.
Prop. any wrong: the proportion of sequences where there was at least one error.
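As a rough sketch of how these two metrics could be computed, the example below compares predicted and correctly sorted sequences. The function name `sorting_metrics` and the convention that an error is a position where the predicted value differs from the target value are assumptions for illustration; the script in this repository may count errors differently.

```python
def sorting_metrics(predicted, target):
    """Return (prop_wrong, prop_any_wrong) for two equally sized batches."""
    total = wrong = seqs_with_error = 0
    for pred_seq, true_seq in zip(predicted, target):
        # Count positions where the predicted element differs from the target.
        errors = sum(p != t for p, t in zip(pred_seq, true_seq))
        wrong += errors
        total += len(true_seq)
        seqs_with_error += errors > 0
    return wrong / total, seqs_with_error / len(target)


target = [[1, 2, 3, 4], [2, 5, 7, 9]]
predicted = [[1, 2, 3, 4], [2, 7, 5, 9]]  # second sequence has two swapped elements
prop_wrong, prop_any_wrong = sorting_metrics(predicted, target)
```

In this toy batch, 2 of 8 positions are wrong, so `prop_wrong` is 0.25, and 1 of 2 sequences contains an error, so `prop_any_wrong` is 0.5.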

Repository: HeddaCohenIndelman/PerturbedStructuredPredictorsDirect

Folders and files

Latest commit

History

Repository files navigation

Learning Randomly Perturbed Structured Predictors for Direct Loss Minimization

Architecture

How to run this code

About

Topics

Resources

Stars

Watchers

Forks

Languages