Adding NTK adaptive loss function #834

Closed · wants to merge 7 commits

Conversation

ayushinav (Contributor)

Checklist

  • Appropriate tests were added
  • Any code changes were done in a way that does not break public API
  • All documentation related to code changes were updated
  • The new code follows the contributor guidelines, in particular the SciML Style Guide and COLPRAC.
  • Any new documentation only uses public API


@ayushinav (Contributor Author) commented Mar 16, 2024

One of the arguments for the NTK adaptive loss would be the kernel size (e.g., the 4th cell here). This is the number of points used to build the NTK, which means we would have to sample points again when computing the adaptive loss weights. We could reuse the already-sampled points, but under the current implementation that is not possible because the points are generated inside merge_strategy_with_loss_function in discretize.jl. Also, the kernel size might differ from the batch size, though reusing points instead of resampling would still be an option. A sketch of the weight computation I have in mind is below.
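For concreteness, here is a minimal sketch (not the PR's code; datafree_pde_loss, datafree_bc_loss, θ, and the point sets are hypothetical stand-ins) of how trace-based NTK weights could be computed from data-free loss functions:

using Zygote

# tr(K) for one loss term: the sum over sampled points of the squared norm of
# that point's residual gradient w.r.t. the parameters θ. Points are assumed
# to be stored column-wise in a matrix (an assumption, not NeuralPDE's API).
function ntk_trace(datafree_loss, θ, points)
    sum(eachcol(points)) do x
        g = Zygote.gradient(p -> datafree_loss(x, p), θ)[1]
        sum(abs2, g)
    end
end

tr_pde = ntk_trace(datafree_pde_loss, θ, pde_points)  # kernel-size many points
tr_bc  = ntk_trace(datafree_bc_loss, θ, bc_points)

# Balance the terms by total trace over per-term trace, as in the NTK paper.
λ_pde = (tr_pde + tr_bc) / tr_pde
λ_bc  = (tr_pde + tr_bc) / tr_bc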

As a workaround, I tried generating the points inside generate_adaptive_loss_function. The issue now is that we don't have access to the data-free loss functions, because the current implementation only passes the loss functions after taking the mean over the losses of the sampled points. While the data-free functions are saved for future calls later on, pinnrep does not hold them until then.
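To illustrate the two shapes involved (schematic signatures only, not NeuralPDE's exact API; residuals is a placeholder):

using Statistics

# A data-free loss maps (points, parameters) to per-point residuals, which is
# what the NTK computation needs; the reduced loss has the points baked in and
# only returns the scalar mean.
datafree_loss = (points, θ) -> residuals(points, θ)          # per-point values
reduced_loss  = θ -> mean(abs2, datafree_loss(points, θ))    # scalar after mean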

So, a simpler solution would be to populate pinnrep.loss_functions earlier, right after bc_loss_functions and pde_loss_functions are defined. Since we do not have full_loss_function and additional_loss at that point, we could set them as null functions, so something like

pinnrep.loss_functions = PINNLossFunctions(bc_loss_functions, pde_loss_functions,
                                           () -> (), () -> (),
                                           datafree_pde_loss_functions,
                                           datafree_bc_loss_functions)

here, or probably even earlier, keeping all the functions as nulls until we have the real ones. Wanted to confirm if it's fine doing this, @ChrisRackauckas @sathvikbhagavan.

IIUC, the sampler in StochasticTraining best resembles their sampler (3rd cell here).

The current implementation I have here takes the gradient after taking the MSE; that is, it computes the square of the gradient of the sum of the squared errors, whereas we want the sum of the squares of the gradients of the individual squared errors.
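In symbols (my reading of the above, with e_i the error at the i-th sampled point and θ the parameters):

$$\Big\lVert \nabla_\theta \tfrac{1}{N}\textstyle\sum_i e_i^2 \Big\rVert^2 \ \text{(current)} \qquad \text{vs.} \qquad \sum_i \big\lVert \nabla_\theta\, e_i^2 \big\rVert^2 \ \text{(intended)}$$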

@ayushinav (Contributor Author) commented

I'd be happy to work on #703, which would help resolve the issue here as well. As I understand it now, it's mostly about making a struct that contains the strategy and the sampled points, and maybe the domains as well? Something like the sketch below.
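A hypothetical shape for that struct (names illustrative, not an actual NeuralPDE type):

# Bundle a training strategy with the points it sampled (and optionally the
# domains), so downstream code such as the NTK adaptive loss can reuse the
# same points instead of regenerating them inside
# merge_strategy_with_loss_function.
struct StrategyWithPoints{S, P, D}
    strategy::S   # e.g. StochasticTraining(...)
    points::P     # the sampled point sets, one per loss term
    domains::D    # problem domains, if resampling is needed
end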

@ayushinav closed this May 14, 2024