
Pareto front #33

Open · ianhbell opened this issue May 2, 2017 · 4 comments
ianhbell commented May 2, 2017

Has there been any thought given to Pareto front optimization? There is always a tradeoff between tree size and model fidelity, which I gather you handle with parsimony. The alternative is to keep every model that is non-dominated, i.e. on the Pareto front. I couldn't see any clear way of hacking that into gplearn.
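To make the non-domination idea concrete, here is a minimal sketch over (error, tree size) pairs; the objective values are assumed to be given, and none of this is gplearn API:

def dominates(a, b):
    # a dominates b if it is at least as good on both objectives
    # (error, size) and strictly better on at least one
    return (all(x <= y for x, y in zip(a, b))
            and any(x < y for x, y in zip(a, b)))

def pareto_front(models):
    # keep every point that no other point dominates
    return [m for m in models
            if not any(dominates(o, m) for o in models if o != m)]

# hypothetical (error, tree size) pairs for four candidate models
candidates = [(0.10, 12), (0.10, 30), (0.05, 30), (0.20, 5)]
print(pareto_front(candidates))  # [(0.1, 12), (0.05, 30), (0.2, 5)]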

@trevorstephens (Owner)
Sounds interesting @ianhbell ... Got a citation in mind?

Ohjeah commented May 6, 2017

This should be a good starting point: https://www.iitk.ac.in/kangal/Deb_NSGA-II.pdf

@remiadon
Hi @ianhbell,

Just out of curiosity: suppose I define a complexity measure (yielding the number of nodes in the tree representation of an expression) and use it inside my custom fitness, a bit like so:

from sklearn.metrics import r2_score

def my_custom_fitness(expr, X, y_true):
    # make_prediction and complexity are assumed helpers: evaluate the
    # expression on X, and count the nodes in its tree representation
    y_pred = make_prediction(expr, X)
    return r2_score(y_true, y_pred) - complexity(expr) / 1000

Therefore:

  • for two expressions yielding the same r2_score, my_custom_fitness would favour the simpler one
  • for two expressions of the same complexity (i.e. equally simple trees), my_custom_fitness would favour the one that yields the better r2_score

Given these properties, the expression found at the end of fit should lie on the Pareto front (at least the front drawn over all evaluated expressions).

Am I missing something?
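For what it's worth, gplearn's built-in parsimony_coefficient already applies this kind of scalarized penalty: it subtracts a multiple of the program's length from its raw fitness. A minimal usage sketch, with purely illustrative toy data:

import numpy as np
from gplearn.genetic import SymbolicRegressor

# toy data, purely illustrative
rng = np.random.RandomState(0)
X = rng.uniform(-1, 1, (100, 2))
y = X[:, 0] ** 2 - X[:, 1]

# parsimony_coefficient penalizes fitness in proportion to program
# length: the same scalarization idea as the custom fitness above
est = SymbolicRegressor(population_size=500, generations=10,
                        parsimony_coefficient=0.01, random_state=0)
est.fit(X, y)
print(est._program)  # best program under the penalized fitness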

@remiadon
Copy link

Answering myself with a reference:

Pareto-Front Exploitation in Symbolic Regression

From page 294:

There is, however, a significant difference between using a Pareto front as a post-run analysis tool vs. actively optimizing the Pareto front during a GP run. In the latter case the Pareto front becomes the objective that is being optimized instead of the fitness (accuracy) of the “best” model.

So yes, I was missing something big.
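In other words, a scalar penalty like the one above can at best support a post-run Pareto analysis: log the (error, size) of every expression evaluated during the run and extract the non-dominated set afterwards. A minimal sketch of that extraction, assuming a hypothetical evaluated log of (error, size, expression) triples:

def post_run_front(evaluated):
    # evaluated: hypothetical list of (error, size, expression) triples
    # logged for every expression scored during the run
    front = [e for e in evaluated
             if not any(o[0] <= e[0] and o[1] <= e[1] and
                        (o[0] < e[0] or o[1] < e[1]) for o in evaluated)]
    return sorted(front, key=lambda e: e[1])  # simplest first

Actively optimizing the front, as NSGA-II does, instead uses non-domination rank and crowding distance as the selection criterion at every generation, which a single scalar fitness cannot express.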
