The examples of filter and meta-labeling are cheating #533

eromoe · 2023-03-30T05:35:37Z

As docs in filter chapter says:

The core idea is that labeling every trading day is a fools errand, researchers should instead focus on forecasting how markets behave during specific events

It is fine while using these methods in model trainning . But in real market, **you need filter label too! ** Which trend_scanning_labels/cusum_filter / balabala.. cannot apply.
So what happen is you need train a label filter model at first , all the examples avoid this , obviously get a very good result .

As a rigorous financial quantitative engineer, you should not omit this essential case .

The text was updated successfully, but these errors were encountered:

Giogitter · 2023-05-26T09:16:59Z

I don't know if it cheating but yeah that's the main issue with their framework which seems very well made overall but in order to use classification you need to know a-priori before the training at which time to enter in position and there is no proposal to do that with an ml model. I think that is the main missing part of their system to get a fully build ml architecture, A first model for events, a second for side and a third for bet sizing (metalabeling)

eromoe · 2023-05-26T09:38:18Z

The most important thing is the label .
If you know which point profit most and label it, build a relative good model is pretty easy.
They provide a lot of labeling methods all have lookahead bias, I don't think they don't aim to cheat .

Maybe the valueable part is only metalabeling, it does help bet sizing, but that's all, it can't bring outperform to moden model(LGB/XGB/DNN) .

Giogitter · 2023-05-26T10:54:43Z

But for classifiers it is necessary to set right labels before the training that comes from the constraints of the algo itself. Maybe one way to solve that is to voluntary decrease the amont of the accuracy of labels like they did for backtesting to find optimal rules in one of their video, but the model will learn on fake labels and that will false his learning and predictions.

We can also view this problem from the other side if you set labels on events that are not proof to be particularly relevant it will miss opportunities that the model could have find otherwise.

Another way is to do the opposite, label all the market at each point with just the trading rules and let it filter events itself instead of outsourced predefined filter that could false it in both ways.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The examples of filter and meta-labeling are cheating #533

The examples of filter and meta-labeling are cheating #533

eromoe commented Mar 30, 2023 •

edited

Giogitter commented May 26, 2023

eromoe commented May 26, 2023

Giogitter commented May 26, 2023 •

edited

The examples of filter and meta-labeling are cheating #533

The examples of filter and meta-labeling are cheating #533

Comments

eromoe commented Mar 30, 2023 • edited

Giogitter commented May 26, 2023

eromoe commented May 26, 2023

Giogitter commented May 26, 2023 • edited

eromoe commented Mar 30, 2023 •

edited

Giogitter commented May 26, 2023 •

edited