Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The examples of filter and meta-labeling are cheating #533

Open
eromoe opened this issue Mar 30, 2023 · 3 comments
Open

The examples of filter and meta-labeling are cheating #533

eromoe opened this issue Mar 30, 2023 · 3 comments

Comments

@eromoe
Copy link

eromoe commented Mar 30, 2023

As docs in filter chapter says:

The core idea is that labeling every trading day is a fools errand, researchers should instead focus on forecasting how markets behave during specific events

It is fine while using these methods in model trainning . But in real market, **you need filter label too! ** Which trend_scanning_labels/cusum_filter / balabala.. cannot apply.
So what happen is you need train a label filter model at first , all the examples avoid this , obviously get a very good result .

As a rigorous financial quantitative engineer, you should not omit this essential case .

@Giogitter
Copy link

I don't know if it cheating but yeah that's the main issue with their framework which seems very well made overall but in order to use classification you need to know a-priori before the training at which time to enter in position and there is no proposal to do that with an ml model. I think that is the main missing part of their system to get a fully build ml architecture, A first model for events, a second for side and a third for bet sizing (metalabeling)

@eromoe
Copy link
Author

eromoe commented May 26, 2023

The most important thing is the label .
If you know which point profit most and label it, build a relative good model is pretty easy.
They provide a lot of labeling methods all have lookahead bias, I don't think they don't aim to cheat .

Maybe the valueable part is only metalabeling, it does help bet sizing, but that's all, it can't bring outperform to moden model(LGB/XGB/DNN) .

@Giogitter
Copy link

Giogitter commented May 26, 2023

But for classifiers it is necessary to set right labels before the training that comes from the constraints of the algo itself. Maybe one way to solve that is to voluntary decrease the amont of the accuracy of labels like they did for backtesting to find optimal rules in one of their video, but the model will learn on fake labels and that will false his learning and predictions.

We can also view this problem from the other side if you set labels on events that are not proof to be particularly relevant it will miss opportunities that the model could have find otherwise.

Another way is to do the opposite, label all the market at each point with just the trading rules and let it filter events itself instead of outsourced predefined filter that could false it in both ways.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants