How to generate positive and negative samples for real-time training #15

Open
gzy0-0 opened this issue Apr 27, 2020 · 0 comments

Comments

gzy0-0 commented Apr 27, 2020

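An interval join between impressions and clicks produces the positive samples, but I also need the impressions that were never clicked. Can the interval join emit those records, for example at the moment the buffered data is evicted?
If I use coGroup instead, window starts are aligned by the following function: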
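
For context, Flink's DataStream interval join emits only matched pairs, so un-clicked impressions do not come out of it. One commonly used alternative is to keep each impression in keyed state and label it when either a click arrives or a timeout passes; in Flink this is usually written with a `KeyedProcessFunction` and an event-time timer. The class below is only a plain-Java simulation of that logic, so it runs without Flink. `ImpressionLabeler`, its method names, and the `id,label` output format are all hypothetical, and clicks that arrive before their impression are simply ignored in this sketch.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical simulation of "buffer the impression, label it on click or on timeout".
// The map stands in for Flink keyed state; onWatermark stands in for event-time timers.
final class ImpressionLabeler {
    private final long timeoutMs;
    // impression id -> impression timestamp, for impressions not yet labeled
    private final Map<String, Long> pending = new LinkedHashMap<>();
    // emitted samples in the form "id,label" (1 = clicked, 0 = not clicked)
    private final List<String> samples = new ArrayList<>();

    ImpressionLabeler(long timeoutMs) {
        this.timeoutMs = timeoutMs;
    }

    void onImpression(String id, long ts) {
        pending.put(id, ts);
    }

    void onClick(String id, long ts) {
        Long impressionTs = pending.get(id);
        // Positive sample only if the click falls within the allowed interval.
        if (impressionTs != null && ts >= impressionTs && ts - impressionTs <= timeoutMs) {
            pending.remove(id);
            samples.add(id + ",1");
        }
    }

    // Called as event time advances; expired impressions become negative samples.
    void onWatermark(long watermark) {
        Iterator<Map.Entry<String, Long>> it = pending.entrySet().iterator();
        while (it.hasNext()) {
            Map.Entry<String, Long> entry = it.next();
            if (entry.getValue() + timeoutMs < watermark) {
                samples.add(entry.getKey() + ",0");
                it.remove();
            }
        }
    }

    List<String> samples() {
        return samples;
    }
}
```

Because the label is decided per impression rather than per window, this approach does not depend on where window boundaries fall, and the only storage it needs is the operator's own state.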

public static long getWindowStartWithOffset(long timestamp, long offset, long windowSize) {
    return timestamp - (timestamp - offset + windowSize) % windowSize;
}

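As a result, an impression and its click may not fall into the same window. How should this case be handled? Do I really have to record them in external storage?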
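
To make the boundary problem concrete, here is a small sketch that applies the formula quoted above to an impression and a click two seconds apart. The 1-minute window size, the timestamps, and the `WindowBoundaryDemo` and `sameWindow` names are assumptions chosen for illustration only.

```java
// Shows that two events only 2 s apart can be assigned to different tumbling windows.
final class WindowBoundaryDemo {
    // Same formula as quoted in the issue.
    static long getWindowStartWithOffset(long timestamp, long offset, long windowSize) {
        return timestamp - (timestamp - offset + windowSize) % windowSize;
    }

    // True if both timestamps map to the same window start (offset 0).
    static boolean sameWindow(long t1, long t2, long windowSize) {
        return getWindowStartWithOffset(t1, 0L, windowSize)
                == getWindowStartWithOffset(t2, 0L, windowSize);
    }

    public static void main(String[] args) {
        long windowSize = 60_000L;  // hypothetical 1-minute tumbling window
        long impressionTs = 59_000L; // impression just before the boundary
        long clickTs = 61_000L;      // click just after the boundary

        System.out.println(getWindowStartWithOffset(impressionTs, 0L, windowSize)); // 0
        System.out.println(getWindowStartWithOffset(clickTs, 0L, windowSize));      // 60000
        System.out.println(sameWindow(impressionTs, clickTs, windowSize));          // false
    }
}
```

With coGroup over tumbling windows, the impression here would be seen in the window starting at 0 without any click and the click in the window starting at 60000 without any impression, so the impression would be mislabeled as a negative sample.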
