Hello, I have a question #42

Open
mactavish91 opened this issue Dec 24, 2019 · 1 comment

Comments

@mactavish91

Why did you apply stacking to the TF-IDF and Doc2Vec features but not to Word2Vec? And if I add LDA features, do they need stacking as well?

@ShawnyXiao
Owner

  1. Why did you apply stacking to TF-IDF and Doc2Vec but not to Word2Vec?

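TF-IDF and Doc2Vec were both stacked because these two feature sets are too high-dimensional to feed directly into the XGBoost model. The Word2Vec vectors in this project are small enough to pass to XGBoost directly, and they also perform reasonably well, so I did not stack them.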
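For readers unfamiliar with the idea, here is a minimal sketch of one common way to stack a high-dimensional feature, not necessarily how this repository does it. It assumes scikit-learn, a made-up toy corpus, and `LogisticRegression` as a hypothetical base learner: the sparse TF-IDF matrix is reduced to out-of-fold class probabilities, and only those few columns would then be passed on to XGBoost.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_predict

# Toy corpus and labels, invented purely for illustration.
docs = [
    "cheap flights to paris", "book hotel in rome",
    "train tickets to berlin", "flight deals to tokyo",
    "python list comprehension", "rust borrow checker error",
    "java null pointer exception", "golang goroutine leak",
]
labels = np.array([0, 0, 0, 0, 1, 1, 1, 1])

# High-dimensional sparse features: one column per vocabulary term.
tfidf = TfidfVectorizer().fit_transform(docs)

# Out-of-fold class probabilities: each row is predicted by a model that
# did not see that row during training, which limits label leakage.
cv = StratifiedKFold(n_splits=4, shuffle=True, random_state=0)
stacked = cross_val_predict(
    LogisticRegression(max_iter=1000),  # assumed base learner
    tfidf, labels, cv=cv, method="predict_proba",
)

print(tfidf.shape)    # (number of documents, vocabulary size)
print(stacked.shape)  # (number of documents, number of classes)
```

The `stacked` array has one column per class instead of one per vocabulary term, so it can be concatenated with other dense features before training the second-level model.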

  2. If I add LDA features, do they need stacking?

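If XGBoost trains at an acceptable speed with the LDA features fed in directly, you can skip stacking. If training becomes impractical, or the results are poor, it is better to add a stacking layer.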
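As a rough illustration of the "feed it directly" option, the sketch below assumes scikit-learn's `LatentDirichletAllocation`, a made-up corpus, a hypothetical topic count of 3, and a zero-filled placeholder standing in for other dense features such as Word2Vec vectors. The LDA output has only one column per topic, so it can simply be concatenated with the other features.

```python
import numpy as np
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

# Toy corpus, invented purely for illustration.
docs = [
    "cheap flights to paris", "book hotel in rome",
    "train tickets to berlin", "flight deals to tokyo",
    "python list comprehension", "rust borrow checker error",
    "java null pointer exception", "golang goroutine leak",
]

# LDA works on raw term counts rather than TF-IDF weights.
counts = CountVectorizer().fit_transform(docs)

n_topics = 3  # hypothetical value; tune for a real dataset
lda = LatentDirichletAllocation(n_components=n_topics, random_state=0)
topic_features = lda.fit_transform(counts)  # one row per document

# Placeholder for other dense features (e.g. averaged Word2Vec vectors).
other_features = np.zeros((len(docs), 5))

# Direct concatenation, with no stacking layer in between.
combined = np.hstack([other_features, topic_features])
print(combined.shape)
```

If the combined matrix trains slowly or scores poorly, the LDA columns could instead be passed through a first-level model, in the same way as the high-dimensional features discussed in the first answer.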
