varlen features implementations #29

minarastgar · 2020-08-28T20:39:45Z

Do you have a plan to implement Varlen sparse features and different pooling layers?

jackguagua · 2020-08-29T01:34:23Z

pooling layers? can you say it more specifically?

minarastgar · 2020-08-29T02:03:21Z

I meant mostly varlan sparse features, for example, sequence of item_ids. Every item id is a sparse feature, and the last 10 items purchased by a user is a sequence of embeddings of item-ids which can be aggregated with a pooling layer like averagepooling.

jackguagua · 2020-08-29T02:44:52Z

I got it, it will be supported in next release.

minarastgar · 2020-09-03T17:57:57Z

Thanks for your quick reply. Looking forward to it. Any ETA for the next release?

jackguagua · 2020-09-04T00:27:58Z

It should be around October this year.

minarastgar · 2020-10-31T01:14:49Z

sorry for bugging you. Wonder if the release mentioned above is available. Thank you very much

jackguagua · 2020-10-31T03:33:34Z

I'm very very sorry for the delay of the original plan due to some other urgent tasks in the past two months. I will strive to release this new feature by the end of November.Sorry again.

jackguagua · 2020-11-26T05:21:54Z

@minarastgar varlen features is ready. here for details #44 (comment)

minarastgar · 2020-11-30T00:52:16Z

@jackguagua thank you so much. This is absolutely fantastic

minarastgar · 2021-01-28T00:58:40Z

Hi @jackguagua , I have a quick question about Varlen Features. Let's say there is a varlen feature like streams of movie_ids, and a categorical feature that is the movie_id we want to show to user. So we want to have an embedding for movie_id which is used by movie_id as well as streams of movie_ids . How can I specify that the embedding used for streams_of_movie_ids and movie_id is the same

                                   task=consts.TASK_REGRESSION,
                                   categorical_columns=["movie_id", "user_id", "gender", "occupation", "zip", "title", "age"],
                                   metrics=['mse'],
                                   fixed_embedding_dim=True,
                                   embeddings_output_dim=4,
                                   apply_gbm_features=False,
                                   apply_class_weight=True,
                                   earlystopping_patience=5,
                                   var_len_categorical_columns=[('stream_of_movie_ids', "|", "max")]) ```

jackguagua · 2021-01-31T12:03:03Z

DT can't do what you want now. I'm not very clear about the purpose of doing this. If you have the code that uses keras to implement it, pls send to me for reference.

minarastgar · 2021-02-04T01:07:00Z

Let me please clarify this, let say we have a list of movie_id [movie_id1, movie_id2,...,movie_id10] which are the last 10 movies watched by the user. On the other hand, we have a target movie which is movie_id100 (sparse_feature). for both streams (list of movie_id ) and sparse (target_title), we want to use movie_ids to build the embeddings. We do not want to generate different embeddings for entities in streams and sparse. the are coming from the same root which is movie_id.

jackguagua assigned oaksharks Nov 2, 2020

jackguagua added the enhancement New feature or request label Nov 2, 2020

oaksharks added a commit that referenced this issue Nov 25, 2020

#29 Support var len features

1ed27c8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

varlen features implementations #29

varlen features implementations #29

minarastgar commented Aug 28, 2020

jackguagua commented Aug 29, 2020

minarastgar commented Aug 29, 2020

jackguagua commented Aug 29, 2020

minarastgar commented Sep 3, 2020

jackguagua commented Sep 4, 2020

minarastgar commented Oct 31, 2020

jackguagua commented Oct 31, 2020

jackguagua commented Nov 26, 2020

minarastgar commented Nov 30, 2020

minarastgar commented Jan 28, 2021 •

edited

jackguagua commented Jan 31, 2021

minarastgar commented Feb 4, 2021

varlen features implementations #29

varlen features implementations #29

Comments

minarastgar commented Aug 28, 2020

jackguagua commented Aug 29, 2020

minarastgar commented Aug 29, 2020

jackguagua commented Aug 29, 2020

minarastgar commented Sep 3, 2020

jackguagua commented Sep 4, 2020

minarastgar commented Oct 31, 2020

jackguagua commented Oct 31, 2020

jackguagua commented Nov 26, 2020

minarastgar commented Nov 30, 2020

minarastgar commented Jan 28, 2021 • edited

jackguagua commented Jan 31, 2021

minarastgar commented Feb 4, 2021

minarastgar commented Jan 28, 2021 •

edited