
[QST] Help w/ exporting Retrieval Model. #1092

Open
Tmoradi opened this issue Jan 3, 2024 · 5 comments
Labels
question Further information is requested

Comments

Tmoradi commented Jan 3, 2024

❓ Questions & Help

Details

My current setup is a Vertex AI Workbench instance with a Tesla T4 GPU, 120 GB RAM, and 32 vCPUs, using the nvcr.io/nvidia/merlin/merlin-tensorflow:nightly container image.

I am working on a retrieval model, and depending on which version of the dataset I use, I cannot export the query tower.

The dataset versions I've been using are as follows:

v3: various continuous and categorical features for both users and the items we want to recommend.

  • exporting the query tower works fine in this case.

v4: features of v3 + item embeddings generated from sentence transformer.

  • the embedding dimension is 384
  • in the NVTabular Workflow, I just tagged the embedding columns as item features. Not sure if this is the cause of the issue, but I wanted to include it just in case (see the sketch after this list).

v5: features of v3 + item embeddings generated from sentence transformer + multi-hot encoding of user history as implicit feedback.

  • the multi-hot encoding has dimension 332
  • in the NVTabular Workflow, I just tagged these columns with Categorify() >> TagAsUserFeatures(). Not sure if this is the cause of the issue, but I wanted to include it just in case.
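For reference, the tagging in my Workflow looks roughly like this (column names are placeholders, since the real data is private):

import nvtabular as nvt
from nvtabular.ops import Categorify, TagAsItemFeatures, TagAsItemID, TagAsUserFeatures, TagAsUserID

# v3-style id columns
user_id = ["user_id"] >> Categorify() >> TagAsUserID()
item_id = ["item_id"] >> Categorify() >> TagAsItemID()
# v4: precomputed sentence-transformer embedding columns, tagged as item features
item_emb = [f"item_emb_{i}" for i in range(384)] >> TagAsItemFeatures()
# v5: multi-hot user-history column, categorified and tagged as a user feature
user_hist = ["user_history"] >> Categorify() >> TagAsUserFeatures()
workflow = nvt.Workflow(user_id + item_id + item_emb + user_hist)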

So when I train a model with either v4 or v5 and then save the query encoder, I get the following message:
query_tower = model.query_encoder  # the query tower of the trained two-tower model
query_tower.save(...)

(screenshot: the save() output, a warning listing functions that will not be available after loading)
  • note: this screenshot is with v4; with the v5 dataset, even more functions are reported as unavailable.

When I load the query_tower and then try to turn the model into a top-k model, I get another error:
import tensorflow as tf
import merlin.models.tf as mm
from merlin.models.utils.dataset import unique_rows_by_features
from merlin.schema import Tags

query_tower = tf.keras.models.load_model(".../query_tower_based_on_v5", compile=False)
model = mm.TwoTowerModelV2(query_tower, candidate)  # candidate: the candidate tower built earlier
model.compile()
candidate_features = unique_rows_by_features(train, Tags.ITEM, Tags.ITEM_ID)  # one row per item
topk_model = model.to_top_k_encoder(candidate_features, k=10, batch_size=1)
(screenshot: error raised when converting the model to a top-k encoder)

Sorry for cutting the error message off, but it's complaining about the encoder for the query tower.

Sorry for being guarded with the data, but it isn't public.

I hope I was able to convey the issue; any suggestions would be appreciated!

Tmoradi added the question label Jan 3, 2024
rnyak (Contributor) commented Jan 4, 2024

@Tmoradi can you please use nvcr.io/nvidia/merlin/merlin-tensorflow:23.08 and then do

cd /models
git pull origin main
pip install .

and test again.

If you can give us a toy repro example, we can debug better.

Tmoradi (Author) commented Jan 4, 2024

Thanks for your reply! I'll get you the toy repro example ASAP.

Tmoradi (Author) commented Jan 16, 2024

Hi @rnyak, sorry for the late reply. Here's what I've done since the last update.

I made a new notebook on Vertex AI that uses the container you mentioned, nvcr.io/nvidia/merlin/merlin-tensorflow:23.08.

I ran into the same issues when exporting the query tower.

I wasn't sure this would work, but in the Workflow I also tried adding Tags.CONTINUOUS to the embedding features to see if that did anything; it didn't impact the results (sketch below).
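The extra tagging I tried looks roughly like this (again with placeholder column names):

from merlin.schema import Tags
from nvtabular.ops import AddTags, TagAsItemFeatures

# tag the precomputed embedding columns as item features and also as continuous
item_emb = [f"item_emb_{i}" for i in range(384)] >> TagAsItemFeatures() >> AddTags([Tags.CONTINUOUS])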

I also created a toy dataset (a 10k sample of v5) and a notebook that goes through the workflow and the different errors I get. Link to repo

Tmoradi (Author) commented Jan 31, 2024

@rnyak hello, hope you are doing well. I appreciate that you may be busy, but would it be possible to give an ETA for when you'll be able to help? In the meantime, I'm going to try PyTorch/PyTorch Lightning to see if that helps.

rnyak (Contributor) commented Feb 2, 2024

@Tmoradi please note that adding external embeddings to the TwoTower model has not been tested, so it is likely that you will hit issues. The team does not currently have bandwidth to work on this. You can look at the available unit tests (like this one) and try to adapt them.
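For reference, the general pattern in those tests and the pretrained-embedding examples is to inject the external embeddings through the dataloader rather than through the NVTabular output; a minimal sketch, where the lookup key, embedding name, and array sizes are illustrative:

import numpy as np
import merlin.models.tf as mm
from merlin.dataloader.ops.embeddings import EmbeddingOperator

num_items = 1000  # illustrative
pretrained = np.random.rand(num_items, 384)  # one 384-dim vector per item id

# train is the merlin.io.Dataset used for fitting; the operator attaches
# the matching embedding row to each example at load time
loader = mm.Loader(
    train,
    batch_size=1024,
    transforms=[
        EmbeddingOperator(
            pretrained,
            lookup_key="item_id",
            embedding_name="pretrained_item_embeddings",
        )
    ],
)
model.fit(loader, epochs=1)  # the model then consumes the embedding as a regular input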
