Add support of transformer-based retrieval models #1128
Conversation
@@ -206,6 +206,8 @@ def call(
        "You should call the `index` method first to " "set the _candidates index."
    )

+   if isinstance(inputs, tf.RaggedTensor):
This assumes that we evaluate only on the last item in the session (which is the default mode during inference too). We might need to extend it to evaluate other items in the sequence in future work.
Thanks @sararb. I was trying to understand why we would have 1 as the 2nd dim of inputs, but from your explanation I now understand the reason is that we have predictions only for the last position.
I think it would be useful to add this remark as a comment.
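The behavior discussed above, keeping only the last item of each ragged session as the query, can be sketched as follows. This is a minimal illustration and not the PR's actual code; `inputs` here is a toy ragged batch of item embeddings:

```python
import tensorflow as tf

# Toy ragged batch: 2 sessions of item embeddings with different lengths.
inputs = tf.ragged.constant(
    [[[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]],  # session with 3 items
     [[0.7, 0.8]]],                          # session with 1 item
    ragged_rank=1,
)

# Keep only the embedding of the last item in each session, which is the
# default evaluation/inference mode discussed above.
dense = inputs.to_tensor()                   # (batch, max_len, dim), zero-padded
lengths = inputs.row_lengths()               # [3, 1]
rows = tf.cast(tf.range(tf.shape(dense)[0]), lengths.dtype)
last_items = tf.gather_nd(dense, tf.stack([rows, lengths - 1], axis=1))

print(last_items.shape)  # (2, 2): one query vector per session
```

This is why the second dimension of the evaluated inputs is effectively 1: only one position per session is scored.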
@sararb I can save the XLNet model, but I cannot load it back; I am getting an error. Do you think you can add a test for that in the unit tests? Also a test to showcase how one can do offline prediction? Thanks.
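A save/load round trip like the one being requested could be tested along these lines. This is a generic Keras sketch with a placeholder model and path, not the actual XLNet retrieval model:

```python
import os
import tempfile
import tensorflow as tf

# Placeholder model standing in for the trained retrieval model.
inp = tf.keras.Input(shape=(8,))
out = tf.keras.layers.Dense(4)(inp)
model = tf.keras.Model(inp, out)

# Save the model, load it back, and run an offline prediction with the copy.
path = os.path.join(tempfile.mkdtemp(), "retrieval_model.keras")
model.save(path)
loaded = tf.keras.models.load_model(path)
preds = loaded.predict(tf.random.uniform((2, 8)), verbose=0)
assert preds.shape == (2, 4)
```

A unit test along these lines would catch serialization issues in the model's custom layers before users hit them at inference time.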
Note: by making the bias term optional in the weight-tying layer EmbeddingTablePrediction, we make sure that the training and inference models have the same score calculation, because the contrastive output head is not exported during inference.
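A minimal sketch of what such a weight-tying head with an optional bias can look like (hypothetical code, not Merlin's actual EmbeddingTablePrediction implementation):

```python
import tensorflow as tf

class TiedEmbeddingPrediction(tf.keras.layers.Layer):
    """Scores queries against a shared item embedding table (weight tying)."""

    def __init__(self, embedding_table, use_bias=False):
        super().__init__()
        self.embedding_table = embedding_table  # (num_items, dim)
        self.use_bias = use_bias
        if use_bias:
            # The bias is only available when this head itself is exported;
            # an ANN index built from the table alone cannot reproduce it,
            # hence the default of use_bias=False.
            self.bias = self.add_weight(
                name="bias",
                shape=(embedding_table.shape[0],),
                initializer="zeros",
            )

    def call(self, query):  # query: (batch, dim)
        logits = tf.matmul(query, self.embedding_table, transpose_b=True)
        return logits + self.bias if self.use_bias else logits

table = tf.Variable(tf.random.uniform((10, 4)))
head = TiedEmbeddingPrediction(table, use_bias=False)
scores = head(tf.random.uniform((2, 4)))  # (2, 10): one logit per candidate
```

With `use_bias=False`, the dot product against the table is the entire score, so an ANN index built from the same table reproduces training-time scoring exactly.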
Goals ⚽
- Add support for ragged inputs to the BruteForce (TopKLayer) class. This is needed to export a sequential session encoder.
- Add a use_bias option to EmbeddingTablePrediction. The default is false, as we wouldn't have access to the bias term if we export the query encoder to an ANN system for inference.

Testing Details 🔍
- Add a test with ragged inputs to test_brute_force_layer.
- Add the use_bias option to the unit test test_last_item_prediction.
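For context, the scoring that a brute-force top-k layer performs can be sketched generically as follows (an illustrative function, not the actual BruteForce / TopKLayer API):

```python
import tensorflow as tf

def brute_force_topk(query, candidates, k=3):
    """Score each query against every candidate and return the k best."""
    scores = tf.matmul(query, candidates, transpose_b=True)  # (batch, n_candidates)
    return tf.math.top_k(scores, k=k)  # (values, indices), each (batch, k)

candidates = tf.random.uniform((100, 8))  # stand-in candidate index
query = tf.random.uniform((2, 8))         # e.g. last-item session encodings
top_scores, top_ids = brute_force_topk(query, candidates)
```

Supporting ragged inputs means the layer must first reduce each variable-length session to a single query vector (e.g. the last item's encoding) before this dense scoring step.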