
[FEA] Support feeding pre-trained embeddings to TF4Rec model with high-level api #475

Open · 2 of 3 tasks

rnyak opened this issue Aug 18, 2022 · 8 comments

@rnyak (Contributor) commented Aug 18, 2022

🚀 Feature request

Currently we do not have out-of-the-box support for adding pre-trained embeddings to the embedding layer, freezing them, and training a TF4Rec model. We have embedding_initializer, but we have never tested whether it works accurately and as expected. Maybe we can create in PyTorch a class like TensorInitializer (TF), as we did in Merlin Models, and expose the embedding initializer and trainable args to the user.

We need to:

  • Expose the definition of the embeddings module in the input blocks: TabularFeatures and TabularSequenceFeatures
  • Support feeding pre-trained embeddings to the TF4Rec model with the high-level API (users should be able to add them to the embedding layer and freeze them, i.e., set trainable=False (TF API) or requires_grad=False (PyTorch API)); see the sketch after this list
  • Create an example notebook showcasing that functionality
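
For reference, a minimal plain-PyTorch sketch of the freezing behavior described above, outside of T4Rec; the shapes and values are hypothetical placeholders:

    import torch
    import torch.nn as nn

    # Hypothetical pre-trained item embeddings: shape (item_cardinality, embedding_dim).
    pretrained_weights = torch.rand(10_000, 64)

    # freeze=True sets weight.requires_grad = False, so the table is not updated
    # during training; this is the PyTorch counterpart of trainable=False in TF.
    item_embedding = nn.Embedding.from_pretrained(pretrained_weights, freeze=True)

    item_ids = torch.tensor([1, 5, 42])
    item_vectors = item_embedding(item_ids)  # shape (3, 64)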

Motivation

This is a feature request coming from our customers and users.

@karlhigley (Contributor)

Is this related to or part of NVIDIA-Merlin/Merlin#211?

@rnyak (Contributor, Author) commented Aug 18, 2022

Is this related to or part of NVIDIA-Merlin/Merlin#211?

@karlhigley more related to NVIDIA-Merlin/Merlin#471. Not sure about the link to 211.

@gabrielspmoreira (Member)

When the embedding tables are not huge and fit in GPU memory, the new PretrainedEmbeddingsInitializer (#572) can be used to initialize the embedding matrix with pre-trained embeddings and set it to trainable or not.
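
A minimal sketch of constructing that initializer, assuming the constructor takes the weight matrix plus a trainable flag as in the unit tests added with #572 (argument names and the weight file are assumptions and may differ by version):

    import torch
    import transformers4rec.torch as tr

    # Assumed: pre-trained vectors for every item ID, shape (item_cardinality, dim);
    # the file name is a hypothetical placeholder.
    pretrained_item_emb = torch.load("item_embeddings.pt")

    # trainable=False keeps the copied weights frozen during training
    # (argument names assumed from the tests accompanying #572).
    item_emb_init = tr.PretrainedEmbeddingsInitializer(pretrained_item_emb, trainable=False)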

@rnyak rnyak self-assigned this Jan 30, 2023
@rnyak rnyak modified the milestones: Merlin 23.01, Merlin 23.02 Jan 30, 2023
@karlhigley karlhigley modified the milestones: Merlin 23.02, Merlin 23.04 Apr 4, 2023
@karunaahuja

Is there an example notebook showing usage of PretrainedEmbeddingsInitializer to initialize the embedding matrix?

@rnyak (Contributor, Author) commented Jul 8, 2024

Is there an example notebook showing usage of PretrainedEmbeddingsInitializer to initialize the embedding matrix?

We don't have an example for this feature, but you can refer to the unit test and try to implement it.

@karunaahuja

Is there an example notebook showing usage of PretrainedEmbeddingsInitializer to initialize the embedding matrix?

We don't have an example for this feature, but you can refer to the unit test and try to implement it.

Thanks. I guess what I am looking for is how to use this along with the input block defined by a model schema (TabularSequenceFeatures with a series of categorical and continuous features), tr.NextItemPredictionTask, and an Electra config. Here's my pseudocode without using the embeddings:

    import transformers4rec.torch as tr

    # schema, max_sequence_length, d_model, n_head, n_layer, embedding_dim_default,
    # and PAD_TOKEN are assumed to be defined elsewhere.
    input_module = tr.TabularSequenceFeatures.from_schema(
        schema,
        max_sequence_length=max_sequence_length,
        aggregation="concat",
        d_output=d_model,
        masking="mlm",
        embedding_dim_default=embedding_dim_default,
    )

    metrics = [
        tr.ranking_metric.NDCGAt(top_ks=[10, 20, 50, 100, 150, 200], labels_onehot=True),
        tr.ranking_metric.AvgPrecisionAt(
            top_ks=[10, 20, 50, 100, 150, 200], labels_onehot=True
        ),
        tr.ranking_metric.RecallAt(top_ks=[10, 20, 50, 100, 150, 200], labels_onehot=True),
    ]

    prediction_task = tr.NextItemPredictionTask(weight_tying=True, metrics=metrics)

    transformer_config = tr.ElectraConfig.build(
        d_model=d_model,
        n_head=n_head,
        n_layer=n_layer,
        total_seq_length=max_sequence_length,
        pad_token=PAD_TOKEN,
    )

    model = transformer_config.to_torch_model(input_module, prediction_task)
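
One possible way to wire pre-trained embeddings into this setup, assuming from_schema forwards an embeddings_initializers mapping to the underlying embedding block as exercised in the unit tests for #572 (the argument name, the "item_id" column name, and the weight file below are assumptions, not a confirmed API):

    import torch
    import transformers4rec.torch as tr

    # Hypothetical pre-trained item embeddings, shape (item_cardinality, embedding_dim).
    pretrained_item_emb = torch.load("item_embeddings.pt")

    input_module = tr.TabularSequenceFeatures.from_schema(
        schema,  # Merlin schema with an "item_id" categorical column (assumed name)
        max_sequence_length=max_sequence_length,
        aggregation="concat",
        d_output=d_model,
        masking="mlm",
        embedding_dim_default=embedding_dim_default,
        # Assumption: per-column initializers, keyed by the categorical column name.
        embeddings_initializers={
            "item_id": tr.PretrainedEmbeddingsInitializer(pretrained_item_emb, trainable=False)
        },
    )

The rest of the pipeline (metrics, NextItemPredictionTask, transformer config, to_torch_model) would stay as above.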

@karunaahuja

following up on this ^

@Tottowich

Any progress? @karunaahuja
Looking to do the same thing :D
