
[Task] Support pre-trained embeddings for ranking and session based models via the new dataloader functionality #1043

Closed

EvenOldridge opened this issue Mar 29, 2023 · 1 comment
EvenOldridge commented Mar 29, 2023

  • Update the input block to support pre-trained embeddings.

  • Add a module to extract pre-trained embeddings

  • Show an example of how to transform pre-trained embeddings before aggregating them with the other trainable embeddings in the model, such as applying Layer Normalization or an MLP layer.

  • Update DLRM, DeepFM, etc., to include an optional projection layer for combining pre-trained embeddings with the other embeddings in the model.

  • Implement and evaluate different aggregation methods to combine pre-trained embeddings with the other trainable embeddings.

  • Add an example demonstrating support for pre-trained embeddings for DLRM or another non-session-based model. Artificial example? KDD Cup?
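The transform-then-aggregate step above could be sketched in plain PyTorch roughly as follows. This is an illustrative sketch, not the actual Merlin Models / T4R API: `PretrainedEmbeddingBlock` and its layout are hypothetical names for this example.

```python
import torch
import torch.nn as nn


class PretrainedEmbeddingBlock(nn.Module):
    """Illustrative block: normalize frozen pre-trained embeddings and
    project them into the model's embedding space before aggregation."""

    def __init__(self, pretrained: torch.Tensor, output_dim: int):
        super().__init__()
        # Frozen lookup table built from the pre-trained vectors.
        self.lookup = nn.Embedding.from_pretrained(pretrained, freeze=True)
        # Layer Normalization over the pre-trained embedding dimension.
        self.norm = nn.LayerNorm(pretrained.shape[1])
        # Projection (a single-layer MLP) into the trainable embedding space.
        self.project = nn.Linear(pretrained.shape[1], output_dim)

    def forward(self, item_ids: torch.Tensor) -> torch.Tensor:
        return self.project(self.norm(self.lookup(item_ids)))


# Usage: concatenate with a trainable embedding table of matching dim.
pretrained = torch.randn(1000, 768)  # e.g. item embeddings from a text model
block = PretrainedEmbeddingBlock(pretrained, output_dim=64)
trainable = nn.Embedding(1000, 64)

ids = torch.randint(0, 1000, (32,))
combined = torch.cat([block(ids), trainable(ids)], dim=-1)
print(combined.shape)  # torch.Size([32, 128])
```

Concatenation is only one aggregation choice; element-wise sum or a gated combination would be evaluated under the aggregation-methods task above.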

Constraints

  • Dataloader should support fixed-size 3D tensors.
  • Dataloader needs to add a tag to the pre-trained embeddings so that the T4R input module can differentiate between trainable features and pre-trained embeddings.
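A minimal sketch of what these two constraints imply for a batch, assuming a hypothetical tag mapping (the tag names here are illustrative, not the real Merlin schema tags): pre-trained embeddings arrive as a fixed-size 3D tensor, and a tag lets the input module route each feature to the right block.

```python
import torch

# Hypothetical dataloader output: trainable categorical features plus a
# fixed-size 3D pre-trained embedding tensor [batch, seq_len, emb_dim].
batch = {
    "item_id": torch.randint(0, 1000, (32, 20)),  # trainable feature
    "item_embedding": torch.randn(32, 20, 768),   # pre-trained embedding
}

# Illustrative tags attached by the dataloader so the input module can
# tell trainable features apart from pre-trained embeddings.
tags = {
    "item_id": {"categorical", "item"},
    "item_embedding": {"continuous", "pretrained"},
}

pretrained_feats = [name for name, t in tags.items() if "pretrained" in t]
trainable_feats = [name for name, t in tags.items() if "pretrained" not in t]
print(pretrained_feats)  # ['item_embedding']
print(trainable_feats)   # ['item_id']
```

The fixed-size requirement means every row of `item_embedding` has the same `seq_len` and `emb_dim`, so no ragged representation is needed for this feature.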