The "padding" is done while preprocessing the data. We explode the full ordered list of each user ratings into multiple subsequences. Let me illustrate:
We are using each movie on the sequence as a target with its subsequent past, padding with 0s if past is missing.
We reserve the latest window of the sequence 2, [2.3] ,4] as validation as is the latest known step of the user (closest to actual time).
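A minimal sketch of that preprocessing, assuming a history length of 2 (the function name `explode_user_sequence` and the toy ratings list are illustrative, not the repo's actual code):

```python
def explode_user_sequence(ratings, history_len=2):
    """Turn one user's ordered ratings into (history, target) pairs,
    left-padding the history with 0s when it is shorter than history_len."""
    examples = []
    for i, target in enumerate(ratings):
        past = ratings[max(0, i - history_len):i]
        past = [0] * (history_len - len(past)) + past  # pad missing past with 0s
        examples.append((past, target))
    return examples

pairs = explode_user_sequence([1, 2, 3, 4])
# pairs == [([0, 0], 1), ([0, 1], 2), ([1, 2], 3), ([2, 3], 4)]
train, valid = pairs[:-1], pairs[-1]  # the latest window [2, 3, 4] is held out
```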
This model has no hard application limit, since you are not losing any datapoints; once you use it for inference, you can feed it any sequence length, assuming your batch_size is 1.
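To see why batch_size=1 removes the length constraint, here is a generic sketch (the repo builds its transformer from a third-party library, and may use a different framework than the PyTorch stand-in below): self-attention itself accepts any sequence length, so a single unpadded sequence of any length runs fine; only batching sequences of different lengths together would force padding.

```python
import torch
import torch.nn as nn

# Illustrative stand-in for the repo's transformer module; the point is only
# that the encoder has no built-in fixed sequence length.
encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=32, nhead=4, batch_first=True),
    num_layers=2,
)
embed = nn.Embedding(num_embeddings=1000, embedding_dim=32, padding_idx=0)

for seq_len in (3, 7, 50):                      # any length works...
    seq = torch.randint(1, 1000, (1, seq_len))  # ...with batch_size == 1
    out = encoder(embed(seq))
    print(out.shape)  # torch.Size([1, seq_len, 32])
```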
Thanks a lot! You used a third-party library in the model part to build the transformer module. But will the transformer automatically ignore the loss value caused by padding?
Obviously, only fixed-length sequences are used here, and there is no padding operation, which limits the model's scope of application.