
Support for embedded representation #3156

Merged: 2 commits into UKPLab:master on Jan 10, 2025
Conversation

Radu1999 (Contributor) commented on Jan 9, 2025:

Previously, the SentenceTransformer wrapper did not allow passing precomputed embeddings (an embedded representation). Many models from the transformers library support this, such as Llama, Gemma, Gemma2, Mistral, and BERT.

This feature is particularly (but not only) useful for experimenting with different custom soft-prompting techniques.

The underlying implementations do not support passing both input_ids and inputs_embeds simultaneously (example), so I implemented them as mutually exclusive, with inputs_embeds taking precedence.
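
For context, here is a minimal sketch of the underlying transformers behavior this builds on, using plain transformers rather than the merged sentence-transformers wrapper; the model name and the use of get_input_embeddings here are illustrative assumptions, not part of this PR:

```python
# Minimal sketch (plain transformers, not the sentence-transformers wrapper)
# of the behavior described above: a model's forward accepts either
# input_ids or inputs_embeds, never both.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "bert-base-uncased"  # assumption: any model whose forward accepts inputs_embeds
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

batch = tokenizer(["An example sentence"], return_tensors="pt")

with torch.no_grad():
    # Precompute the embedded representation from the model's own embedding
    # layer; in practice these vectors could be modified, e.g. for soft prompting.
    inputs_embeds = model.get_input_embeddings()(batch["input_ids"])

    # Pass inputs_embeds in place of input_ids; passing both raises a ValueError.
    outputs = model(inputs_embeds=inputs_embeds, attention_mask=batch["attention_mask"])

print(outputs.last_hidden_state.shape)  # (1, seq_len, hidden_size)
```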

Radu1999 changed the title from Support for precomputed embeddings to Support for embedded representation on Jan 9, 2025
Radu1999 requested a review from tomaarsen on Jan 9, 2025 at 13:03
tomaarsen (Collaborator) commented:

Looks good! Thanks for this.

  • Tom Aarsen

tomaarsen merged commit a7e3707 into UKPLab:master on Jan 10, 2025
9 checks passed