Vector Search using ML transformer models

Demonstation of search using Language Agnostic transformer model.

What Does this code do?

Indexing Data

Creates a document mapping to to create a search index. Includes vector properties, making sure to match the dimensions in the mapping with the dimensions of the transformer model we are using.
Runs the document data through the transformer model to create the vector embedding i.e "This is a sentence" => [-0.0297,0.0618,0.0240,-0.0288, ...]
Indexes the document with the vector embeddings.

Searching data

Runs the query through the transformer model.
Formats your search query to use the generated vector embedding to perform a vector search
We are using a cosine similiarity function on the vector properties to get the results but this can be tweaked to adjust for query speed and accuracy.

What Can we do with transformer models & vector search?

Improve standard keyword search by combining it with semantic search including:
- Natural language processing. Get results based on intent & contextual meaning of the search query.
- Language Agnostic search. Get relevant results, agnostic of the language of the text used to create the indexed embedding and of the language of the search query text.
Image search. Create vector embeddings from images and perform search on them using a natural language text query.

How to run

Requirements

Python3
Docker Desktop
ElasticSearch 8.14 (How to install in Docker Locally)
A whole lot of free RAM

First time set up

Rename .env.example to .env and modify constants to match your setup.
Run pip install -r requirements.txt
Start the Flask Server by running python main.py
Add products & generate embeddings by calling the /index route. i.e.

curl -X POST "http://localhost:5050/index" -H "Content-Type: application/json" -d '{
  "product_id": 1,
  "spin": "SPIN12345",
  "product_title": "Sample Product",
  "clean_product_description": "This is a sample product description.",
  "category_title": "Sample Category",
  "category_description": "This is a description of the sample category.",
  "custom_category_text": "Custom text for category",
  "parent_title": "Parent Product",
  "product_tags": ["tag1", "tag2"],
  "product_configurations": [
    {
      "product_configuration_url": "http://example.com/config1",
      "product_configuration_id": 101,
      "product_configuration_display_name": "Config 1",
      "product_configuration_total_price": 19.99,
      "product_pictures": [
        {
          "product_picture_url": "http://example.com/pic1",
          "product_picture_id": 201,
          "picture_entity_id": 301,
          "priority": 1,
          "title": "Picture 1",
          "description": "Description for picture 1",
          "picture_id": 401
        }
      ]
    },
    {
      "product_configuration_url": "http://example.com/config2",
      "product_configuration_id": 102,
      "product_configuration_display_name": "Config 2",
      "product_configuration_total_price": 29.99,
      "product_pictures": [
        {
          "product_picture_url": "http://example.com/pic2",
          "product_picture_id": 202,
          "picture_entity_id": 302,
          "priority": 2,
          "title": "Picture 2",
          "description": "Description for picture 2",
          "picture_id": 402
        }
      ]
    }
  ]
}'

Querying the index

While the flask server is still running, open search page in your browser by going to http://localhost:5050 (Use port that is in your .env file)
Enter a term in the box and hit search.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
assets		assets
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
index.html		index.html
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vector Search using ML transformer models

Demonstation of search using Language Agnostic transformer model.

What Does this code do?

What Can we do with transformer models & vector search?

How to run

Requirements

First time set up

Querying the index

Further Reading

About

Releases

Packages

Languages

sfgarza/ml-model-vector-search-demo-app

Folders and files

Latest commit

History

Repository files navigation

Vector Search using ML transformer models

Demonstation of search using Language Agnostic transformer model.

What Does this code do?

What Can we do with transformer models & vector search?

How to run

Requirements

First time set up

Querying the index

Further Reading

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages