Changed default local model to nomic #1943
Conversation
backend/Dockerfile.model_server
@@ -21,10 +21,12 @@ RUN apt-get remove -y --allow-remove-essential perl-base && \
RUN python -c "from transformers import AutoModel, AutoTokenizer, TFDistilBertForSequenceClassification; \
from huggingface_hub import snapshot_download; \
AutoTokenizer.from_pretrained('danswer/intent-model'); \
AutoTokenizer.from_pretrained('intfloat/e5-base-v2'); \
AutoTokenizer.from_pretrained('nomic-ai/nomic-embed-text-v1'); \
AutoTokenizer.from_pretrained('nomic-ai/nomic-bert-2048'); \
To make this work while airgapped, you need to `.from_pretrained` and `snapshot_download` not only nomic-ai/nomic-embed-text-v1 but also nomic-ai/nomic-bert-2048.
It was hard to find the exact reasoning for this, but I'm fairly sure it's because nomic-embed-text-v1 is built on top of nomic-bert-2048 and needs to run .py scripts located only in the nomic-bert-2048 repo here
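The pre-fetch described above can be sketched as follows. This is a minimal illustration, not the PR's exact Dockerfile code; the `AIRGAP_MODELS` list and `prefetch` helper are names introduced here, and it assumes `huggingface_hub` is installed:

```python
# Sketch: pre-fetch both Hugging Face repos at build time so the model server
# can run fully airgapped. nomic-embed-text-v1 pulls custom modeling code that
# lives in the nomic-bert-2048 repo, so both repos must be in the local cache.

AIRGAP_MODELS = [
    "nomic-ai/nomic-embed-text-v1",  # the embedding model itself
    "nomic-ai/nomic-bert-2048",      # backbone repo holding the custom .py modeling files
]

def prefetch(models: list[str]) -> list[str]:
    """Download each repo into the local HF cache; return the cache paths."""
    # Lazy import so this sketch only needs huggingface_hub when actually run.
    from huggingface_hub import snapshot_download
    return [snapshot_download(m) for m in models]
```

Once both snapshots are cached, subsequent `from_pretrained` calls resolve locally without network access.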
ASYM_QUERY_PREFIX = os.environ.get("ASYM_QUERY_PREFIX", "query: ")
ASYM_PASSAGE_PREFIX = os.environ.get("ASYM_PASSAGE_PREFIX", "passage: ")
ASYM_QUERY_PREFIX = os.environ.get("ASYM_QUERY_PREFIX", "search_query: ")
ASYM_PASSAGE_PREFIX = os.environ.get("ASYM_PASSAGE_PREFIX", "search_document: ")
# Purely an optimization, memory limitation consideration
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
these are the defaults for nomic-ai/nomic-embed-text-v1
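To illustrate how these prefixes are used: nomic-embed-text-v1 is trained with task-specific prefixes, so queries and passages get different prefixes before embedding. The helper function names below are illustrative, not from the PR:

```python
import os

# Defaults match the nomic-ai/nomic-embed-text-v1 prefixes from the diff above;
# both remain overridable via environment variables.
ASYM_QUERY_PREFIX = os.environ.get("ASYM_QUERY_PREFIX", "search_query: ")
ASYM_PASSAGE_PREFIX = os.environ.get("ASYM_PASSAGE_PREFIX", "search_document: ")

def prefix_query(text: str) -> str:
    """Prepend the asymmetric query prefix before embedding a search query."""
    return ASYM_QUERY_PREFIX + text

def prefix_passage(text: str) -> str:
    """Prepend the asymmetric passage prefix before embedding a document."""
    return ASYM_PASSAGE_PREFIX + text
```

Using the wrong prefix (or none) with an asymmetric model typically degrades retrieval quality, since the model saw these prefixes during training.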
model = SentenceTransformer(model_name)
model = SentenceTransformer(
    model_name_or_path=model_name, trust_remote_code=True
)
This is related to also needing to install nomic-bert-2048.
There is a script that has to be executed to use the model (unsure exactly when) that is located in nomic-bert-2048 and not in nomic-embed-text-v1 (a couple of .py scripts you can see here).
Not 100% sure though.
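A minimal sketch of the load path with `trust_remote_code` enabled (the function name is made up here; it assumes `sentence-transformers` is installed):

```python
def load_embedding_model(model_name: str):
    """Load a SentenceTransformer whose architecture ships as repo-hosted code.

    trust_remote_code=True lets Transformers execute the custom modeling .py
    files from the model repo (for nomic-embed-text-v1, those files live in
    nomic-ai/nomic-bert-2048). The code runs locally; no inference data is
    sent to remote servers, but the downloaded code is arbitrary Python, so
    only enable this for trusted models.
    """
    # Lazy import so the sketch only needs the library when actually called.
    from sentence_transformers import SentenceTransformer
    return SentenceTransformer(model_name_or_path=model_name, trust_remote_code=True)
```

Without `trust_remote_code=True`, loading such a model raises an error asking you to opt in to executing the repo's custom code.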
@@ -116,8 +116,9 @@ def get_tokenizer(model_name: str | None, provider_type: str | None) -> BaseToke
    if provider_type.lower() == "openai":
        # Used across ada and text-embedding-3 models
        return _check_tokenizer_cache("openai")
    # If we are given a cloud provider_type that isn't OpenAI, we default to trying to use the model_name
    # this means we are approximating the token count which may leave some performance on the table
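The fallback described in the diff's comment can be sketched like this. This is a simplified stand-in for the repo's `get_tokenizer`, not its actual code, and it assumes `transformers` is installed:

```python
def get_fallback_tokenizer(model_name: str):
    """Approximate a cloud provider's tokenizer by loading model_name locally.

    For cloud providers other than OpenAI we fall back to whatever tokenizer
    Hugging Face resolves for model_name. The resulting token counts are an
    approximation of the provider's true tokenization, which may leave some
    performance on the table (e.g. slightly conservative chunk sizing).
    """
    # Lazy import so the sketch only needs transformers when actually called.
    from transformers import AutoTokenizer
    return AutoTokenizer.from_pretrained(model_name)
```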
general note
snapshot_download('danswer/intent-model'); \
snapshot_download('intfloat/e5-base-v2'); \
snapshot_download('mixedbread-ai/mxbai-rerank-xsmall-v1')"
RUN python -c "from transformers import AutoTokenizer; \
It's better to combine these into a single RUN instruction: a single RUN produces a single image layer that can be cached, instead of two.
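For illustration, a consolidated version might look like the fragment below. This is a sketch of the suggestion, not the PR's final Dockerfile; the model list is taken from the diffs above:

```dockerfile
# One RUN = one cacheable layer. Tokenizer loads and snapshot downloads for
# all models happen in a single python -c invocation.
RUN python -c "from transformers import AutoTokenizer; \
    from huggingface_hub import snapshot_download; \
    AutoTokenizer.from_pretrained('danswer/intent-model'); \
    AutoTokenizer.from_pretrained('nomic-ai/nomic-embed-text-v1'); \
    AutoTokenizer.from_pretrained('nomic-ai/nomic-bert-2048'); \
    snapshot_download('danswer/intent-model'); \
    snapshot_download('nomic-ai/nomic-embed-text-v1'); \
    snapshot_download('nomic-ai/nomic-bert-2048'); \
    snapshot_download('mixedbread-ai/mxbai-rerank-xsmall-v1')"
```

If any line changes, the whole layer is rebuilt, so this trades finer-grained caching for fewer layers.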
model = SentenceTransformer(model_name)
model = SentenceTransformer(
    model_name_or_path=model_name,
    trust_remote_code=True,
I would add a comment here:
"Some model architectures that aren't built into Transformers or Sentence Transformers need their code downloaded to be loaded locally. This does not mean data is sent to remote servers for inference; however, the remote code can be fairly arbitrary, so only use trusted models."