
ML endpoints #1

Open · DiegoPino wants to merge 25 commits into main
Conversation

DiegoPino (Member)

What?

@alliomeria I will request your review on the Python code here. I am aware the new code is not perfect, and I will need to explain what is happening before you can review it. But your new Python skills/formal learning are required/needed.

Let's talk about this tomorrow (Friday the 21st, the first day of summer) when you have time. Thanks!

Download the medium model to the models folder for testing:
wget https://github.com/ultralytics/assets/releases/download/v8.2.0/yolov8m.pt

@alliomeria yolov8 as a service. WIP, but it works.
means via docker-compose ENVs
Of course you want to mount /models so you can deploy from the host.
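The ENV-driven setup described above can be sketched in Python. The ENV variable names here (`MODEL_FOLDER`, `YOLO_MODEL_NAME`) are assumptions for illustration, not the PR's actual docker-compose keys:

```python
import os

def resolve_model_path(default_name="yolov8m.pt"):
    """Resolve the YOLO weights path from ENV overrides, falling back to a
    /models folder that is expected to be mounted from the host."""
    folder = os.environ.get("MODEL_FOLDER", "/models")      # hypothetical ENV name
    name = os.environ.get("YOLO_MODEL_NAME", default_name)  # hypothetical ENV name
    return os.path.join(folder, name)

print(resolve_model_path())  # e.g. /models/yolov8m.pt when no ENVs are set
```

With this pattern the container ships with a default, and the host can swap models by mounting /models and setting one ENV, without rebuilding the image.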
… On L1

affecting score and who knows what else when searching with Solr
While testing, because I have two servers running, I changed to port 6401. Then fetch and fail. Gosh.
… correctly

And we are doing research, so I need all options available. ML processors will also get an option to define the requested norm.
This is above my pay grade ... https://github.com/FlagOpen/FlagEmbedding
see bge-small-en-v1.5
So far I am letting Hugging Face download it on the first call. Not ideal ... so
app.config["SENTENCE_TRANSFORMER_MODEL_FOLDER"] has no use yet.
But once I move into building the Docker image I will fetch the folder, which requires LFS access via git (more bytes/apps, etc. Bananas).
For now I am letting the app download the model on the first run, but I will eventually manage to get it upfront. I just don't want a huge Docker container.
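One way to make SENTENCE_TRANSFORMER_MODEL_FOLDER useful once the model is baked into the image is to resolve a local copy first and only fall back to the Hub id. A minimal sketch, assuming the folder simply contains a directory named after the model (the helper and layout are hypothetical, not the PR's code):

```python
import os

def resolve_sentence_model(model_folder, model_id="BAAI/bge-small-en-v1.5"):
    """Prefer a copy pre-fetched into model_folder; otherwise return the Hub id
    so the first call downloads it (the PR's current behaviour)."""
    local = os.path.join(model_folder, model_id.split("/")[-1])
    return local if os.path.isdir(local) else model_id

# e.g. SentenceTransformer(resolve_sentence_model(app.config["SENTENCE_TRANSFORMER_MODEL_FOLDER"]))
print(resolve_sentence_model("/models"))
```

The same function works both during development (nothing pre-fetched, first call downloads) and in the eventual Docker build where the folder is populated ahead of time.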
There is no communication going out to the world, but I am not returning gender or age (unethical and also spooky).
This model is the same one used internally by Apple (ArcFace).
I need to figure out (on the Archipelago side) how to make a single processor generate multiple Flavor Documents, so right now I am extracting the embedding (L2) vector for only ONE face, the one with the highest score, and normalizing the bounding box. @alliomeria (just pinging because this is the lonely ML mountain and there is no queen/king/dwarf under the mountain, not even a dragon)
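The single-face selection described above can be sketched like this. The dict shape mirrors insightface's face attributes (bbox, det_score, embedding), but the helper itself and the output format are assumptions for illustration:

```python
def best_face_flavor(faces, img_w, img_h):
    """Pick the single face with the highest detection score and return its
    bounding box normalized to [0, 1] of the image dimensions."""
    if not faces:
        return None
    best = max(faces, key=lambda f: f["det_score"])
    x1, y1, x2, y2 = best["bbox"]
    return {
        "score": best["det_score"],
        "bbox": [x1 / img_w, y1 / img_h, x2 / img_w, y2 / img_h],
        "embedding": best.get("embedding"),
    }

faces = [
    {"det_score": 0.71, "bbox": (0, 0, 64, 64), "embedding": None},
    {"det_score": 0.93, "bbox": (320, 160, 480, 320), "embedding": None},
]
print(best_face_flavor(faces, 640, 640)["bbox"])  # [0.5, 0.25, 0.75, 0.5]
```

Normalizing the box to the image dimensions keeps it meaningful regardless of which IIIF size the image was fetched at.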
@DiegoPino DiegoPino self-assigned this Jun 20, 2024
alliomeria left a comment

Ok @DiegoPino, went through a first-pass review and left some comments. I have a couple of other comments & questions for you; I'll save them for our live review later today. This is a lot of great work on your part, hats off to you! 🎩 Let's talk more later about all this!

    if not params['norm']:
        params['norm'] = 'l2'

    if params['norm'] and params['norm'] not in ['l1', 'l2', 'max']:


l1 not used anywhere? (see notes below)

DiegoPino (Member, Author)

Yes, not used. The reason I wanted to leave/show it for you is that the comparison for image ML models when using one norm or the other is semantically so, so different. Let's talk about that. I can also use it and then let the SBR processors just call L2.

# Vector size for this layer (I think by default it will be num_layers - 2, so 20) is 576
# array.reshape(-1, 1) if your data has a single feature, or array.reshape(1, -1) if it contains a single sample
# This "should" return a unit vector so we can use "dot_product" in Solr
# Even if norm L1 is better for comparison, dot product on Solr gives me less than 1 of itself. So I will try with L2
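The l1/l2/max trade-off those comments discuss can be shown in plain Python (the real code presumably uses sklearn.preprocessing.normalize, hence the reshape notes above; this is just the underlying math):

```python
import math

def normalize(vec, norm="l2"):
    """Scale a vector by its l1, l2, or max norm, as sklearn's normalize does."""
    if norm == "l1":
        denom = sum(abs(x) for x in vec)
    elif norm == "l2":
        denom = math.sqrt(sum(x * x for x in vec))
    elif norm == "max":
        denom = max(abs(x) for x in vec)
    else:
        raise ValueError(f"unsupported norm: {norm!r}")
    return [x / denom for x in vec] if denom else list(vec)

unit = normalize([3.0, 4.0], "l2")   # [0.6, 0.8]
print(sum(x * x for x in unit))      # ≈ 1.0: an L2 unit vector dotted with itself
print(normalize([3.0, 4.0], "l1"))   # components sum to 1, but self dot product < 1
```

This is why only L2 plays well with Solr's dot_product: the self similarity of an L2-normalized vector is 1, while an l1- or max-normalized vector dotted with itself lands below (or above) 1.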

👍

    img = loadImage(params['iiif_image_url'], 640)
    if img is not False:
        app = FaceAnalysis(providers=['CPUExecutionProvider'])
        # This will get all models, including age, gender (bad juju), etc. But we won't return those


Good that we're not returning these. It would be better if there were a way to prevent these factors from being processed at all, but I understand that might not be possible. Maybe a good reason to consider not using insightface at all?

DiegoPino (Member, Author)

Hi, you are right, but the reason I cannot avoid it is the embedding (for the vector/just-image comparison); that one is not generated when I do not run the full analysis.
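Since the full analysis has to run anyway, one way to guarantee the sensitive attributes never leave the service is to whitelist what gets serialized into the response. The key names below mirror insightface face attributes, but the filter itself is a hypothetical sketch, not the PR's code:

```python
RETURNABLE_KEYS = {"bbox", "det_score", "embedding"}  # deliberately excludes gender/age

def sanitize_face(face: dict) -> dict:
    """Drop everything (age, gender, landmarks, ...) except the whitelisted keys."""
    return {k: v for k, v in face.items() if k in RETURNABLE_KEYS}

face = {"bbox": [0, 0, 10, 10], "det_score": 0.9, "embedding": [0.1] * 4,
        "gender": 1, "age": 33}
print(sorted(sanitize_face(face)))  # ['bbox', 'det_score', 'embedding']
```

An allow-list (rather than a deny-list) fails safe: any new attribute a future model version adds is excluded by default.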

    if img is not False:
        app = FaceAnalysis(providers=['CPUExecutionProvider'])
        # This will get all models, including age, gender (bad juju), etc. But we won't return those
        # We could limit to just 'detection' and 'recognition' (the latter provides the embeddings)

🤔

    data['message'] = 'Sentence transformer (embedding)'
    data['sentence_transformer'] = {}
    params = {}
    # For s2p (short query to long passage) retrieval tasks, each short query should start with an instruction (for instructions, see the Model List... NOTE: link leads to nothing good). But the instruction is not needed for passages.
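That s2p note can be made concrete: bge-style models expect an instruction prefixed to short queries only, while passages are embedded as-is. The string below is the instruction FlagEmbedding documents for the English bge v1.5 models; treat both it and the helper as assumptions, not the PR's code:

```python
QUERY_INSTRUCTION = "Represent this sentence for searching relevant passages: "

def prepare_text(text: str, is_query: bool) -> str:
    """Prefix the retrieval instruction for short queries; passages go in as-is."""
    return QUERY_INSTRUCTION + text if is_query else text

print(prepare_text("stone tools", is_query=True))
print(prepare_text("A long passage about lithic technology...", is_query=False))
```

Exposing is_query as a request parameter would let the same endpoint serve both sides of s2p retrieval.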

🆗

nlpserver.py (Outdated)
@@ -12,7 +12,7 @@
 # configurations
 #app.config['var1'] = 'test'
 app.config["YOLO_MODEL_NAME"] = "yolov8m.pt"
-app.config["MOBILENET_MODEL_NAME"] = "mobilenet_v3_small.tflite"
+app.config["MOBILENET_MODEL_NAME"] = "mobilenet_v3_large.tflite"

👍

DiegoPino (Member, Author)
Really a matter of space/taste. The small one really had little inference capability. But I will defer to you on this.

…hts)

and set a root/download folder for insightface, which will be pre-loaded by the Docker container
2 participants