Big perf improvement on loading.
Our OpenAI "is the server up" check was retrying on failure (the openai client's default behaviour), with sleeps between retries! That really slowed loading down, especially since local servers (LiteLLM, etc.) are often down when the app isn't running.

Now with max_retries=0 it's speedy.
scosman committed Jan 10, 2025
1 parent 7430759 commit c23ae14
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions app/desktop/studio_server/provider_api.py
@@ -633,6 +633,10 @@ def openai_compatible_providers_load_cache() -> OpenAICompatibleProviderCache |
     openai_client = openai.OpenAI(
         api_key=api_key,
         base_url=base_url,
+        # Important: max_retries must be 0 for performance.
+        # It's common for these servers to be down (could be a local app that isn't running).
+        # The OpenAI client will retry a few times, with a sleep in between! Big loading perf hit.
+        max_retries=0,
     )

     try:
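The slowdown the commit message describes can be sketched with a stdlib-only simulation. Everything below (the `check_server` helper, the fixed `backoff` sleep, the retry counts) is an illustrative assumption, not the openai client's actual retry schedule; it only shows why retries with sleeps against a down server dominate load time while max_retries=0 fails immediately:

```python
import time


def check_server(connect, max_retries=2, backoff=0.1):
    """Attempt a connectivity check, sleeping between retries on failure.

    `connect` is any callable that raises ConnectionError when the server
    is down. `backoff` is an illustrative fixed sleep, not the real
    client's exponential schedule.
    """
    for attempt in range(max_retries + 1):
        try:
            return connect()
        except ConnectionError:
            if attempt == max_retries:
                raise
            time.sleep(backoff)


def down_server():
    # Simulates a local server (e.g. LiteLLM) that isn't running.
    raise ConnectionError("server not running")


# With retries, a down server costs (max_retries * backoff) in sleeps
# before the check finally fails.
start = time.monotonic()
try:
    check_server(down_server, max_retries=2, backoff=0.1)
except ConnectionError:
    pass
with_retries = time.monotonic() - start

# With max_retries=0 the same failure surfaces immediately.
start = time.monotonic()
try:
    check_server(down_server, max_retries=0)
except ConnectionError:
    pass
without_retries = time.monotonic() - start

print(with_retries > without_retries)  # the retried check is strictly slower
```

Per health-check that difference is a couple of sleeps, but a loading path that probes several providers pays it once per down server, which is why setting max_retries=0 on the probe client makes loading noticeably faster.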
