Merge branch 'master' into chroma_persistent

langchain-ai · Dec 17, 2024 · 7612250 · 7612250
2 parents 6bb7505 + 0afc284
commit 7612250
Show file tree

Hide file tree

Showing 20 changed files with 887 additions and 203 deletions.
diff --git a/docs/docs/concepts/index.mdx b/docs/docs/concepts/index.mdx
@@ -48,7 +48,7 @@ The conceptual guide does not cover step-by-step instructions or specific implem
 - **[AIMessage](/docs/concepts/messages#aimessage)**: Represents a complete response from an AI model.
 - **[astream_events](/docs/concepts/chat_models#key-methods)**: Stream granular information from [LCEL](/docs/concepts/lcel) chains.
 - **[BaseTool](/docs/concepts/tools/#tool-interface)**: The base class for all tools in LangChain.
-- **[batch](/docs/concepts/runnables)**: Use to execute a runnable with batch inputs a Runnable.
+- **[batch](/docs/concepts/runnables)**: Use to execute a runnable with batch inputs.
 - **[bind_tools](/docs/concepts/tool_calling/#tool-binding)**: Allows models to interact with tools.
 - **[Caching](/docs/concepts/chat_models#caching)**: Storing results to avoid redundant calls to a chat model.
 - **[Chat models](/docs/concepts/multimodality/#multimodality-in-chat-models)**: Chat models that handle multiple data modalities.
@@ -70,7 +70,7 @@ The conceptual guide does not cover step-by-step instructions or specific implem
 - **[langchain-core](/docs/concepts/architecture#langchain-core)**: Core langchain package. Includes base interfaces and in-memory implementations.
 - **[langchain](/docs/concepts/architecture#langchain)**: A package for higher level components (e.g., some pre-built chains).
 - **[langgraph](/docs/concepts/architecture#langgraph)**: Powerful orchestration layer for LangChain. Use to build complex pipelines and workflows.
-- **[langserve](/docs/concepts/architecture#langserve)**: Use to deploy LangChain Runnables as REST endpoints. Uses FastAPI. Works primarily for LangChain Runnables, does not currently integrate with LangGraph.
+- **[langserve](/docs/concepts/architecture#langserve)**: Used to deploy LangChain Runnables as REST endpoints. Uses FastAPI. Works primarily for LangChain Runnables, does not currently integrate with LangGraph.
 - **[LLMs (legacy)](/docs/concepts/text_llms)**: Older language models that take a string as input and return a string as output.
 - **[Managing chat history](/docs/concepts/chat_history#managing-chat-history)**: Techniques to maintain and manage the chat history.
 - **[OpenAI format](/docs/concepts/messages#openai-format)**: OpenAI's message format for chat models.
@@ -79,7 +79,7 @@ The conceptual guide does not cover step-by-step instructions or specific implem
 - **[RemoveMessage](/docs/concepts/messages/#removemessage)**: An abstraction used to remove a message from chat history, used primarily in LangGraph.
 - **[role](/docs/concepts/messages#role)**: Represents the role (e.g., user, assistant) of a chat message.
 - **[RunnableConfig](/docs/concepts/runnables/#runnableconfig)**: Use to pass run time information to Runnables (e.g., `run_name`, `run_id`, `tags`, `metadata`, `max_concurrency`, `recursion_limit`, `configurable`).
-- **[Standard parameters for chat models](/docs/concepts/chat_models#standard-parameters)**: Parameters such as API key, `temperature`, and `max_tokens`,
+- **[Standard parameters for chat models](/docs/concepts/chat_models#standard-parameters)**: Parameters such as API key, `temperature`, and `max_tokens`.
 - **[Standard tests](/docs/concepts/testing#standard-tests)**: A defined set of unit and integration tests that all integrations must pass.
 - **[stream](/docs/concepts/streaming)**: Use to stream output from a Runnable or a graph.
 - **[Tokenization](/docs/concepts/tokens)**: The process of converting data into tokens and vice versa.

diff --git a/docs/docs/contributing/how_to/integrations/package.mdx b/docs/docs/contributing/how_to/integrations/package.mdx
@@ -291,37 +291,8 @@ import VectorstoreSource from '../../../../src/theme/integration_template/integr
 Embeddings are used to convert `str` objects from `Document.page_content` fields
 into a vector representation (represented as a list of floats).
 
-The `Embeddings` class must inherit from the [Embeddings](https://python.langchain.com/api_reference/core/embeddings/langchain_core.embeddings.embeddings.Embeddings.html#langchain_core.embeddings.embeddings.Embeddings)
-base class. This interface has 5 methods that can be implemented.
-
-| Method/Property         | Description                                          |
-|------------------------ |------------------------------------------------------|
-| `__init__`              | Initialize the embeddings object. (optional)         |
-| `embed_query`           | Embed a list of texts. (required)                    |
-| `embed_documents`       | Embed a list of documents. (required)                |
-| `aembed_query`          | Asynchronously embed a list of texts. (optional)     |
-| `aembed_documents`      | Asynchronously embed a list of documents. (optional) |
-
-### Constructor
-
-The `__init__` constructor is optional but common, but can be used to set up any necessary attributes
-that a user can pass in when initializing the embeddings object. Common attributes include
-
-- `model` - the id of the model to use for embeddings
-
-### Embedding queries vs documents
-
-The `embed_query` and `embed_documents` methods are required. These methods both operate
-on string inputs (the accessing of `Document.page_content` attributes) is handled
-by the VectorStore using the embedding model for legacy reasons.
-
-`embed_query` takes in a single string and returns a single embedding as a list of floats.
-If your model has different modes for embedding queries vs the underlying documents, you can
-implement this method to handle that. 
-
-`embed_documents` takes in a list of strings and returns a list of embeddings as a list of lists of floats.
-
-### Implementation
+Refer to the [Custom Embeddings Guide](/docs/how_to/custom_embeddings) guide for
+detail on a starter embeddings [implementation](/docs/how_to/custom_embeddings/#implementation).
 
 You can start from the following template or langchain-cli command:
 

diff --git a/docs/docs/how_to/custom_embeddings.ipynb b/docs/docs/how_to/custom_embeddings.ipynb
@@ -0,0 +1,222 @@
+{
+ "cells": [
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "c160026f-aadb-4e9f-8642-b4a9e8479d77",
+   "metadata": {},
+   "source": [
+    "# Custom Embeddings\n",
+    "\n",
+    "LangChain is integrated with many [3rd party embedding models](/docs/integrations/text_embedding/). In this guide we'll show you how to create a custom Embedding class, in case a built-in one does not already exist. Embeddings are critical in natural language processing applications as they convert text into a numerical form that algorithms can understand, thereby enabling a wide range of applications such as similarity search, text classification, and clustering.\n",
+    "\n",
+    "Implementing embeddings using the standard [Embeddings](https://python.langchain.com/api_reference/core/embeddings/langchain_core.embeddings.embeddings.Embeddings.html) interface will allow your embeddings to be utilized in existing `LangChain` abstractions (e.g., as the embeddings powering a [VectorStore](https://python.langchain.com/api_reference/core/vectorstores/langchain_core.vectorstores.base.VectorStore.html) or cached using [CacheBackedEmbeddings](/docs/how_to/caching_embeddings/)).\n",
+    "\n",
+    "## Interface\n",
+    "\n",
+    "The current `Embeddings` abstraction in LangChain is designed to operate on text data. In this implementation, the inputs are either single strings or lists of strings, and the outputs are lists of numerical arrays (vectors), where each vector represents\n",
+    "an embedding of the input text into some n-dimensional space.\n",
+    "\n",
+    "Your custom embedding must implement the following methods:\n",
+    "\n",
+    "| Method/Property                 | Description                                                                | Required/Optional |\n",
+    "|---------------------------------|----------------------------------------------------------------------------|-------------------|\n",
+    "| `embed_documents(texts)`        | Generates embeddings for a list of strings.                                | Required          |\n",
+    "| `embed_query(text)`             | Generates an embedding for a single text query.                            | Required          |\n",
+    "| `aembed_documents(texts)`       | Asynchronously generates embeddings for a list of strings.                 | Optional          |\n",
+    "| `aembed_query(text)`            | Asynchronously generates an embedding for a single text query.             | Optional          |\n",
+    "\n",
+    "These methods ensure that your embedding model can be integrated seamlessly into the LangChain framework, providing both synchronous and asynchronous capabilities for scalability and performance optimization.\n",
+    "\n",
+    "\n",
+    ":::note\n",
+    "`Embeddings` do not currently implement the [Runnable](/docs/concepts/runnables/) interface and are also **not** instances of pydantic `BaseModel`.\n",
+    ":::\n",
+    "\n",
+    "### Embedding queries vs documents\n",
+    "\n",
+    "The `embed_query` and `embed_documents` methods are required. These methods both operate\n",
+    "on string inputs. The accessing of `Document.page_content` attributes is handled\n",
+    "by the vector store using the embedding model for legacy reasons.\n",
+    "\n",
+    "`embed_query` takes in a single string and returns a single embedding as a list of floats.\n",
+    "If your model has different modes for embedding queries vs the underlying documents, you can\n",
+    "implement this method to handle that. \n",
+    "\n",
+    "`embed_documents` takes in a list of strings and returns a list of embeddings as a list of lists of floats.\n",
+    "\n",
+    ":::note\n",
+    "`embed_documents` takes in a list of plain text, not a list of LangChain `Document` objects. The name of this method\n",
+    "may change in future versions of LangChain.\n",
+    ":::"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "2162547f-4577-47e8-b12f-e9aa3c243797",
+   "metadata": {},
+   "source": [
+    "## Implementation\n",
+    "\n",
+    "As an example, we'll implement a simple embeddings model that returns a constant vector. This model is for illustrative purposes only."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "6b838062-552c-43f8-94f8-d17e4ae4c221",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from typing import List\n",
+    "\n",
+    "from langchain_core.embeddings import Embeddings\n",
+    "\n",
+    "\n",
+    "class ParrotLinkEmbeddings(Embeddings):\n",
+    "    \"\"\"ParrotLink embedding model integration.\n",
+    "\n",
+    "    # TODO: Populate with relevant params.\n",
+    "    Key init args — completion params:\n",
+    "        model: str\n",
+    "            Name of ParrotLink model to use.\n",
+    "\n",
+    "    See full list of supported init args and their descriptions in the params section.\n",
+    "\n",
+    "    # TODO: Replace with relevant init params.\n",
+    "    Instantiate:\n",
+    "        .. code-block:: python\n",
+    "\n",
+    "            from langchain_parrot_link import ParrotLinkEmbeddings\n",
+    "\n",
+    "            embed = ParrotLinkEmbeddings(\n",
+    "                model=\"...\",\n",
+    "                # api_key=\"...\",\n",
+    "                # other params...\n",
+    "            )\n",
+    "\n",
+    "    Embed single text:\n",
+    "        .. code-block:: python\n",
+    "\n",
+    "            input_text = \"The meaning of life is 42\"\n",
+    "            embed.embed_query(input_text)\n",
+    "\n",
+    "        .. code-block:: python\n",
+    "\n",
+    "            # TODO: Example output.\n",
+    "\n",
+    "    # TODO: Delete if token-level streaming isn't supported.\n",
+    "    Embed multiple text:\n",
+    "        .. code-block:: python\n",
+    "\n",
+    "             input_texts = [\"Document 1...\", \"Document 2...\"]\n",
+    "            embed.embed_documents(input_texts)\n",
+    "\n",
+    "        .. code-block:: python\n",
+    "\n",
+    "            # TODO: Example output.\n",
+    "\n",
+    "    # TODO: Delete if native async isn't supported.\n",
+    "    Async:\n",
+    "        .. code-block:: python\n",
+    "\n",
+    "            await embed.aembed_query(input_text)\n",
+    "\n",
+    "            # multiple:\n",
+    "            # await embed.aembed_documents(input_texts)\n",
+    "\n",
+    "        .. code-block:: python\n",
+    "\n",
+    "            # TODO: Example output.\n",
+    "\n",
+    "    \"\"\"\n",
+    "\n",
+    "    def __init__(self, model: str):\n",
+    "        self.model = model\n",
+    "\n",
+    "    def embed_documents(self, texts: List[str]) -> List[List[float]]:\n",
+    "        \"\"\"Embed search docs.\"\"\"\n",
+    "        return [[0.5, 0.6, 0.7] for _ in texts]\n",
+    "\n",
+    "    def embed_query(self, text: str) -> List[float]:\n",
+    "        \"\"\"Embed query text.\"\"\"\n",
+    "        return self.embed_documents([text])[0]\n",
+    "\n",
+    "    # optional: add custom async implementations here\n",
+    "    # you can also delete these, and the base class will\n",
+    "    # use the default implementation, which calls the sync\n",
+    "    # version in an async executor:\n",
+    "\n",
+    "    # async def aembed_documents(self, texts: List[str]) -> List[List[float]]:\n",
+    "    #     \"\"\"Asynchronous Embed search docs.\"\"\"\n",
+    "    #     ...\n",
+    "\n",
+    "    # async def aembed_query(self, text: str) -> List[float]:\n",
+    "    #     \"\"\"Asynchronous Embed query text.\"\"\"\n",
+    "    #     ..."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "47a19044-5c3f-40da-889a-1a1cfffc137c",
+   "metadata": {},
+   "source": [
+    "### Let's test it 🧪"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "21c218fe-8f91-437f-b523-c2b6e5cf749e",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "[[0.5, 0.6, 0.7], [0.5, 0.6, 0.7]]\n",
+      "[0.5, 0.6, 0.7]\n"
+     ]
+    }
+   ],
+   "source": [
+    "embeddings = ParrotLinkEmbeddings(\"test-model\")\n",
+    "print(embeddings.embed_documents([\"Hello\", \"world\"]))\n",
+    "print(embeddings.embed_query(\"Hello\"))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "de50f690-178e-4561-af98-14967b3c8501",
+   "metadata": {},
+   "source": [
+    "## Contributing\n",
+    "\n",
+    "We welcome contributions of Embedding models to the LangChain code base.\n",
+    "\n",
+    "If you aim to contribute an embedding model for a new provider (e.g., with a new set of dependencies or SDK), we encourage you to publish your implementation in a separate `langchain-*` integration package. This will enable you to appropriately manage dependencies and version your package. Please refer to our [contributing guide](/docs/contributing/how_to/integrations/) for a walkthrough of this process."
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.4"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
diff --git a/docs/docs/how_to/custom_llm.ipynb b/docs/docs/how_to/custom_llm.ipynb
@@ -9,10 +9,16 @@
     "\n",
     "This notebook goes over how to create a custom LLM wrapper, in case you want to use your own LLM or a different wrapper than one that is supported in LangChain.\n",
     "\n",
-    "Wrapping your LLM with the standard `LLM` interface allow you to use your LLM in existing LangChain programs with minimal code modifications!\n",
+    "Wrapping your LLM with the standard `LLM` interface allow you to use your LLM in existing LangChain programs with minimal code modifications.\n",
     "\n",
     "As an bonus, your LLM will automatically become a LangChain `Runnable` and will benefit from some optimizations out of the box, async support, the `astream_events` API, etc.\n",
     "\n",
+    ":::caution\n",
+    "You are currently on a page documenting the use of [text completion models](/docs/concepts/text_llms). Many of the latest and most popular models are [chat completion models](/docs/concepts/chat_models).\n",
+    "\n",
+    "Unless you are specifically using more advanced prompting techniques, you are probably looking for [this page instead](/docs/how_to/custom_chat_model/).\n",
+    ":::\n",
+    "\n",
     "## Implementation\n",
     "\n",
     "There are only two required things that a custom LLM needs to implement:\n",

diff --git a/docs/docs/how_to/index.mdx b/docs/docs/how_to/index.mdx
@@ -159,6 +159,7 @@ See [supported integrations](/docs/integrations/text_embedding/) for details on
 
 - [How to: embed text data](/docs/how_to/embed_text)
 - [How to: cache embedding results](/docs/how_to/caching_embeddings)
+- [How to: create a custom embeddings class](/docs/how_to/custom_embeddings)
 
 ### Vector stores
 
@@ -244,6 +245,7 @@ All of LangChain components can easily be extended to support your own versions.
 
 - [How to: create a custom chat model class](/docs/how_to/custom_chat_model)
 - [How to: create a custom LLM class](/docs/how_to/custom_llm)
+- [How to: create a custom embeddings class](/docs/how_to/custom_embeddings)
 - [How to: write a custom retriever class](/docs/how_to/custom_retriever)
 - [How to: write a custom document loader](/docs/how_to/document_loader_custom)
 - [How to: write a custom output parser class](/docs/how_to/output_parser_custom)

diff --git a/docs/docs/integrations/chat/litellm_router.ipynb b/docs/docs/integrations/chat/litellm_router.ipynb
@@ -63,17 +63,17 @@
     "        },\n",
     "    },\n",
     "    {\n",
-    "        \"model_name\": \"gpt-4\",\n",
+    "        \"model_name\": \"gpt-35-turbo\",\n",
     "        \"litellm_params\": {\n",
-    "            \"model\": \"azure/gpt-4-1106-preview\",\n",
+    "            \"model\": \"azure/gpt-35-turbo\",\n",
     "            \"api_key\": \"<your-api-key>\",\n",
     "            \"api_version\": \"2023-05-15\",\n",
     "            \"api_base\": \"https://<your-endpoint>.openai.azure.com/\",\n",
     "        },\n",
     "    },\n",
     "]\n",
     "litellm_router = Router(model_list=model_list)\n",
-    "chat = ChatLiteLLMRouter(router=litellm_router)"
+    "chat = ChatLiteLLMRouter(router=litellm_router, model_name=\"gpt-35-turbo\")"
    ]
   },
   {
@@ -177,6 +177,7 @@
    "source": [
     "chat = ChatLiteLLMRouter(\n",
     "    router=litellm_router,\n",
+    "    model_name=\"gpt-35-turbo\",\n",
     "    streaming=True,\n",
     "    verbose=True,\n",
     "    callback_manager=CallbackManager([StreamingStdOutCallbackHandler()]),\n",
@@ -209,7 +210,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.9.13"
+   "version": "3.11.9"
   }
  },
  "nbformat": 4,