Skip to content

Commit

Permalink
Add CohereAI embeddings (#8650)
Browse files Browse the repository at this point in the history
* Add cohereai embeddings

* Add cohereai embeddings

* Update notebook and resolve linting issues.

* Update with review suggestions

* Update with review suggestions

* Update with review suggestions

* Update notebook with retriever

* Fix markdown

* remove api key
  • Loading branch information
ravi03071991 authored Nov 2, 2023
1 parent f7fb2de commit 0acf6d6
Show file tree
Hide file tree
Showing 3 changed files with 539 additions and 0 deletions.
377 changes: 377 additions & 0 deletions docs/examples/embeddings/cohereai.ipynb
Original file line number Diff line number Diff line change
@@ -0,0 +1,377 @@
{
"cells": [
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"<a href=\"https://colab.research.google.com/github/jerryjliu/llama_index/blob/main/docs/examples/embeddings/cohereai.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"# CohereAI Embeddings"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"If you're opening this Notebook on colab, you will probably need to install LlamaIndex 🦙."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"!pip install llama-index"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"# Initilise with your api key\n",
"import os\n",
"\n",
"cohere_api_key = \"YOUR_API_KEY\"\n",
"os.environ[\"COHERE_API_KEY\"] = cohere_api_key"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"#### With latest `embed-english-v3.0` embeddings.\n",
"\n",
"- input_type=\"search_document\": Use this for texts (documents) you want to store in your vector database\n",
"\n",
"- input_type=\"search_query\": Use this for search queries to find the most relevant documents in your vector database"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"1024\n",
"[-0.041931152, -0.022384644, -0.07067871, -0.011886597, -0.019210815]\n"
]
}
],
"source": [
"from llama_index.embeddings.cohereai import CohereEmbedding\n",
"\n",
"# with input_typ='search_query'\n",
"embed_model = CohereEmbedding(\n",
" cohere_api_key=cohere_api_key,\n",
" model_name=\"embed-english-v3.0\",\n",
" input_type=\"search_query\",\n",
")\n",
"\n",
"embeddings = embed_model.get_text_embedding(\"Hello CohereAI!\")\n",
"\n",
"print(len(embeddings))\n",
"print(embeddings[:5])"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"1024\n",
"[-0.03074646, -0.0029201508, -0.058044434, -0.015457153, -0.02331543]\n"
]
}
],
"source": [
"# with input_type = 'search_document'\n",
"embed_model = CohereEmbedding(\n",
" cohere_api_key=cohere_api_key,\n",
" model_name=\"embed-english-v3.0\",\n",
" input_type=\"search_document\",\n",
")\n",
"\n",
"embeddings = embed_model.get_text_embedding(\"Hello CohereAI!\")\n",
"\n",
"print(len(embeddings))\n",
"print(embeddings[:5])"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"#### With old `embed-english-v2.0` embeddings."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"4096\n",
"[0.65771484, 0.7998047, 2.3769531, -2.3105469, -1.6044922]\n"
]
}
],
"source": [
"embed_model = CohereEmbedding(\n",
" cohere_api_key=cohere_api_key, model_name=\"embed-english-v2.0\"\n",
")\n",
"\n",
"embeddings = embed_model.get_text_embedding(\"Hello CohereAI!\")\n",
"\n",
"print(len(embeddings))\n",
"print(embeddings[:5])"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"#### Now with latest `embed-english-v3.0` embeddings, \n",
"\n",
"let's use \n",
"1. input_type=`search_document` to build index\n",
"2. input_type=`search_query` to retrive relevant context."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"import logging\n",
"import sys\n",
"\n",
"logging.basicConfig(stream=sys.stdout, level=logging.INFO)\n",
"logging.getLogger().addHandler(logging.StreamHandler(stream=sys.stdout))\n",
"\n",
"from llama_index import (\n",
" VectorStoreIndex,\n",
" SimpleDirectoryReader,\n",
" ServiceContext,\n",
")\n",
"\n",
"from llama_index.llms import LiteLLM\n",
"from llama_index.response.notebook_utils import display_source_node\n",
"\n",
"from IPython.display import Markdown, display"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"#### Download Data"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"--2023-11-03 03:14:50-- https://raw.githubusercontent.com/run-llama/llama_index/main/docs/examples/data/paul_graham/paul_graham_essay.txt\n",
"Resolving raw.githubusercontent.com (raw.githubusercontent.com)... 185.199.109.133, 185.199.111.133, 185.199.110.133, ...\n",
"Connecting to raw.githubusercontent.com (raw.githubusercontent.com)|185.199.109.133|:443... connected.\n",
"HTTP request sent, awaiting response... 200 OK\n",
"Length: 75042 (73K) [text/plain]\n",
"Saving to: 'data/paul_graham/paul_graham_essay.txt'\n",
"\n",
"data/paul_graham/pa 100%[===================>] 73.28K --.-KB/s in 0.006s \n",
"\n",
"2023-11-03 03:14:50 (11.3 MB/s) - 'data/paul_graham/paul_graham_essay.txt' saved [75042/75042]\n",
"\n"
]
}
],
"source": [
"!mkdir -p 'data/paul_graham/'\n",
"!wget 'https://raw.githubusercontent.com/run-llama/llama_index/main/docs/examples/data/paul_graham/paul_graham_essay.txt' -O 'data/paul_graham/paul_graham_essay.txt'"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"#### Load Data"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"documents = SimpleDirectoryReader(\"./data/paul_graham/\").load_data()"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"#### Build index with input_type = 'search_document'"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"llm = LiteLLM(\"command-nightly\")\n",
"embed_model = CohereEmbedding(\n",
" cohere_api_key=cohere_api_key,\n",
" model_name=\"embed-english-v3.0\",\n",
" input_type=\"search_document\",\n",
")\n",
"\n",
"service_context = ServiceContext.from_defaults(\n",
" llm=llm, embed_model=embed_model\n",
")\n",
"index = VectorStoreIndex.from_documents(\n",
" documents=documents, service_context=service_context\n",
")"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"#### Build retriever with input_type = 'search_query'"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"embed_model = CohereEmbedding(\n",
" cohere_api_key=cohere_api_key,\n",
" model_name=\"embed-english-v3.0\",\n",
" input_type=\"search_query\",\n",
")\n",
"\n",
"service_context = ServiceContext.from_defaults(\n",
" llm=llm, embed_model=embed_model\n",
")\n",
"\n",
"search_query_retriever = index.as_retriever(service_context=service_context)\n",
"\n",
"search_query_retrieved_nodes = search_query_retriever.retrieve(\n",
" \"What happened in the summer of 1995?\"\n",
")"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [
{
"data": {
"text/markdown": [
"**Node ID:** 1b0759b6-e6a1-4749-aeaa-1eafe14db055<br>**Similarity:** 0.3253174706260866<br>**Text:** That's not how they sell. I wrote some software to generate web sites for galleries, and Robert wrote some to resize images and set up an http server to serve the pages. Then we tried to sign up galleries. To call this a difficult sale would be an understatement. It was difficult to give away. A few galleries let us make sites for them for free, but none paid us.\n",
"\n",
"Then some online stores started to appear, and I realized that except for the order buttons they were identical to the sites we'd been generating for galleries. This impressive-sounding thing called an \"internet storefront\" was something we already knew how to build.\n",
"\n",
"So in the summer of 1995, after I submitted the camera-ready copy of ANSI Common Lisp to the publishers, we started trying to write software to build online stores. At first this was going to be normal desktop software, which in those days meant Windows software. That was an alarming prospect, because neither of us knew how to write Windows software or wanted to learn. We lived in the Unix world. But we decided we'd at least try writing a prototype store builder on Unix. Robert wrote a shopping cart, and I wrote a new site generator for stores — in Lisp, of course.\n",
"\n",
"We were working out of Robert's apartment in Cambridge. His roommate was away for big chunks of time, during which I got to sleep in his room. For some reason there was no bed frame or sheets, just a mattress on the floor. One morning as I was lying on this mattress I had an idea that made me sit up like a capital L. What if we ran the software on the server, and let users control it by clicking on links? Then we'd never have to write anything to run on users' computers. We could generate the sites on the same server we'd serve them from. Users wouldn't need anything more than a browser.\n",
"\n",
"This kind of software, known as a web app, is common now, but at the time it wasn't clear that it was even possible. To find out, we decided to try making a version of our store builder that y...<br>"
],
"text/plain": [
"<IPython.core.display.Markdown object>"
]
},
"metadata": {},
"output_type": "display_data"
},
{
"data": {
"text/markdown": [
"**Node ID:** ab6c138d-a509-4894-9131-da145eb7a4b4<br>**Similarity:** 0.28713538838359537<br>**Text:** But once again, this was not due to any particular insight on our part. We didn't know how VC firms were organized. It never occurred to us to try to raise a fund, and if it had, we wouldn't have known where to start. [14]\n",
"\n",
"The most distinctive thing about YC is the batch model: to fund a bunch of startups all at once, twice a year, and then to spend three months focusing intensively on trying to help them. That part we discovered by accident, not merely implicitly but explicitly due to our ignorance about investing. We needed to get experience as investors. What better way, we thought, than to fund a whole bunch of startups at once? We knew undergrads got temporary jobs at tech companies during the summer. Why not organize a summer program where they'd start startups instead? We wouldn't feel guilty for being in a sense fake investors, because they would in a similar sense be fake founders. So while we probably wouldn't make much money out of it, we'd at least get to practice being investors on them, and they for their part would probably have a more interesting summer than they would working at Microsoft.\n",
"\n",
"We'd use the building I owned in Cambridge as our headquarters. We'd all have dinner there once a week — on tuesdays, since I was already cooking for the thursday diners on thursdays — and after dinner we'd bring in experts on startups to give talks.\n",
"\n",
"We knew undergrads were deciding then about summer jobs, so in a matter of days we cooked up something we called the Summer Founders Program, and I posted an announcement on my site, inviting undergrads to apply. I had never imagined that writing essays would be a way to get \"deal flow,\" as investors call it, but it turned out to be the perfect source. [15] We got 225 applications for the Summer Founders Program, and we were surprised to find that a lot of them were from people who'd already graduated, or were about to that spring. Already this SFP thing was starting to feel more serious than we'd intended.\n",
"\n",
"We ...<br>"
],
"text/plain": [
"<IPython.core.display.Markdown object>"
]
},
"metadata": {},
"output_type": "display_data"
}
],
"source": [
"for n in search_query_retrieved_nodes:\n",
" display_source_node(n, source_length=2000)"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "py38",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3"
},
"vscode": {
"interpreter": {
"hash": "64bcadabe4cd61f3d117ba0da9d14bf2f8e35582ff79e821f2e71056f2723d1e"
}
}
},
"nbformat": 4,
"nbformat_minor": 2
}
2 changes: 2 additions & 0 deletions llama_index/embeddings/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@
from llama_index.embeddings.base import SimilarityMode
from llama_index.embeddings.bedrock import BedrockEmbedding
from llama_index.embeddings.clarifai import ClarifaiEmbedding
from llama_index.embeddings.cohereai import CohereEmbedding
from llama_index.embeddings.elasticsearch import (
ElasticsearchEmbedding,
ElasticsearchEmbeddings,
Expand All @@ -31,6 +32,7 @@
"AdapterEmbeddingModel",
"BedrockEmbedding",
"ClarifaiEmbedding",
"CohereEmbedding",
"DEFAULT_HUGGINGFACE_EMBEDDING_MODEL",
"ElasticsearchEmbedding",
"GoogleUnivSentEncoderEmbedding",
Expand Down
Loading

0 comments on commit 0acf6d6

Please sign in to comment.