Replies: 1 comment
-
The issue of duplicate entries in your Pinecone vector database might be due to how the nodes are being processed and upserted. Here are a few points to consider:
Here is an example of how you might use the vector_store = PineconeVectorStore(pinecone_index=pinecone_index)
storage_context = StorageContext.from_defaults(vector_store=vector_store)
# Upsert documents to avoid duplicates
index = VectorStoreIndex.from_documents(
nodes,
storage_context=storage_context,
embed_model=EmbeddModel,
) Make sure that your code is designed to upsert documents rather than inserting them multiple times. This should help resolve the issue of duplicate entries in your Pinecone vector database. If you continue to experience issues, consider checking for any version-specific bugs or updates [3][4]. |
Beta Was this translation helpful? Give feedback.
-
I have created nodes using ingestion pipeline:
and I have total 113 nodes after I concatenate all the chunks.
however When I am upserting those nodes in my pinecone vector db it is showing vector count 572
this is the code i am using to upserting the nodes to my vec db.
can anyone please help me out with it.
Beta Was this translation helpful? Give feedback.
All reactions