Conversational Document Assistant using RAG

Retrieval Augmented Generation with Huggingface model Zephyr-7b and ChromaDB

Steps:

Load Model and Tokenizer: Load a quantized conversational model and initialize its tokenizer.
Mount Google Drive: Mount Google Drive to access documents.
Process Text from PDFs: Extract text from PDF documents, remove links, references, stopwords, and blank lines, then save the processed text as text files.
Generate Embeddings and Vectorization: Use Sentence Transformers for text embeddings, then vectorize using Chroma.
Build Chatbot Pipeline: Construct a conversational retrieval chain with memory (RAG) using LangChain, integrating the model, retriever, and memory.
Create UI with Gradio: Develop a user interface using Gradio for users to interact with the chatbot.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Queries and Responses.pdf		Queries and Responses.pdf
README.md		README.md
Retrieval_Augmented_Generation_(RAG)_with_LLM_and_LangChain.ipynb		Retrieval_Augmented_Generation_(RAG)_with_LLM_and_LangChain.ipynb