This repository contains an implementation of a Retrieval-Augmented Generation (RAG) model built with Groq, Google's Gemma model, and other Python libraries. The RAG model combines the strengths of an information retrieval system and a generative language model, allowing for accurate, contextual question answering.

The RAG model is designed to ingest documents, split them into chunks, and embed them into a vector space using Google's Generative AI embeddings. When a user asks a question, the relevant document chunks are retrieved from the vector store, and the Gemma model generates a response conditioned on those chunks.
- Document Ingestion: PDFs are loaded using `PyPDFDirectoryLoader` and split into chunks using `RecursiveCharacterTextSplitter`.
- Vector Store: The chunks are embedded into a vector space and indexed with FAISS, a library for efficient similarity search and clustering of dense vectors.
- Retrieval: When a user asks a question, the most relevant document chunks are retrieved from the vector store using a similarity search.
- Generation: The Gemma-7b language model generates a response based on the retrieved context. (Sketches of the indexing and query pipelines follow this list.)
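For illustration, here is a minimal indexing sketch of the ingestion and vector-store steps using LangChain. The chunk sizes and the `models/embedding-001` embedding model name are assumptions, not values confirmed by this repository:

```python
# Indexing sketch: load PDFs, chunk them, embed, and index in FAISS.
# Assumes the langchain-community and langchain-google-genai packages;
# chunk_size/chunk_overlap and the embedding model name are guesses.
from langchain_community.document_loaders import PyPDFDirectoryLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_google_genai import GoogleGenerativeAIEmbeddings
from langchain_community.vectorstores import FAISS

loader = PyPDFDirectoryLoader("financial_docs")  # directory from the setup steps below
docs = loader.load()

splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)
chunks = splitter.split_documents(docs)

# GOOGLE_API_KEY must be set in the environment (see the .env setup below).
embeddings = GoogleGenerativeAIEmbeddings(model="models/embedding-001")
vector_store = FAISS.from_documents(chunks, embeddings)
```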
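And a query sketch continuing from the `vector_store` built above; the exact Groq model id and the prompt wording are assumptions:

```python
# Query sketch: retrieve similar chunks and let Gemma (served by Groq)
# answer from them. Reads GROQ_API_KEY from the environment.
from langchain_groq import ChatGroq
from langchain_core.prompts import ChatPromptTemplate
from langchain.chains.combine_documents import create_stuff_documents_chain
from langchain.chains import create_retrieval_chain

llm = ChatGroq(model="gemma-7b-it")  # model id is an assumption

prompt = ChatPromptTemplate.from_template(
    "Answer the question using only the context below.\n\n"
    "<context>\n{context}\n</context>\n\nQuestion: {input}"
)

# Stuff the retrieved chunks into the prompt and generate an answer.
document_chain = create_stuff_documents_chain(llm, prompt)
rag_chain = create_retrieval_chain(vector_store.as_retriever(), document_chain)

response = rag_chain.invoke({"input": "What was the total revenue last quarter?"})
print(response["answer"])
```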
- Clone the repository.
- Install the required dependencies:

  ```
  pip install -r requirements.txt
  ```
- Set up the required API keys:
  - Create a `.env` file in the project root directory.
  - Add your Groq API key and Google API key to the `.env` file:

    ```
    GROQ_API_KEY=your_groq_api_key
    GOOGLE_API_KEY=your_google_api_key
    ```
- Place your PDF documents in the `financial_docs` directory.
- Run the Streamlit app:

  ```
  streamlit run app.py
  ```
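At startup, the app is expected to load these keys from `.env`; a minimal sketch, assuming `python-dotenv` is among the dependencies:

```python
import os
from dotenv import load_dotenv

# Pull GROQ_API_KEY and GOOGLE_API_KEY from .env into the process
# environment so the Groq and Google clients can pick them up.
load_dotenv()

assert os.getenv("GROQ_API_KEY"), "GROQ_API_KEY missing from .env"
assert os.getenv("GOOGLE_API_KEY"), "GOOGLE_API_KEY missing from .env"
```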
- Click the "Documents Embedding" button to initialize the vector store.
- Enter your question in the provided text input field.
- The RAG model retrieves the relevant document chunks and generates a response to your question. (A sketch of this wiring follows the list.)
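A sketch of how `app.py` might wire these steps together in Streamlit; `build_vector_store` is a hypothetical helper wrapping the indexing sketch above, and `document_chain` comes from the query sketch:

```python
import streamlit as st
from langchain.chains import create_retrieval_chain

st.title("Gemma RAG Q&A")  # title text is illustrative
question = st.text_input("Enter your question")

# The "Documents Embedding" button builds the FAISS index once and caches
# it in session state so later questions reuse the same vector store.
if st.button("Documents Embedding"):
    st.session_state.vector_store = build_vector_store()  # hypothetical helper
    st.success("Vector store is ready.")

if question and "vector_store" in st.session_state:
    retriever = st.session_state.vector_store.as_retriever()
    rag_chain = create_retrieval_chain(retriever, document_chain)
    st.write(rag_chain.invoke({"input": question})["answer"])
```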
Potential use cases include:
- Enterprise Knowledge Bases
- Customer Support
- Research Assistance
- Conversational AI
Contributions are welcome! Please open an issue or submit a pull request if you have any improvements or bug fixes.
This project is licensed under the MIT License.