Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How can we search through all help documents using RAG LLM? #24

Open
rowlandm opened this issue Nov 20, 2023 · 7 comments
Open

How can we search through all help documents using RAG LLM? #24

rowlandm opened this issue Nov 20, 2023 · 7 comments

Comments

@rowlandm
Copy link
Contributor

Might make it easier to find?

@rowlandm rowlandm changed the title How can we search through all documents? How can we search through all help documents? Dec 17, 2023
@rowlandm
Copy link
Contributor Author

This includes searching across the FAQ, onboarding document in figshare, handbook in figshare, all the articles off the website etc.

@rowlandm
Copy link
Contributor Author

Example of using a chat bot to help open source maintainers

https://livablesoftware.com/slack-chatbot-github-repositories/

@rowlandm
Copy link
Contributor Author

Another example

https://github.com/derekvawdrey/website-to-chatbot

Would need to convert pdfs to text on website

@rowlandm rowlandm changed the title How can we search through all help documents? How can we search through all help documents using RAG LLM? Oct 7, 2024
@rowlandm
Copy link
Contributor Author

rowlandm commented Oct 7, 2024

@rowlandm
Copy link
Contributor Author

rowlandm commented Oct 8, 2024

RAGged Edge Box: A Personal AI-Powered Document Search System

Details
One of the most popular embodiments of Generative AI are information retrieval (IR) augmented generation (RAG). Such systems use an information retrieval engine (based on semantic embeddings or keyword search) and then use a Large Language Model (LLM) to extract answers to a given query.

These systems require a large amount of computation and are usually implemented in the cloud which presents data privacy issues.

In this talk we will present The RAGged Edge Box project in which basic embedding systems and small local LLMs are packaged inside a multi-platform virtual machine (VirtualBox). The system provides a Web interface that runs locally and allows access to the RAG functionality in a completely private manner. The neural networks run on a ONNX runtime and do not require a GPU. RAG code is implemented in PHP and is easy to modify, requiring a much smaller execution environment than a Python alternative.

https://textualization.com/ragged/

@rowlandm
Copy link
Contributor Author

@rowlandm
Copy link
Contributor Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant