Skip to content

Latest commit

 

History

History
36 lines (22 loc) · 2.59 KB

README.md

File metadata and controls

36 lines (22 loc) · 2.59 KB

Chatbot

A simple, multi-user, multi-conversation, web-based chatbot.

Features

Multi User

Chatbot incorporates OpenID Connect for user identification. It relies on an external OAuth Client oidc-authservice to handle authentication and set a trusted userid Header to the downstream services. Alternatively, oauth2-proxy, which is more actively maintained, can be used in place of oidc-authservice.

Multi Conversation

Chatbot supports multiple conversations. Each conversation is identified by a unique conversationId. Conversations consists of a sequence of messages as well as metadata such as title, updatedAt. Message persistance is handled by langchain's RedisChatMessageHistory module, which leverages Redis for storing chat history. Metadata persistance is handled separately by redis-om, an object mapping library for Redis from Redis Labs. This separation of message content and metadata storage provides modularity and flexibility in the chatbot's underlying persistence architecture.

Streaming

Chatbot supports streaming LLM outputs to the user in real-time. Streaming messages are delivered via WebSockets, which enables bidirectional, full-duplex communication channels between the server and client.

On the LLM side, Chatbot uses Text Generation Inference (TGI), an open source library from HuggingFace, to host large language models for text generation. TGI provides out-of-the-box support for continuous batching, streaming inference, and other useful features for deploying production-ready LLMs. Using TGI eliminates the need to build complex serving infrastructure from scratch. Its continuous batching allows the chatbot to achieve high throughput by batching requests. Streaming inference enables the chatbot to return partial results instantly rather than waiting for the full output.

Architecture

Deployment

See deployment instructions

Configuration

Key Default Value Description
LLM http://localhost:8080 llm service config dict
SAFETY_LLM None safety llm service config dict
POSTGRES_PRIMARY_URL postgresql+psycopg://postgres:postgres@localhost:5432/ Database url to read / write messages and metadata
POSTGRES_STANDBY_URL None Database url to read messages and metadata
LOG_LEVEL INFO log level