AI App Template

A template project for document ingestion and querying built on AWS services.

Features

  • Written entirely in Rust
  • Serverless (AWS Lambda)
  • Deployed with cargo lambda
  • File-based vector graph stored in AWS S3
  • Ingestion queue system using DynamoDB Streams
  • Collection-based search (a collection is a group of documents)
  • DynamoDB as the main database
  • AWS Cognito for authentication
  • Slack integration
  • User teams

Architecture

[Figure: architecture diagram]

Project structure

  • common: Common functions, e.g. JWT decoding and reading environment variables
  • composer: Composes the LLM input prompt
  • database: Database module for interacting with DynamoDB
  • document: Document module that parses documents, builds document nodes, and splits documents with overlapped chunking
  • helpers: Helper functions for AWS services
  • indexer: Indexer module that builds the vector graph with embedding models
  • lambdas: AWS Lambda functions for ingestion with SQS, querying, and the Slack API
  • resources: PDFium resources that must be mounted in the AWS indexing lambda to parse PDFs
  • slack: Slack module that handles the Slack integration

How it works

Ingestion

  • When a user uploads a document, the system saves it in S3
  • An indexing task is created
  • The document analyzer analyzes the document layout and builds a document graph
  • A document vector graph is built from the document graph with an embedding model and stored in S3
  • An overlapped chunking method is applied to reduce the chance of incomplete context
  • The user can associate the document with a collection to query multiple documents together
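
The overlapped chunking step can be illustrated with a minimal sketch. This is not the code from the document module; the function name chunk_with_overlap and its chunk_size and overlap parameters are hypothetical, and the sketch assumes chunks are measured in characters. Each chunk repeats the tail of the previous one, so a sentence that straddles a boundary still appears whole in at least one chunk.

```rust
/// Split `text` into chunks of at most `chunk_size` characters, where each
/// chunk starts with the last `overlap` characters of the previous chunk.
/// Hypothetical helper for illustration; not the project's actual API.
pub fn chunk_with_overlap(text: &str, chunk_size: usize, overlap: usize) -> Vec<String> {
    assert!(chunk_size > 0, "chunk_size must be positive");
    assert!(overlap < chunk_size, "overlap must be smaller than chunk_size");

    // Index by chars rather than bytes so multi-byte UTF-8 text is never
    // split in the middle of a character.
    let chars: Vec<char> = text.chars().collect();
    let step = chunk_size - overlap;
    let mut chunks: Vec<String> = Vec::new();
    let mut start = 0;

    while start < chars.len() {
        let end = usize::min(start + chunk_size, chars.len());
        chunks.push(chars[start..end].iter().collect::<String>());
        if end == chars.len() {
            break; // this chunk reached the end of the text
        }
        start += step;
    }
    chunks
}
```

With chunk_size 4 and overlap 2, the text "abcdefghij" yields "abcd", "cdef", "efgh", "ghij". A larger overlap lowers the chance of cutting context at a boundary but increases the number of chunks to embed and store.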

Querying

  • The system receives a user query
  • The query is embedded with the embedding model
  • The system scans all documents in the target collection and filters them by cosine similarity
  • The system picks the top K document graph nodes
  • The system constructs the GPT prompt with the selected nodes as context
  • The system sends the enriched query to an external GPT service
  • When the system receives the response from the external GPT service, a callback request is triggered
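
The retrieval and prompt steps above can be sketched as follows. This is an illustration under assumptions, not the project's indexer or composer code: the Node struct, the function names, and the prompt wording are all hypothetical, and embeddings are assumed to be plain f32 vectors already loaded in memory.

```rust
/// Hypothetical document graph node: a text fragment plus its embedding.
#[derive(Debug, Clone)]
pub struct Node {
    pub text: String,
    pub embedding: Vec<f32>,
}

/// Cosine similarity of two vectors. Returns 0.0 for mismatched lengths or
/// zero-norm inputs instead of dividing by zero.
pub fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    if a.len() != b.len() {
        return 0.0;
    }
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        return 0.0;
    }
    dot / (norm_a * norm_b)
}

/// Score every node against the query embedding and keep the `k` best.
pub fn top_k<'a>(query: &[f32], nodes: &'a [Node], k: usize) -> Vec<(&'a Node, f32)> {
    let mut scored: Vec<(&'a Node, f32)> = nodes
        .iter()
        .map(|n| (n, cosine_similarity(query, &n.embedding)))
        .collect();
    // Highest score first; total_cmp gives a total order over f32 values.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(k);
    scored
}

/// Build a prompt that lists the selected nodes as context, then the question.
/// The wording is an assumption made for this sketch.
pub fn build_prompt(question: &str, context: &[(&Node, f32)]) -> String {
    let mut prompt =
        String::from("Answer the question using only the context below.\n\nContext:\n");
    for (node, _score) in context {
        prompt.push_str("- ");
        prompt.push_str(&node.text);
        prompt.push('\n');
    }
    prompt.push_str("\nQuestion: ");
    prompt.push_str(question);
    prompt
}
```

A full scan like this is linear in the number of nodes in the collection, which matches the "scan all documents" step described above and keeps the index a plain file in S3 rather than a separate vector database.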

Setup

  • Set up DynamoDB with the stream filter, which can be found in the readme file.
  • Mount the PDFium resources to every lambda that runs PDF parsing, e.g. the document-indexer lambda
  • Mount the embedding model resources to every lambda that runs embedding, e.g. the document-indexer and search-api lambdas
  • Map the API lambdas to API Gateway and set up auth

Deployment

Every lambda function in lambdas has two deployment commands.

  • Replace {{IAM_ROLE}} with the AWS IAM role for your project
  • cargo make stage: deploys the lambda function with the suffix -stage
  • cargo make production: deploys the lambda function with an optimized build