Initial release

Chat engines

This release contains three chat engines:

guru-snap: A prototype that uses content from Guru cards exported as JSON
bridges-eligibility-manual: A prototype that uses content from Bridges Eligibility Manual PDFs
ca-edd-web: A chatbot that uses content found on edd.ca.gov

You may wish to deploy this release or run it locally if you need to run guru-snap or bridges-eligibility-manual, as these prototypes will likely be deprecated in a future release.

What's Changed

feat: Install Application-Flask template by @KevinJBoyer in #1
DST-257: install infra temp by @ccheng26 in #3
feat: Add Chainlit by @KevinJBoyer in #2
Remove Flask and example database models by @KevinJBoyer in #4
feat: Add MockSentenceTransformer by @KevinJBoyer in #5
feat: Add LiteLLM by @KevinJBoyer in #6
DST-263: chainlit healthcheck by @ccheng26 in #7
DST-260: configure env for pgvector by @ccheng26 in #8
feat: Add Document and Chunk models by @KevinJBoyer in #9
DST-258: deploy infra by @ccheng26 in #10
DST-258: deploy infra- enable tests by @ccheng26 in #11
feat: Ingest Guru Cards by @KevinJBoyer in #13
DST-271 feat: generate llm result by @ccheng26 in #12
fix import by @ccheng26 in #14
feat: Augment response with retrieved cards by @KevinJBoyer in #15
feat: Show accordions with cards by @KevinJBoyer in #16
feat: Require login by @KevinJBoyer in #17
fix: Pin Terraform version for deploys and migrations by @KevinJBoyer in #18
perf: Lower database auto-scaling settings by @KevinJBoyer in #19
feat: add ollama call to get_models by @ccheng26 in #20
feat: productionize uvicorn by @ccheng26 in #22
fix: Conditionally import ollama by @yoomlam in #23
feat: Use URL query 'engine' parameter to set chatbot's configuration by @yoomlam in #25
fix: set uvicorn worker to 1 by @ccheng26 in #26
feat: Enable ingest of distinct datasets by @yoomlam in #27
feat: Set default chat engine to 'guru-snap' by @yoomlam in #28
feat: Retrieve from database based on filters by @yoomlam in #29
feat: add similarity score to accordion value by @ccheng26 in #30
refactor: Use generalized ChunkWithScore by @yoomlam in #32
refactor: Move db_session and embedding_model args into AppConfig by @yoomlam in #33
refactor: Remove extraneous MockAppConfig and monkeypatch parameter in tests by @yoomlam in #35
feat: Add ingest-policy-pdfs command to print out PDF file list by @ccheng26 in #31
feat: Normalize similarity scores of retrieved Guru cards by @yoomlam in #36
fix: retrieve build date and service env by @ccheng26 in #34
feat: Add retrieval and docs_shown thresholds by @yoomlam in #37
feat: Add LLM selection in Chainlit by @yoomlam in #38
feat: Use URL query params to set initial chat settings by @yoomlam in #39
feat: chunk and store BEM pdf by @ccheng26 in #40
fix: Commit after ingesting Guru cards by @KevinJBoyer in #42
feat: Enable BEM Chatbot by @KevinJBoyer in #41
feat:add branding logos by @ccheng26 in #43
feat: Make ingestion drop dataset if already exists by @KevinJBoyer in #44
feat: add accordion chunk citations by @ccheng26 in #46
fix: Rename docs_shown_* variable to chunks_shown_* by @yoomlam in #47
bug: fix accordion by @ccheng26 in #48
fix: Read titles from PDF metadata by @KevinJBoyer in #49
feature: Log metadata for retrieved chunks by @KevinJBoyer in #50
bug: citation duplicates and accordion overflow by @ccheng26 in #51
feat: Link to BEM documents by @KevinJBoyer in #52
feat: Support Jupyter notebooks by @KevinJBoyer in #53
Jupyter notebook: exploration into pdfminer.six capabilities by @yoomlam in #54
Jupyter notebook: Investigate unstructured for parsing semantics from PDFs by @KevinJBoyer in #55
feat: Add extract_outline() PDF utility by @yoomlam in #57
feat: Group markdown text list items by @KevinJBoyer in #56
fix: Merge list items only if they have the same heading by @yoomlam in #61
feat: BEM ingest skeleton by @yoomlam in #59
feat: Convert list items to markdown_texts by @yoomlam in #60
DST-401: inline citations by @ccheng26 in #58
feat: update db schema for chunk by @ccheng26 in #62
feat: Utility to extract bolded text from PDFs by @yoomlam in #63
feat: Associate and apply stylings to create bolded markdown by @yoomlam in #64
feat: Format links as markdown by @yoomlam in #65
feat: Merges texts that are split across consecutive pages by @yoomlam in #66
test: Use 707.pdf to test ingest_bem_pdfs.py by @yoomlam in #70
feat: Save BEM JSON chunks to S3 by @KevinJBoyer in #69
feat: enrich text using unstructured data by @ccheng26 in #68
fix: Add heuristics and fixes to improve BEM pdf parsing by @yoomlam in #71
feat: Update citation UI by @KevinJBoyer in #73
feat: Split long paragraphs and lists into chunks by @yoomlam in #72
feat: add dash formatting to list items by @ccheng26 in #74
feat: add ellipses to start/end of text for chunks by @ccheng26 in #75
package update: format files and update black by @ccheng26 in #77
feat: add headings to context by @ccheng26 in #76
CI: Post test coverage report to PR by @yoomlam in #78
feat: Group text with the same heading by @ccheng26 in #79
fix: Address BEM PDF ingestion error cases with bigger chunks by @yoomlam in #80
feat: Sub-chunk citations by @KevinJBoyer in #81
fix: Prevent citations from being rendered inline by Chainlit by @KevinJBoyer in ...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial release

Chat engines

What's Changed

Contributors

Releases: navapbc/labs-decision-support-tool

v0.0.0: guru-snap, bridges-eligibility-manual, ca-edd-web

Initial release

Chat engines

What's Changed

Contributors