Skip to content

Releases: uptrain-ai/uptrain

v0.6.6.post1

05 Mar 11:54
5f4ad39
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.6.6...v0.6.6.post1

v0.6.6

04 Mar 10:45
7f5066b
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.6.5.post2...v0.6.6

v0.6.5.post2

01 Mar 11:03
b55f9f4
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.6.5.post1...v0.6.5.post2

v0.6.5.post1

01 Mar 09:29
ad9dc4d
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.6.5...v0.6.5.post1

v0.6.5

29 Feb 13:18
eda6229
Compare
Choose a tag to compare

Updated dependencies

v0.6.4

28 Feb 15:32
7944d92
Compare
Choose a tag to compare

All New User Interface

  • Dashboard
    • The new No-Code dashboard allows you to run evaluations without writing any code
    • You can visualize these results locally by running the dashboard on your system

New Operators

Introducing three new operators that fit right into your RAG pipelines.

Query Enhancement Operators

  • Sub Query Completeness
    • See how well the sub-queries cover the main question

Contextual Operators

  • Context Reranking
    • Check whether the reranking improved the order of your documents
  • Context Conciseness
    • Check if a part of your context is as good as the original context to answer your query

Integrations

API Integrations

  • Mistral API Integration

    • Use the up-and-coming Mistral LLMs to perform your evaluations.
  • Langfuse Integration

    • View traces, analyse use cases and user segments while evaluating your LLMs

v0.6.3

22 Feb 05:20
17be4b7
Compare
Choose a tag to compare
Update pyproject.toml (#551)

* Update pyproject.toml

* Update pyproject.toml

v0.6.2

21 Feb 15:51
a509198
Compare
Choose a tag to compare
Update pyproject.toml (#550)

v0.6.1

21 Feb 15:44
9974598
Compare
Choose a tag to compare

Updated dependencies

UpTrain v0.6

20 Feb 20:13
64a48d1
Compare
Choose a tag to compare

We are thrilled to announce a significant array of enhancements aimed at improving user experience, ease of use, and overall functionality in UpTrain v0.6!

New Features:

  1. Local Evaluation Capability ✨

    • Users can now run evaluations locally on their systems, providing more flexibility and control over the evaluation process.
  2. Custom Prompt Evaluation 🎛️

    • Introducing the ability to create custom evaluations tailored to specific user needs, empowering users with more control over the evaluation process.
  3. Scenario Description Parameter for Operators 📝

    • Operators can now specify additional context to the Language Model (LLM) using the scenario description parameter, enhancing the quality of evaluations.
  4. Modular Prompt Templates 🧩

    • Release of customizable prompt templates featuring customizable instructions, few-shot examples, scenario descriptions, and output formats, providing users with versatile tools for prompt creation.
  5. New Integrations 🚀

    • Vector DBs Integration 🔍
      • Integration with vector databases such as Qdrant, ChromaDB, and FAISS for RAG operations, query responses, and evaluation using UpTrain.
    • Framework Integration 🛠️
      • Integration with LLamaindex framework for streamlined operations.
    • LLM Providers Integration 💡
      • Integration with LLMs like Mistral and Llama from platforms such as Anyscale and Together AI for evaluation purposes.
    • LLM Embeddings Integration 🧠
      • Integration with Jina for generating embeddings to enhance RAG operations and evaluations.
  6. Research Integration 📚

    • UpTrain now incorporates the state-of-the-art Spade framework for auto-generating assertions to identify poor LLM outputs, facilitating seamless evaluation on user datasets.
  7. Root Cause Analysis 🕵️

    • UpTrain facilitates root cause analysis for failure issues in RAG pipelines, aiding in the identification and resolution of problems.
  8. Vector Search Integration 🔍

    • Enhanced vector search capability allows for comparing different embedding models, enabling users to derive more relevant context from vector databases.

New Evaluations:

  1. Jailbreak Detection 🚨

    • Identify attempts to perform illegal activities or misuse of the LLM. Users can specify a model purpose to ensure adherence to intended usage.
  2. Code Hallucination 💻

    • Determine the grounding of code generated by the LLM based on provided documents/context, ensuring coherence and relevance.