cleanlab-tools

Miscellaneous cookbooks and code made available for purposes of education, reproducibility, and transparency.

Example	Description
TLM-Demo-Notebook	Showcasing various applications of the Trustworthy Language Model
TLM-PII-Detection	Find and mask PII with the Trustworthy Language Model
TLM-Record-Matching	Using the Trustworthy Language Model to reliably match records between two different data tables.
TLM-SimpleQA-Benchmark	Benchmarking TLM and OpenAI LLMs on the SimpleQA dataset
benchmarking_hallucination_metrics	Evaluate the performance of popular real-time hallucination detection methods on RAG benchmarks.
benchmarking_hallucination_model	Evaluate the performance of popular hallucination detection models on RAG benchmarks.
fine_tuning_data_curation	Notebook showing how to use Cleanlab TLM and Cleanlab Studio to detect bad data in instruction tuning LLM datasets.
Detecting GDPR Violations with TLM	Notebook showing the code used to analyze application logs using TLM to detect GDPR violations
Customer Support AI Agent with NeMo Guardrails	Reliable customer support AI Agent with Guardrails and trustworthiness scoring
few_shot_prompt_selection	Notebook showing how to clean few-shot examples pool to improve prompt template for OpenAI LLM.
fine_tuning_classification	Notebook showing how to use Cleanlab Studio to improve the accuracy of fine-tuned LLMs for classification tasks.
generate_llm_response	Notebook showing how to generate LLM responses for customer service requests using Llama 2 and OpenAI's API.
gpt4-rag-logprobs	Notebook showing how to obtain logprobs from a GPT-4 based RAG system.
fine_tuning_mistral_beavertails	Analyze human annotated AI-safety-related labels (like toxicity) using Cleanlab Studio, and thus generate safer responses from LLMs.
Evaluating_Toxicity_Datasets_Large_Language_Models	Notebook on analyzing toxicity annotations in the Jigsaw dataset using Cleanlab Studio.
time_series_automl	Notebook showing how to model time series data in a tabular format and use AutoML with Cleanlab Studio to improve out-of-sample accuracy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cleanlab-tools

About

Releases

Packages

Contributors 11

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 182 Commits
NeMo-Guardrails-Customer-Support		NeMo-Guardrails-Customer-Support
TLM-Demo-Notebook		TLM-Demo-Notebook
TLM-PII-Detection		TLM-PII-Detection
TLM-Record-Matching		TLM-Record-Matching
TLM-SimpleQA-Benchmark		TLM-SimpleQA-Benchmark
TLM-o1-benchmark		TLM-o1-benchmark
benchmarking_hallucination_metrics		benchmarking_hallucination_metrics
benchmarking_hallucination_model		benchmarking_hallucination_model
few_shot_prompt_selection		few_shot_prompt_selection
fine_tuning_classification		fine_tuning_classification
fine_tuning_data_curation		fine_tuning_data_curation
fine_tuning_mistral_beavertails		fine_tuning_mistral_beavertails
gdpr_tlm_blog_post		gdpr_tlm_blog_post
generate_llm_response		generate_llm_response
gpt4-rag-logprobs		gpt4-rag-logprobs
jigsaw_ai_safety_keras		jigsaw_ai_safety_keras
time_series_automl		time_series_automl
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

License

cleanlab/cleanlab-tools

Folders and files

Latest commit

History

Repository files navigation

cleanlab-tools

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 11

Languages

Packages