Red teaming, safety testing, improved synthesizer, conversational metrics, and multi-modal metrics

Latest

penguine-ip released this 31 Oct 23:01

· 89 commits to main since this release

In DeepEval 1.4.7, we're releasing:

LLM red teaming: Safety test your LLM application for 40+ vulnerabilities with 10+ attack enhancements. Docs: https://docs.confident-ai.com/docs/red-teaming-introduction
Improved synthetic data synthesizer: Much more functionality and customizability. Docs: https://docs.confident-ai.com/docs/evaluation-datasets-synthetic-data
Conversational metrics: Dedicated metrics to evaluate LLM turns
Multi-modal metrics: Image editing and text-to-image evaluation
