Langfuse Launch Week Day 4 Experiments In Langfuse - Detailed Analysis & Overview
We're introducing a set of upgrades to make complex agents radically easier to understand and debug:
- Agent Tools now surface ...
The Evaluator Library lets you use LLM-as-a-Judge evals to monitor and score key metrics
In this video, our Co-Founder & CEO Marc walks you through the Evaluations product of the ...
Many users have millions of traces and observations. We've made it easier to filter and search
This video walks through a practical example of an N+1 evaluation process
Comments now support ... and emoji reactions, making it easier to collaborate with your team directly in ...
This video demonstrates how to simulate and evaluate multi-turn conversations