Media Summary: Many users have millions of traces and observations. We've made it easier to filter and We're introducing a set of upgrades to make complex agents radically easier to understand and debug: - Agent Tools now surface ... The Evaluator Library lets you use LLM-as-a-Judge evals to monitor and score key metrics for your LLM applications or AI agents.

Langfuse Launch Week 3 Day 1 Full Text Search - Detailed Analysis & Overview

Many users have millions of traces and observations. We've made it easier to filter and We're introducing a set of upgrades to make complex agents radically easier to understand and debug: - Agent Tools now surface ... The Evaluator Library lets you use LLM-as-a-Judge evals to monitor and score key metrics for your LLM applications or AI agents. Custom Dashboards save views that show the numbers you care about and keep every team on top of what ... Comments now support and emoji reactions, making it easier to collaborate with your team directly in Want to start freelancing? Let me help: Want to learn real AI Engineering? Go here: ...

Hi my name is Max co-founder of l fuse and today I want to In this video our Co-Founder & CEO Marc walks you through the Evaluations product of the Introducing LLM-as-a-judge Evaluation for Dataset Experiments in This video explains how to use OpenTelemetry and Turn production failures into repeatable evals by capturing failing LLM runs with

Photo Gallery

Langfuse Launch Week 3, Day 1: Full Text Search
Langfuse Launch Week Day 1: New Filters for Tables and API
Langfuse Launch Week Day 3: Agent Tracing and Evaluation
Langfuse Launch Week 3, Day 6: Langfuse Evaluator Library
Langfuse Launch Week 3, Day 3: Custom Dashboards
Langfuse Launch Week Day 2: Collaborate with your team directly in Langfuse
Get Started with Langfuse - Open-Source LLM Monitoring
Langfuse Launch Week Day 4: Experiments in Langfuse
Langfuse Launch Week 1: Model-based Evaluation
Langfuse Intro - Evaluations Deep Dive
Langfuse vs LangSmith vs Opik. Pick Wrong, Lose Months.
LLM-as-a-Judge Evaluation for Dataset Experiments in Langfuse
View Detailed Profile
Langfuse Launch Week 3, Day 1: Full Text Search

Langfuse Launch Week 3, Day 1: Full Text Search

Day 1

Langfuse Launch Week Day 1: New Filters for Tables and API

Langfuse Launch Week Day 1: New Filters for Tables and API

Many users have millions of traces and observations. We've made it easier to filter and

Langfuse Launch Week Day 3: Agent Tracing and Evaluation

Langfuse Launch Week Day 3: Agent Tracing and Evaluation

We're introducing a set of upgrades to make complex agents radically easier to understand and debug: - Agent Tools now surface ...

Langfuse Launch Week 3, Day 6: Langfuse Evaluator Library

Langfuse Launch Week 3, Day 6: Langfuse Evaluator Library

The Evaluator Library lets you use LLM-as-a-Judge evals to monitor and score key metrics for your LLM applications or AI agents.

Langfuse Launch Week 3, Day 3: Custom Dashboards

Langfuse Launch Week 3, Day 3: Custom Dashboards

Custom Dashboards save views that show the numbers you care about and keep every team on top of what ...

Langfuse Launch Week Day 2: Collaborate with your team directly in Langfuse

Langfuse Launch Week Day 2: Collaborate with your team directly in Langfuse

Comments now support @mentions and emoji reactions, making it easier to collaborate with your team directly in

Get Started with Langfuse - Open-Source LLM Monitoring

Get Started with Langfuse - Open-Source LLM Monitoring

Want to start freelancing? Let me help: https://academy.datalumina.com/freelance Want to learn real AI Engineering? Go here: ...

Langfuse Launch Week Day 4: Experiments in Langfuse

Langfuse Launch Week Day 4: Experiments in Langfuse

It's

Langfuse Launch Week 1: Model-based Evaluation

Langfuse Launch Week 1: Model-based Evaluation

Hi my name is Max co-founder of l fuse and today I want to

Langfuse Intro - Evaluations Deep Dive

Langfuse Intro - Evaluations Deep Dive

In this video our Co-Founder & CEO Marc walks you through the Evaluations product of the

Langfuse vs LangSmith vs Opik. Pick Wrong, Lose Months.

Langfuse vs LangSmith vs Opik. Pick Wrong, Lose Months.

You spent three

LLM-as-a-Judge Evaluation for Dataset Experiments in Langfuse

LLM-as-a-Judge Evaluation for Dataset Experiments in Langfuse

Introducing LLM-as-a-judge Evaluation for Dataset Experiments in

10 min Walkthrough of Langfuse – Open Source LLM Observability, Evaluation, and Prompt Management

10 min Walkthrough of Langfuse – Open Source LLM Observability, Evaluation, and Prompt Management

Learn more: https://

OTEL Observability with Langfuse for Strands Agents

OTEL Observability with Langfuse for Strands Agents

This video explains how to use OpenTelemetry and

Langfuse Tracing in Python: Turn LLM Failures into Eval Tests

Langfuse Tracing in Python: Turn LLM Failures into Eval Tests

Turn production failures into repeatable evals by capturing failing LLM runs with