Media Summary: Shishir Patal, a Research Scientist at Meta, delivered a presentation on In this video we are going to see how you can In this video we take a look at Ragas, a Python package made for

How To Evaluate Ai Agents Ai Agent Evaluation At Scale - Detailed Analysis & Overview

Shishir Patal, a Research Scientist at Meta, delivered a presentation on In this video we are going to see how you can In this video we take a look at Ragas, a Python package made for This video introduces a new series on testing Join Mahesh Yadav, top Maven instructor and former In this step-by-step tutorial, you'll discover how to

This lecture discusses the critical shift from Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech. Business owner or operator with a team? We build

Photo Gallery

Agentic Evals by Shishir Patil
How to Evaluate AI Agents? | AI Agent Evaluation at Scale
LLM as a Judge: Scaling AI Evaluation Strategies
How to Evaluate AI Agents: Comprehensive Strategies for Reliable, High‑Quality Agentic Systems
Evaluate AI Agents in  Python with Ragas
The agent evaluation revolution
How to set Evaluation for AI Agents & Scale them
Agent evaluation with ADK & Vertex AI | The Agent Factory Podcast
How to evaluate agents in practice
AI Agent evaluation: A complete guide to measuring performance
Scale AI Agent Evaluation with NVIDIA NeMo Evaluator LLM-as-a-Judge
The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)
View Detailed Profile
Agentic Evals by Shishir Patil

Agentic Evals by Shishir Patil

Shishir Patal, a Research Scientist at Meta, delivered a presentation on

How to Evaluate AI Agents? | AI Agent Evaluation at Scale

How to Evaluate AI Agents? | AI Agent Evaluation at Scale

In this video we are going to see how you can

LLM as a Judge: Scaling AI Evaluation Strategies

LLM as a Judge: Scaling AI Evaluation Strategies

Ready to become a certified watsonx

How to Evaluate AI Agents: Comprehensive Strategies for Reliable, High‑Quality Agentic Systems

How to Evaluate AI Agents: Comprehensive Strategies for Reliable, High‑Quality Agentic Systems

Evaluating AI agents

Evaluate AI Agents in  Python with Ragas

Evaluate AI Agents in Python with Ragas

In this video we take a look at Ragas, a Python package made for

The agent evaluation revolution

The agent evaluation revolution

This video introduces a new series on testing

How to set Evaluation for AI Agents & Scale them

How to set Evaluation for AI Agents & Scale them

Join Mahesh Yadav, top Maven instructor and former

Agent evaluation with ADK & Vertex AI | The Agent Factory Podcast

Agent evaluation with ADK & Vertex AI | The Agent Factory Podcast

Learn how to effectively

How to evaluate agents in practice

How to evaluate agents in practice

Evaluating Agents

AI Agent evaluation: A complete guide to measuring performance

AI Agent evaluation: A complete guide to measuring performance

Evaluating AI agents

Scale AI Agent Evaluation with NVIDIA NeMo Evaluator LLM-as-a-Judge

Scale AI Agent Evaluation with NVIDIA NeMo Evaluator LLM-as-a-Judge

In this step-by-step tutorial, you'll discover how to

The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)

The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)

Learn how to professionally

Agent Evaluation & Benchmarks - Agentic AI MOOC 2025 Lecture 4 Summary

Agent Evaluation & Benchmarks - Agentic AI MOOC 2025 Lecture 4 Summary

This lecture discusses the critical shift from

How to Evaluate AI Agents ?

How to Evaluate AI Agents ?

Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech.

AI Agent Evaluation with RAGAS

AI Agent Evaluation with RAGAS

RAGAS (RAG ASsessment) is an

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

Want to learn real

Evaluate N8N AI Agents & RAG like a PRO | N8N Evaluation Tutorial

Evaluate N8N AI Agents & RAG like a PRO | N8N Evaluation Tutorial

Evaluate

The Beginner’s Guide to n8n Evaluations (Optimize Your AI Agents)

The Beginner’s Guide to n8n Evaluations (Optimize Your AI Agents)

Business owner or operator with a team? We build

Beginner's Guide to Agent Evaluations

Beginner's Guide to Agent Evaluations

When companies deploy their