Quick Overview: Shishir Patal, a Research Scientist at Meta, delivered a presentation on Today, I want to share a new episode with Aman Khan. The best way to learn about Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech.

How To Evaluate Ai Applications - Detailed Overview & Context

Shishir Patal, a Research Scientist at Meta, delivered a presentation on Today, I want to share a new episode with Aman Khan. The best way to learn about Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech. In this comprehensive talk (adapted from my presentation at ODSC), I provide a practical, hands-on framework for Want your team maximizing Claude? I run 1:1 and team Are you still relying on the "vibe check" to

Want to work with us? Schedule a FREE intro meeting today: ... In this webinar, we heard firsthand about the challenges and opportunities presented by LLM observability. We discussed: ... For more information about Stanford's graduate programs, visit: November 21, ... OpenAI's recent glitch revealed one of the many flaws in

Photo Gallery

How to evaluate AI applications
LLM as a Judge: Scaling AI Evaluation Strategies
The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)
Agentic Evals by Shishir Patil
How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)
Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan
How to Evaluate AI Agents ?
A Practical Guide to Evaluating Generative AI Applications - Updated Nov 2025
How to Evaluate (and Improve) Your LLM Apps
I Built An AI Agent - Here’s How I Test It
Stop Guessing: How to Actually Measure AI Performance (AI Evals)
How to evaluate your Gen AI models with Vertex AI
Sponsored
Sponsored
View Main Result
Sponsored
Sponsored