How to Evaluate AI Applications

How to evaluate AI applications

Vertex

LLM as a Judge: Scaling AI Evaluation Strategies

Ready to become a certified watsonx

The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)

Learn how to professionally

Agentic Evals by Shishir Patil

Shishir Patil, a Research Scientist at Meta, delivered a presentation on

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

Want to learn real

Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan

Today, I want to share a new episode with Aman Khan. The best way to learn about

How to Evaluate AI Agents?

Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech.

A Practical Guide to Evaluating Generative AI Applications - Updated Nov 2025

In this comprehensive talk (adapted from my presentation at ODSC), I provide a practical, hands-on framework for

How to Evaluate (and Improve) Your LLM Apps

Want your team maximizing Claude? I run 1:1 and team

I Built An AI Agent - Here’s How I Test It

... I build productivity

Stop Guessing: How to Actually Measure AI Performance (AI Evals)

Are you still relying on the "vibe check" to

How to evaluate your Gen AI models with Vertex AI

Gen

Should You Use AI in Your College Application?

Want to work with us? Schedule a FREE intro meeting today: ...

How to evaluate an LLM application

How to evaluate

LLM Evaluation and Testing for Reliable AI Apps - MLOps Live #38 with Evidently AI

In this webinar, we heard firsthand about the challenges and opportunities presented by LLM observability. We discussed: ...

Stanford CME295 Transformers & LLMs | Autumn 2025 | Lecture 8 - LLM Evaluation

For more information about Stanford's graduate programs, visit: https://online.stanford.edu/graduate-education November 21, ...

Must-Learn AI Skill for PMs: AI Evals (and how to set them up)

NOTE: see our updated

AI Agents, Clearly Explained

My

How to Test AI Model (Hidden Bias & Fairness 🧠⚖️)

OpenAI's recent glitch revealed one of the many flaws in