Beyond the Prompt: Evaluating, Testing, and Securing LLM Applications - Mete Atamel - NDC Oslo 2025

youtube
Beyond the Prompt: Evaluating, Testing, and Securing LLM Applications - Mete Atamel - NDC Oslo 2025 This talk was recorded at NDC Oslo in Oslo, Norway. #ndcoslo #ndcconferences #developer #softwaredeveloper Attend the next NDC conference near you: Subscribe to our YouTube channel and learn every day: / @NDC Follow our Social Media! When you change prompts or modify the Retrieval-Augmented Generation (RAG) pipeline in your LLM applications, how do you know it’s making a difference? You don’t—until you measure. But what should you measure, and how? Similarly, how can you ensure your LLM app is resilient against prompt injections or avoids providing harmful responses? More robust guardrails on inputs and outputs are needed beyond basic safety settings. In this talk, we’ll explore various evaluation frameworks such as Vertex AI Evaluation, DeepEval, and Promptfoo to assess LLM outputs, understand the types of metrics they offer, and how these metrics are useful. We’ll also dive into testing and security frameworks like LLM Guard to ensure your LLM apps are safe and limited to precisely what you need.
  2025/07/31      youtube

Our Tag

最近投稿されたプログラミング学習動画

Senior Playstation Engineer's tips for learning new tools and getting

study

On this week's episode of the podcast, f...

  2025/08/01

WearOS Material 3 shape morphing | Jetpack Compose Tips

Spice up your Wear OS UIs with shape mor...

  2025/07/31

Pixel Pirate: Interactive DevTools demo

Arrr! Stop chasing disappearing UI! Expl...

  2025/07/31

Beyond the Prompt: Evaluating, Testing, and Securing LLM Applications

This talk was recorded at NDC Oslo in Os...

  2025/07/31

Securing AI RAG Pipelines with Fine Grained Authorization - Sohan Mahe

This talk was recorded at NDC Oslo in Os...

  2025/07/31

"Run Query Run" - A Fresh Look at SQL Wait Stats - Pinal Dave - NDC Os

sql

This talk was recorded at NDC Oslo in Os...

  2025/07/31

Accessibility by Everyone (and for Everyone) - Amy Kapernick - NDC Osl

This talk was recorded at NDC Oslo in Os...

  2025/07/31

Let's Fight a Dragon with Godot - Kristian Hiim - NDC Oslo 2025

This talk was recorded at NDC Oslo in Os...

  2025/07/31

Why Algorithms Work – Algorithm Analysis Deep Dive Course

study

This course is a university-level explor...

  2025/07/30

OpenID Connect Architectural Patterns - Anders Abel - NDC Oslo 2025

This talk was recorded at NDC Oslo in Os...

  2025/07/30

Helping solve for Agriculture with the power of Google AI

Google
農業

There are two capabilities that Google's...

  2025/07/30