Alex Volkov - AI in production: Observe, Compile, Eval

Building LLM-based applications is exciting, but keeping them reliable and performant in production is a challenge, even for the best teams. How do you track user interactions? Diagnose issues? Prevent regressions when prompts or models change? At Weights & Biases, we’ve worked with hundreds of companies to tackle questions like these.

Date: Feb 22
Time: 12:30pm - 1:50pm
Track: Workshops
Room: AWS JFK27 (12 W 39th St) 300/301 - entrance 39th St & 5th Ave, large gold doors, bring ID

Join us for this exclusive workshop, where we’ll guide you through building and evaluating an AI application using a proven, future-proof workflow. You’ll learn how to:

Trace and log everything
Collect and leverage user feedback
Build evaluation datasets
Design evaluations (programmatic, human-in-the-loop (HITL), LLM-as-a-judge)
Confidently iterate on and optimize the performance of your applications
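
To give a flavor of the simplest item above, a programmatic evaluation can be sketched in a few lines of plain Python. This is only an illustration, not the workshop material: the dataset, the `stub_app` function standing in for an LLM application, and the `exact_match` scorer are all hypothetical names made up for this example.

```python
# Hypothetical evaluation dataset: each row pairs an input with the expected answer.
DATASET = [
    {"question": "2 + 2", "expected": "4"},
    {"question": "capital of France", "expected": "Paris"},
    {"question": "3 * 3", "expected": "9"},
]


def stub_app(question: str) -> str:
    """Stand-in for the LLM application under test (assumption: no real model call)."""
    canned = {"2 + 2": "4", "capital of France": "paris", "3 * 3": "6"}
    return canned.get(question, "")


def exact_match(output: str, expected: str) -> bool:
    """Programmatic scorer: case-insensitive, whitespace-trimmed equality."""
    return output.strip().lower() == expected.strip().lower()


def run_eval(app, dataset, scorer) -> float:
    """Run the app on every row and return the fraction the scorer accepts."""
    results = [scorer(app(row["question"]), row["expected"]) for row in dataset]
    return sum(results) / len(results)


accuracy = run_eval(stub_app, DATASET, exact_match)
print(f"accuracy: {accuracy:.2f}")
```

Re-running the same dataset and scorer after a prompt or model change is one way to catch a regression before it reaches users. Human-in-the-loop and LLM-as-a-judge evaluations keep the same loop and swap out the scorer.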

No prior experience with evaluations? No problem. Just bring your laptop, and leave with practical skills to future-proof your AI applications.


By invitation only.

Apply here.