Alex Volkov - AI in production: Observe, Compile, Eval

Building LLM-based applications is exciting, but keeping them reliably performant in production is a challenge—even for the best teams. How do you track user interactions? Diagnose issues? Prevent regressions when prompts or models change? At Weights & Biases, we’ve worked with hundreds of companies to tackle questions like these.

Date
Time
Track: Workshops
Room: AWS JFK27 (12 W 39th St) 300/301 - entrance 39th St & 5th Ave, large gold doors, bring ID

Join us for this exclusive workshop, where we’ll guide you through building and evaluating an AI application using a proven, future-proof workflow. You’ll learn how to:

  • Trace and log everything
  • Collect and leverage user feedback
  • Build evaluation datasets
  • Design evaluations (Programmatic, HITL, LLM as a judge)
  • Confidently iterate and optimize performance of your applications

No prior experience with evaluations? No problem. Just bring your laptop, and leave with practical skills to future-proof your AI applications.

Alex Volkov

Alex Volkov is an AI Evangelist at Weights & Biases and the founder and host of ThursdAI, a weekly newsletter and podcast that explores the latest innovations in AI, their practical applications, and the open-source AI community. Alex is an AI startup founder with 20 years of full-stack software engineering experience, offering a deep well of insights into AI innovation. He’s celebrated for his ability to clarify and summarize the complexities of rapid AI advances and for advocating beneficial uses of AI.

Buy AI Engineer Summit 2025 Tickets

By invitation only.
