Back To Schedule
Thursday, November 15 • 1:10pm - 1:30pm
Monitoring AI with AI

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

The best performing offline algorithm can lose in production. The most accurate model does not always improve business metrics. Environment misconfiguration or upstream data pipeline inconsistency can silently kill the model performance. Neither prodops, data science or engineering teams are skilled to detect, monitor and debug such types of incidents. Was it possible for Microsoft to test Tay chatbot in advance and then monitor and adjust it continuously in production to prevent its unexpected behaviour? Real mission critical AI systems require advanced monitoring and testing ecosystem which enables continuous and reliable delivery of machine learning models and data pipelines into production. Common production incidents include: - Data drifts, new data, wrong features - Vulnerability issues, adversarial attacks - Concept drifts, new concepts, expected model degradation - Dramatic unexpected drifts - Biased Training set / training issue - Performance issue In this demo based talk we discuss a solution, tooling and architecture that allows machine learning engineer to be involved in delivery phase and take ownership over deployment and monitoring of machine learning pipelines. It allows data scientists to safely deploy early results as end-to-end AI applications in a self serve mode without assistance from engineering and operations teams. It shifts experimentation and even training phases from offline datasets to live production and closes a feedback loop between research and production. Technical part of the talk will cover the following topics: - Automatic Data Profiling - Anomaly Detection - Deep Autoencoders - GANs - Density based Clustering of inputs and outputs of the model - Service Mesh, Envoy Proxy, trafic shadowing Demo part of the talk will simulate a real life concept drift as well as new concepts for the model and different algorithms that will catch those drifts in operational environment.

avatar for Stepan Pushkarev

Stepan Pushkarev

CTO, Hydrosphere.io
hydrosphere.io CTOAutomation of AI/ML Operations: deployment, serving, monitoring, subsampling, retraining.

Thursday November 15, 2018 1:10pm - 1:30pm PST