ML Next Step: ML Ops

(c)Copyright Elephant Scale

July 1, 2021

Overview

Machine Learning and AI represent a great opportunity. All too often, taking a Machine Learning prototype to production makes a difference between success and failure in the AI strategy of a company. This need is fulfilled by Machine Learning Engineers who apply the rules of DevOps to AI.

This course teaches how to take Machine Learning and AI and reduce it to practice. Thus, the name: ML Ops.

Audience

Developers, team leads, project managers

Skill Level

Introductory – Intermediate

Duration

Three days

Format

Lectures and hands-on labs (50% – 50%)

Prerequisites

Comfortable developing code in the target environment

Lab environment

Zero Install: There is no need to install software on students’ machines!
A lab environment in the cloud will be provided for students.

Students will need the following

A reasonably modern laptop with unrestricted connection to the Internet. Laptops with overly restrictive VPNs or firewalls may not work properly.
- A checklist to verify connectivity will be provided
Chrome browser

Detailed outline

Rise of the Machine Learning Engineer and MLOps

What is MLOps
DevOps and MLOps
An MLOps Hierarchy of Needs
Implementing DevOps
DataOps and Data Engineering
Platform Automation
MLOps
Where Can You Deploy?
Conclusion

MLOps Foundations

Bash and the Linux Command Line
Cloud Shell Development Environments
Bash Shell and Commands
List Files
Run Commands
Files and Navigation
Input/Output
Configuration
Writing a Script
Cloud Computing Foundations & Building Blocks
Machine Learning Key Concepts
Build an MLOps Pipeline from Zero

MLOps For Containers And Edge Devices

Containers
Serving a trained model over HTTP
Edge Devices
Coral
Azure Percept
TFHub

Continuous Delivery For Machine Learning Models

Packaging for ML Models
Infrastructure as Code for Continuous Delivery of ML Models
Using Cloud Pipelines
Controlled Rollout of models
Testing techniques for Model Deployment

Monitoring And Logging

Introduction to Logging
Logging in Python
Modifying log levels
Logging different applications
Monitoring Drift with AWS SageMaker
Monitoring Drift with Azure ML

MLOps For Azure

Azure CLI and Python SDK
Authentication
Service Principal
Authenticating API Services
Compute Instances
Deploying
Registering Models
Versioning datasets
Deploying Models to a Compute Cluster
Configuring a Cluster
Deploying a Model
Azure ML Pipelines

Machine Learning Interoperability

Why interoperability is critical
ONNX: Open Neural Network Exchange
Convert PyTorch into ONNX
Convert TensorFlow into ONNX
Apple Core ML