Skip to course content

Python with Dask

Master advanced Python development and scale analytics with Dask for big-data workloads.

Get Course Info

Audience: Developers / Architects

Duration: 3 days

Format: Lectures and hands-on labs (50% lecture, 50% lab)

Overview

Python has recently become the most popular language. It excels at data science, artificial intelligence, and other tasks but is also an outstanding language for web and service programming and general application development. With Dask, you can do more: Big Pandas, Parallel For Loops, Big Arrays, and Machine Learning.

Objective

Master advanced Python development and scale analytics with Dask for big-data workloads.

What You Will Learn

  • Intermediate Python Language
  • Web Programming
  • Database Programming
  • Data Analysis
  • Visualization
  • Deployment

Course Details

Audience: Developers / Architects

Duration: 3 days

Format: Lectures and hands-on labs (50% lecture, 50% lab)

Prerequisites:
  • Some background with Unix or Linux including the command line
  • Completion of a Python Intro class

Setup: A reasonably modern laptop or desktop · Unrestricted Internet · Chrome browser · SSH client

Detailed Outline

  • Installing Python
  • Python Versions
  • IDEs
  • Jupyter Notebook
  • Introducing NumPy
  • NumPy Arrays and Matrices
  • Introducing SciPy
  • Using SciPy
  • Statsmodels
  • Doing Stats in Python
  • Matplotlib
  • Seaborn
  • DataFrames
  • Schema inferences
  • Data exploration
  • Database Connectivity
  • Pandas and DB
  • ORM
  • Python Web Frameworks
  • Flask
  • Restful API with Flask
  • Making Your Own Packages
  • Deployment
  • Environments
  • How to use Containers with Python
  • Dockerizing Python
  • Writing C Modules
  • Using Python with Other Languages
  • TDD and Python
  • Unit test Frameworks
  • Big Pandas
  • Parallel For Loops
  • Big Arrays
  • Machine Learning
  • Dask Scheduler Overview
  • Task Graphs
  • Lazy Evaluation & Larger-than-Memory Data
  • Dask Arrays and Chunking
  • dask.delayed Workflows
  • Dask Futures & Real-Time Distributed Computing
  • Dask with XGBoost / Scikit-Learn
  • Dask-Distributed Clusters
  • Dask Bag for Unstructured Data
  • Performance Optimization & Profiling
  • Deploying on AWS / GCP / Azure
  • Scaling to Production

Ready to Get Started?

Contact us to learn more about this course and schedule your training.