Friday October 29 7:00 PM – Friday October 29 8:30 PM in Workshop/Tutorial I

Bridging Data and Business: Power Plant Output Optimization Based on Electricity Market Price

Sylvia Lee

Prior knowledge:
Previous knowledge expected
Basic Python programming, previous exposure to common libraries such as pandas, numpy, matplotlib, and scikit-learn

Summary

“Success” in a data project may have different meanings. Learn how to set up the best foundations for success in data projects with a business mindset from experienced data analytic consultants and get a full hands-on experience on the complete data analytic process from data ingestion, modeling, to dashboarding on Digital Hub™, a cloud platform for integrated opensource data science tool stack.

Description

Bridging Data and Business is a tutorial workshop targeted for data scientists, especially for industry professionals with interests in exploring and entering the realm of data science. This tutorial is not designed to dive deep into specific topic of data science, but rather is a comprehensive business-oriented guide on the general data science processes that can be broadly applied to many problems and usecases.

The tutorial will showcase machine learning modelling process on a power plant energy output prediction usecase1, extended with a business action recommendation logic based on electricity market forecast data2, and cherry-topped with production grade dashboards to communicate valuable insights and business outcomes. This tutorial session aims to guide data enthusiast newcomers through the data analytic processes for practical productive results and trigger attendees’ thoughts and innovations in how to use data science tools and techniques for the best business outcomes. Attendees will also receive introductory exposure and hands-on experience to Digital Hub™ Platform

Approximate breakdown of materials is as following:

  1. 0 - 20 min: Introduction to Digital Hub™ data science platform; set contexts with topics on machine learning best practices; summarized processes in business contexts; and lessons learned from experience data analytic consultant
  2. 20 – 60 min: Walk through of solution development on the usecase, demonstrating the process from data ingestion, data exploration, feature engineering, machine learning model fitting, hyperparameter tuning, model validations, and outcome data ingestion into a Postgres database. Showcase extended business analysis with feeding the machine learning predictions into a recommendation engine with energy market forecast price.
  3. 60 – 70 min: Building production grade dashboards in Digital Hub™ Platform
  4. 70 – 90 min: Discussion and Q&A

The tutorial will be performed on Digital Hub™ which is a platform that aims to minimize the technical entry barriers to data science by automating the integrations of opensource toolstacks for a complete end-to-end data science pipeline. Users may use the platform with a free-tier subscription3 to follow along the tutorial and get exposure to opensource toolstacks.4

1 Data retrieved from UCI Machine Learning Repository- Combined Cycle Power Plant Data Set
2 Data retrieved from Alberta Electric System Operator (AESO) Historical Market Reporting
3 Free-tier subscription includes 40 hours application runtime and 5 GB storage. Any additional usage will induce extra charges
4 Opensource tool stacks will include JupyterLab, Apache Superset, Grafana and PostgreSQL. Highly technical attendees may choose to install themselves but help on installations and integration problems may receive limited aids in the tutorial.