“Success” in a data project may have different meanings. Learn how to set up the best foundations for success in data projects with a business mindset from experienced data analytic consultants and get a full hands-on experience on the complete data analytic process from data ingestion, modeling, to dashboarding on Digital Hub™, a cloud platform for integrated opensource data science tool stack.
Bridging Data and Business is a tutorial workshop targeted for data scientists, especially for industry professionals with interests in exploring and entering the realm of data science. This tutorial is not designed to dive deep into specific topic of data science, but rather is a comprehensive business-oriented guide on the general data science processes that can be broadly applied to many problems and usecases.
The tutorial will showcase machine learning modelling process on a power plant energy output prediction usecase1, extended with a business action recommendation logic based on electricity market forecast data2, and cherry-topped with production grade dashboards to communicate valuable insights and business outcomes. This tutorial session aims to guide data enthusiast newcomers through the data analytic processes for practical productive results and trigger attendees’ thoughts and innovations in how to use data science tools and techniques for the best business outcomes. Attendees will also receive introductory exposure and hands-on experience to Digital Hub™ Platform
Approximate breakdown of materials is as following:
The tutorial will be performed on Digital Hub™ which is a platform that aims to minimize the technical entry barriers to data science by automating the integrations of opensource toolstacks for a complete end-to-end data science pipeline. Users may use the platform with a free-tier subscription3 to follow along the tutorial and get exposure to opensource toolstacks.4
1 Data retrieved from UCI Machine Learning Repository- Combined Cycle Power Plant Data Set
2 Data retrieved from Alberta Electric System Operator (AESO) Historical Market Reporting
3 Free-tier subscription includes 40 hours application runtime and 5 GB storage. Any additional usage will induce extra charges
4 Opensource tool stacks will include JupyterLab, Apache Superset, Grafana and PostgreSQL. Highly technical attendees may choose to install themselves but help on installations and integration problems may receive limited aids in the tutorial.