Speed up data science with the Accelerated Data Science SDK Workshop

About This Workshop

The Oracle Data Science service is a fully managed, self-service platform that data science teams use to build, train, and manage machine learning (ML) models in Oracle Cloud Infrastructure. This lab introduces the Accelerated Data Science (ADS) SDK and shows how it can speed up your workflow and make you more productive.

In this module, we will build a binary classification model to predict employee attrition. Using the ADS SDK, we will do an exploratory data analysis (EDA) to understand the nature and distribution of the data, visualize the data, and assess the correlation between predictors. The Oracle AutoML tools will be used to train and automatically tune LightGBM (Light Gradient Boosting Machine), XGBoost, Random Forest, and Logistic Regression classifiers. These models will be evaluated and compared using the ADS model evaluation tools.

Once the best model is selected, we will use the machine learning explainability (MLX) tools to explain the global and local behavior of the model. That is, we will see which features are important in the model using feature permutation importance, partial dependence plots (PDP), individual conditional expectation (ICE) plots, and several other methods that help determine why the model made the prediction that it did.
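To give a feel for what feature permutation importance measures, here is a minimal sketch in plain Python. It does not use the ADS SDK or its MLX tools. The toy data, the feature names (`overtime_hours`, `desk_number`), and the rule-based `toy_model` are all hypothetical stand-ins chosen for illustration. The idea is to shuffle one feature column at a time and record how much the model's accuracy drops: a large drop suggests the model relies on that feature.

```python
import random


def accuracy(model, rows, labels):
    """Fraction of rows the model classifies correctly."""
    correct = sum(1 for row, y in zip(rows, labels) if model(row) == y)
    return correct / len(rows)


def permutation_importance(model, rows, labels, n_repeats=10, seed=0):
    """Mean drop in accuracy when each feature column is shuffled."""
    rng = random.Random(seed)
    baseline = accuracy(model, rows, labels)
    importances = []
    for j in range(len(rows[0])):
        drops = []
        for _ in range(n_repeats):
            # Shuffle only column j, leaving every other feature untouched.
            column = [row[j] for row in rows]
            rng.shuffle(column)
            shuffled = [
                row[:j] + (value,) + row[j + 1:]
                for row, value in zip(rows, column)
            ]
            drops.append(baseline - accuracy(model, shuffled, labels))
        importances.append(sum(drops) / n_repeats)
    return importances


# Hypothetical toy data: each row is (overtime_hours, desk_number).
# Attrition (label 1) depends only on overtime_hours in this made-up example.
data_rng = random.Random(42)
rows = [(data_rng.uniform(0, 20), data_rng.randint(1, 500)) for _ in range(200)]
labels = [1 if overtime > 10 else 0 for overtime, _ in rows]


def toy_model(row):
    """Hypothetical stand-in for a trained classifier."""
    return 1 if row[0] > 10 else 0


importances = permutation_importance(toy_model, rows, labels)
for name, score in zip(["overtime_hours", "desk_number"], importances):
    print(f"{name}: {score:.3f}")
```

Because `toy_model` ignores `desk_number`, shuffling that column cannot change any prediction, so its importance is exactly zero, while shuffling `overtime_hours` reduces accuracy noticeably. A real workflow would apply the same idea to a trained model and held-out data.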

Workshop Duration: 4 hours

Prerequisites

Workshop prerequisites:
  • Familiarity with Python is desirable, but not required.
  • Some understanding of the model building process is helpful.
  • Familiarity with Oracle Cloud Infrastructure (OCI) is helpful.

Ways To Run This Workshop

Free Trial

A Free Trial includes US$300 of free credits to use for up to 30 days on eligible Oracle Cloud Infrastructure services.

Always Free

Always Free services are available for an unlimited time. No cloud credits are required to use these services.

On Your Tenancy

Run on Your Tenancy using Oracle Universal Credits you've purchased.

Reserve on LiveLabs

Run on LiveLabs using our free tenancy. Your session can run for up to 6 hours.

Workshop Details

Outline

Workshop outline:
  • Learn about the Data Science service
  • Set up a free trial account
  • Configure your account to use the Data Science service
  • Create a Project
  • Create a Notebook
  • Learn about the Accelerated Data Science SDK
  • Do an exploratory data analysis (EDA)
  • Perform automatic feature engineering
  • Build a binary classification model using AutoML
  • Assess model quality
  • Learn about machine learning explainability (MLX)
  • Shut down the notebook session

Outcome

Take-aways from this workshop:

Do an exploratory data analysis (EDA) and build a binary classification model to predict employee attrition.

Tags

Level

Level: Beginner

Role

Roles: Data Scientist, Business Analyst

Focus

Focus Areas: Analytics, Serverless, AI/ML

Product

Products/Technologies: Data Science, Python
