Our Team

Cinque Terre

Shaza Safi

Cinque Terre

Danielle Spring

Cinque Terre

Ben Mogil

University of Toronto Data Analytics Bootcamp

Overview

The Sunshine list is annual list includes public sector employees in Ontario who earned over $100,000 last year. The list is mandated by the Public Sector Salary Disclosure Act enacted by the Ontario government under Premier Mike Harris, with the first list released in 1996.

Objective

By analyzing public sector incomes for those employees who earned over $100,000 annually, we seek to determine if the yearly publication can be used to evaluate fairness in the Ontario workforce. In our analysis we will be using the data to determine trends based on salaries and gender. The sunshine list does not denote gender therefore we will use machine learning tools to predict gender.

TECHNOLOGIES USED:

Python

The main language used through the project.

Pandas

Datasets cleansing, filtering and merging to a single dataset for further analysis.

Matplotlib

Used for exploratory analysis.

sklearn

Used sklearn & nltk library for machine learning

HTML

Project Website development using html and JavaScript.

Bootstrap

Project Website responsiveness and design.

Dataset

Dataset used: Kaggle Dataset

We used kaggle consolidated dataset for Ontario Sunshine List 1996 to 2019 and appended the 2020 sunshine list to it.

Stats Canada Wages

We used employee wages by occupation annualy to compare public sector wages to private sector wages.

Stats Canada Inflation

We used Consume Price Index (CPI) statistics to compare if consumer inflation over the years has been taken into account in the sunshine list wages.

205,000

Number of Ontarian employees on the sunshine list in year 2020

Machine Learning Diagram

This video is a Demo of how the dashboard works!

This video is a Demo of how the dashboard works!.

This is our awesome dashboard using tableau!

Designed by BootstrapMade