Assignment title: Information
Description Possible Marks and Wtg(%) Word
Count
Due date
Assignment 2 Written Practical Report 100 marks 15% Weighting 2000 26/04/17
The key frameworks and concepts covered in modules 1–5 are particularly relevant for
this assignment. Assignment 2 relates to the specific course learning objectives 1, 2 and
4 and associated MBA program learning goals and skills: Global Content, Problem
solving, Critical thinking, and Written Communication at level 3:
1. demonstrate applied knowledge of people, markets, finances, technology and
management in a global context of business intelligence practice (data warehouse
design, data mining process, data visualisation and performance management) and
resulting organisational change and how these apply to implementation of business
intelligence in organisation systems and business processes
2. identify and solve complex organisational problems creatively and practically through
the use of business intelligence and critically reflect on how evidence based decision
making and sustainable business performance management can effectively address real
world problems
4. demonstrate the ability to communicate effectively in a clear and concise manner in
written report style for senior management with correct and appropriate
acknowledgment of main ideas presented and discussed.
Note you must use RapidMiner Studio for Task 2 and Tableau Desktop for Task 3
in this Assignment 2. Failure to do so may result in Task 2 and/or 3 not being marked
and zero marks awarded.
Note carefully University policy on Academic Misconduct such as plagiarism,
collusion and cheating. If any of these occur they will be found and dealt with by the
USQ Academic Integrity Procedures. If proven, Academic Misconduct may result in
failure of an individual assessment, the entire course or exclusion from a University
program or programs.Assignment 2 consists of three main tasks and a number of sub tasks
Task 1 Data Driven Decision Making (Worth 40 Marks)
Data driven decision-making – (3D) - Increasingly organisations are looking to make
decisions that are based on the evidence of real data. Data analytics toolsets are evolving
rapidly and many organisations have invested heavily in information architecture so that they
have the capability to move transactional data and external data into data warehouses and
provide end-users with specific data analytics software applications to support evidence
based (data driven) decision making. However, many organisations are still struggling to
make the transition to a data driven decision making paradigm.
Your task as the Data Analytics Lead in XYZ company is to conduct a review of the relevant
literature regarding data driven decision-making (3D) and prepare a strategic briefing report
of about 1350 words for the Chief Executive Officer of XYZ company on key aspects of data
driven decision making as outlined below:
Task 1.1) Identify from the existing literature and discuss the relevant decision making
theories and frameworks which would inform a deeper understanding of the decision making
process in organisations (about 700 words)
Task 1.2) Provide a comprehensive definition of data driven decision making and explain
briefly how your definition has been informed by specific literature on data driven decisionmaking (about 150 words)
Task 1.3) Based on an understanding of how decisions are made in organisations, discuss
how changing organisational culture would be important for Company XYZ to successfully
make the transition to a data driven decision making paradigm (about 500 words)Task 2 Exploratory Data Analysis and Decision Tree Analysis (Worth 25 Marks)
Task 2.1) Conduct an exploratory data analysis of the patient-health.csv data set using the
RapidMiner Studio data mining tool. Summarise the findings of your exploratory data
analysis in terms of describing key characteristics of each of the variables in the patienthealth.csv data set such as maximum, minimum values, average, standard deviation, most
frequent values (mode), missing values and invalid values etc and relationships with other
variables if relevant in a table named Table 2.1 Results of Exploratory Data Analysis for
the patient-health.csv Data Set.
Hint: The Statistics Tab and the Chart Tab in RapidMiner provide descriptive statistical
information and useful charts like Barcharts, Scatterplots etc. You might also like to look at
running some correlations and chi square tests to indicate which variables you consider to
be the top five key variables and which contribute most to determining whether a patient is
healthy. Note in completing Task 2.1 you will find it useful to refer to the data dictionary for
the patient-health.csv data set provided in this document which defines each of the variables
in terms of their data type and range of values.
Briefly discuss the key results of your exploratory data analysis presented in Table 2.1 and
the rationale for why you have selected your five top variables for predicting Patient Health.
(About 250 words)
Task 2.2) Build a Decision Tree model for predicting Patient Health using RapidMiner
and an appropriate set of data mining operators and a reduced patient-health.csv data set
determined by your exploratory data analysis in Task 2.1. Provide these outputs from
RapidMiner (1) Final Decision Tree Model process, (2) Final Decision Tree diagram, and (3)
Decision Tree rules for Task 2.2.
Briefly describe your final Decision Tree Model Process, and discuss the results of the
Final Decision Tree Model drawing on the key outputs (Decision Tree Diagram, Decision
Tree Rules) for predicting Patient Health and relevant supporting literature on the
interpretation of decision trees (About 250 words).
Include all appropriate RapidMiner outputs such as RapidMiner Processes, Graphs and
Tables that support the key aspects of your exploratory data analysis and decision tree model
analysis of the data set in your Assignment 2 report. Note you need export these outputs
from RapidMiner using the File/Print/Export Image option and where relevant include
in Task 2 and/or in Appendix A of the Assignment 2 report.
Table 1 Patient Health Data Set Data Dictionary
Variable Name Type and description of variable Range of values
1. Patient_id Integer Patient Id Range 1 to 20,000
2. genhealth Polynominal, Health Rating of each patient Poor, Fair, Good, Very Good,
Excellent
3. exerany Integer, does the patient exercise? 1 or 0
4. hlthplan Integer, Health insurance plan? 1 or 0
5. smoke100 Integer, Smoker? 1 or 0
6. height Integer, height in inches of patient Height range in inches
7. weight Integer, weight in pounds of each patient? Weight range in pounds
8. wtdesire Integer, desired weight of each patient can be
used to calculate if a patient is overweight etc
Desired weight of each patient
in pounds
9. age Integer Age of each patient
10. gender Polynominal, Gender of each patient M = Male; F = FemaleTask 3 Reports using Tableau Desktop (Worth 25 Marks)
The following web site was used to construct a list of people who died while climbing Mount
Everest. https://en.wikipedia.org/wiki/List_of_people_who_died_climbing_Mount_Everest
contained in MtEverestDeaths.xlsx. This data set contains the following variables, note there
are missing values in this data set.
Variable Name Type and description of variable Range of values
No_Deaths Integer running count of deaths
recorded in climbing Mt
Everest
Name Character String, Name of person who died
on Mt Everest
Range of names of people
Age Integer, age when person died on Mt Everest Range of ages in integer values
Expedition Character String, Name of Mt Everest
Climbing Expedition
Range of Expeditions
Nationality Character String, Nationality of Person who
died on Mt Everest
Range of Nationalities
Cause_Death Character String, Cause of Death Range of causes of death
Location Location of death on Mt Everest Range of locations on Mt
Everest
With the MtEverestDeaths.xlsx data set use Tableau Desktop to produce three different views
of Deaths that have occurred in expeditions attempting to climb Mt Everest. Note you will
find it useful to do some research on the Deaths that have occurred in expeditions attempting
to climb Mt Everest in order to comment on each of three views of Deaths on Mt Everest that
you will need to create for Task 3.1, 3.2 and 3.3.
Task 3.1) Create a view of the Deaths on Mt Everest in a Tableau Text Table or Graph view
that displays by Nationality, No_Deaths over time (in years). Comment on the key trends
and patterns that are apparent in this Text Table/Graph (about 50 words).
Task 3.2) Create a view of the Deaths on Mt Everest in a Tableau Text Table or Graph view
that displays by Location, Cause of Death, and Nationality. Comment on the key trends and
patterns that are apparent in this Text Table/Graph (about 50 words).
Task 3.3) Create a view of the Deaths on Mt Everest in a Text Table or Graph view that
displays by Expedition, Cause of Death and Nationality. Comment on the key trends and
patterns that are apparent in this Text Table/Graph (about 50 words).
Note that you need provide a copy of each Text Table or Graph view in your Assignment 2
report for the relevant sub Tasks 3.1, 3.2 and 3.3. The Tableau Menu / Worksheet and then
Copy or Export Options will allow you to copy and paste the view for each Text table or
Graph into relevant section of Task 3 for your Assignment 2 report.
Note you need to provide a copy of your Assignment 2 Task 3 Tableau file in twbx format as
part of your Assignment 2 submissionYour assignment 2 report must be structured in report format as follows:
Title Page for Assignment 2 Report
Table of Contents
Body of report – main sections and subsections for assignment 2 tasks and sub tasks so
Task 1 will be a main heading with appropriate sub headings etc....for each sub task
such as Task 1.1, Task 1.2 etc..
Task 2 …
Task 3 ….
List of References
List of Appendices
You need to submit two files for Assignment 2:
1. Assignment 2 Report for Tasks 1, 2 and 3 in Word document format with the
extension .docx
2. Tableau packaged workbook with the extension .twbx contains three required Text Table /
Graph views for Task 3
Use the following file naming convention:
1. Student_no_Student_name_CIS8008_Ass2.docx and
2. Student_no_Student_name_CIS8008_Ass2.twbx
Harvard referencing resources
Install a bibliography referencing tool – Endnote which integrates with your word processor.
http://www.usq.edu.au/library/referencing/endnote-bibliographic-software.
USQ Library how to reference correctly using the Harvard referencing system
https://www.usq.edu.au/library/referencing/harvard-agps-referencing-guide