Assignment title: Information


Descriptive Analytics and Visualisation Assignment two – Trimester 2, 2016 Page 1 DEAKIN UNIVERSITY FACULTY OF BUSINESS & LAW DEAKIN BUSINESS SCHOOL DEPARTMENT OF INFORMATION SYSTEMS AND BUSINESS ANALYTICS MIS771 Descriptive Analytics and Visualisation Assignment Two Background This is an individual assignment, which requires you to analyse a given data set, interpret and draw conclusions from your analysis, and then convey your conclusions in a written report to a person with little or no knowledge of Business Analytics. Percentage of final grade 40% Due date Monday 19th September 2016 11.59pm The assignment must be submitted by the due date electronically in CloudDeakin. When submitting electronically, you must check that you have submitted the work correctly by following the instructions provided in CloudDeakin. Please note that we will NOT accept any hard copies or part of the assignment submitted after the deadline or via Email. No extensions will be considered unless a written request is submitted and negotiated with Dr Dilal Saundage prior to Thursday 15th September 2016, 5pm. Please note that assignment extensions will only be considered if you attach your draft assignment with your request for an extension. The assignment uses the file Supermart.xlsx which can be downloaded from CloudDeakin. Analysis of the data requires the use of techniques studied in both Modules 1 and 2.Descriptive Analytics and Visualisation Assignment two – Trimester 2, 2016 Page 2 Assurance of Learning This assignment assesses following Graduate Learning Outcomes and related Unit Learning Outcomes: Graduate Learning Outcome (GLO) Unit Learning Outcome (ULO) GLO1: Discipline‐specific knowledge and capabilities: appropriate to the level of study related to a discipline or profession. GLO5: Problem Solving: creating solutions to authentic (real world and ill define) problems. ULO1: Create solutions by applying data analysis and statistical techniques and skills to practical business, economic and personal decision‐making and research. ULO2: Use computer software to analyse data and understand the limitations of such software. ULO3: Deduce clear and unambiguous solutions in a form that they useful for decision making and research purposes and for communication to the wider public. ULO4: Appraise the accuracy, sources and relevance of statistical data and models. ULO5: Create and appraise solutions, reports and arguments based on statistical data and models. Feedback prior to submission Students are able to seek assistance from the teaching staff to ascertain whether the assignment conforms to submission guidelines. Feedback after submission Your assignment feedback will be returned in a rubric via CloudDeakin with an overall mark together with comments. In order to understand any areas of improvement, students are expected to refer, and compare, their answers to the suggested solutions. Case Study Supermart is one of Australia's leading supermarket chains. There are 700 stores in the chain. Originating from a family based chain of general stores, Supermart now has stores all over Australia, with the first one being established 27 years ago. In 2015 the company launched its online store to enable customers, in selected suburbs, to make their purchase online. In terms of operation, each state capital has a company office and they have significant autonomy in the individual state's operations. Further, individual store management has wide‐ranging powers about day‐to‐day operations of their stores. However, broad company planning and direction take place in the company Head Office in Melbourne. Included in the Head Office, is the Research and Analysis Department. Having compared the company's position in the market and against the competitors, the General Manager ‐ Michael Braithwaite has asked you, Ruby Rose, to conduct exploratory and descriptive analysis to gain a better understanding of Sales of Supermart.Descriptive Analytics and Visualisation Assignment two – Trimester 2, 2016 Page 3 The Data The data relates to a random sample of 150 stores in the Supermart chain. The survey is conducted every year to gather information that is not readily available from the central sources. The variables in the data table are described below: Variable Name Description Store No. Unique ID of the store Sales $m Total Sales revenue for each store for the financial year ($ million) Wages $m Total Wage and salary bill for the financial year ($million) No. Staff The number of effective full‐time staff employed on a weekly basis Av. Wage The average annual wage/salary per effective full‐time staff member. GrossProfit $m Gross Profit for each store for the financial year ($ million) Adv.$'000 Advertising and promotional expenses for the financial year ($'000) Competitors The number of competing stores in the consumer catchment area HrsTrading The total number of hours open for trading per week Sundays Open on Sundays; Close on Sunday Mng‐Gender Male store manager; Female store manager Mng‐Age Age of the store manager, years Mng‐Exp No. of years of experience in some form of junior/senior management at Supermart Mng‐Train No. of management training courses taken while employed at Supermart Union% The proportion of the staff at the store which belongs to a union Car Spaces The number of parking spaces available to the store Online Channel Whether or not the store has an online channel OnlineSales$m Sales revenue from Online Sales Channel ($ million) Basket:2015 Cost ($) of the basket of food items in each store at 1 June 2015 Basket:2016 Cost ($) of the basket of food items in each store at 1 June 2016 %Change Annual percentage increase of the cost of the basket of food items in each store In addition, time series data is available on Quarterly Sales Time Period Time Period Index Quarter Quarter Description Total Sales($m) Total Sales of Supermart in $mDescriptive Analytics and Visualisation Assignment two – Trimester 2, 2016 Page 4 Email from Michael Braithwaite To: Ruby Rose From: Michael Braithwaite – General Manager Subject: Analysis of Sales Attachment: Appendix – A (Explanatory Notes) Dear Ruby, Thank you for your earlier responses to my email as they were very insightful. Your responses got me thinking about few more issues about our financial position, especially Sales. 1. Can you provide an overall summary of Sales? 2. I presumed that our Sales differ significantly between stores. I would like a report on main factors that influence Sales. 3. Richard tells me that you are very good at model building. Are you able to build a model for Sales as I am very keen to understand what other factors influence Sales? Hopefully, your model will also provide me with an opportunity to predict Sales for a given scenario. 4. We have been tracking our quarterly Sales for a while. I would like you to develop a multiplicative time‐ series model to forecast Sales for the next 4 quarters. 5. I would also like to formalise what we have done to date so that we can repeat this exercise next year: Can you critically evaluate what we have done so far? In your opinion are we doing the right type of analysis? Are there any alternative approaches? Are there any other sources of data we can use? I look forward to reading your report. Sincerely Michael BraithwaiteDescriptive Analytics and Visualisation Assignment two – Trimester 2, 2016 Page 5 Appendix – A (Explanatory Notes) In order to prepare a reply to Michael's email, you will need to examine and analyse the dataset, Supermart.xlsx, thoroughly. The following are some guidelines to follow. Task One – Summary of Sales Only analyse Sales by itself. The importance of other variables is considered in other tasks. You should, at the very least, thoroughly investigate relevant summary measures (and their reliability) for this variable. Also, there may well be suitable tables and graphs that will illustrate, further and more clearly, other important features of sales. In your report you should comment, where relevant, on data location, central tendency, variability, shape and outliers for this variable. Reference: Module 1 – Topic 2 Task Two – Factors influencing Sales Analyse Sales of stores against other variables included in the data set. Use appropriate descriptive techniques such as cross‐tabulations, comparative summary measures, scatter diagrams to explore key relationships. In your report you should only include the most important factors that impact Sales. Reference: Module 1 – Topic 3 Task Three – Development of a multiple regression model You should follow the model building process outlined in topic 5. You are only required to consider linear relationships in the model. Each stage of developing your model should be included in your analysis. You will notice in the Supermart Excel file that there are separate worksheets (tabs) named Q3‐1, Q3‐2, etc. These are where you place each version of your model. Note that if you have undertaken more iterations of the model then add more worksheets as required. Minimum requirements for the final model:  At least two independent variables  One categorical independent variable In your written report, please provide an interpretation of all elements of the final model – (for e.g. a practical interpretation of the regression equation, description of the explanatory / predictive power/ overall strength of the models as well as the significance of each individual independent variable.) Reference: Module 2 – Topics 4 and 5 Task Four – Time Series analysis Quarterly Total Sales for Supermart from Q2, 2012 to Q1, 2016 are given in the QtrSales worksheet. Develop a multiplicative time series model to forecast Sales for the next 4 quarters (Q2, 2016 to Q1, 2017). If the observed values for those 4 quarters are as below, calculate the MAPE of the forecast. Time Period Quarter Observed 17 2016‐Q2 95 18 2016‐Q3 132 19 2016‐Q4 150 20 2017‐Q1 109Descriptive Analytics and Visualisation Assignment two – Trimester 2, 2016 Page 6 In your written report, please provide an interpretation of the final model – (for e.g. a practical interpretation of the time series model, measures of forecast accuracy, choices about exponential smoothing constant). Reference: Module 2 – Topic 6 Task Five – Critique the Business Research Approach Discuss the suitability of the general business research approach taken. In your response, include possible alternative approaches and other sources of (secondary) data. If the analysis was to be repeated in the future, would you recommend a different approach? Note that no actual analysis is required for this task. Reference: Module 1 – Topic 1 Submission The assignment consists of two parts: Analysis and Report. You are required to submit both your written report (approx. 2000 words) and analysis (in Excel).  Analysis The analysis should be submitted in the appropriate worksheets in the Excel file. Each step in the model building for task three should be included in the tabs Q3‐Correlation, Q3‐1, Q3‐2, etc. If you need more worksheets, then add them. Further instructions are included at the top of each worksheet. Before submitting your analysis make sure it is logically organised and any incorrect or unnecessary output has been removed. Marks will be penalised for poor presentation or disorganised/incorrect results. Note: Give the Excel file an appropriate name such as A2_studentID.xlsx (use a short file name while you are doing the analysis – once you complete your analysis rename the file to the format mention before).  Report The report should be written for an audience that has no, or minimal, business analytics background. You should avoid the use of technical terms. The one exception may be in task 3 and 4 as you may want to include the actual models in the report. In this report first and foremost you are required to present the insights of your analysis (tasks 1‐4) then provide a critique of the approach you adapted for this study (task 5). It is up to you how to structure and format the final report. Note: Name the report with an appropriate file name such as A2_studentID.docx.Descriptive Analytics and Visualisation Assignment two – Trimester 2, 2016 Page 7 Criteria Poor Below standard Satisfactory Good Very good Excellent Analysis Task 1 (10%) Does not use any appropriate exploratory data analysis tools. Uses irrelevant or inappropriate techniques to analyse the SALES variable and/or there are many errors in the analysis. Uses appropriate data analysis and visualisation tools to analyse the data but there are some errors in the analysis. Mostly good analysis of the SALES data using appropriate techniques but there are minor errors in the analysis. Very comprehensive analysis of the SALES data using appropriate techniques. Skilful and comprehensive analysis of the SALES data using many different techniques. Uses data visualisations/tables to produce novel insights. Analysis Task 2 (Marks: 15%) Does not use any appropriate bivariate exploratory data analysis tools. Uses irrelevant or inappropriate bivariate techniques to identify relationships and/or there are many errors in the analysis. Uses appropriate bivariate data analysis and to identify relationships but there are some errors in the analysis. Most of the important factors have been identified using appropriate techniques but there are minor errors in the analysis. All of the important factors have been identified using appropriate techniques. Skilful and comprehensive identification of all relationships using a variety of appropriate techniques. Analysis Task 3 (Marks: 20%) No correlation analysis performed and/or inappropriate model developed. Correlation analysis and regression model developed but there are many errors in the analysis. Correlation analysis identified appropriate variables and regression model was OK but there are some errors in the analysis. Correlation analysis identified important factors and model was developed in a logical fashion but there are minor errors/missing analysis. All of the important variables included (taking into account multi‐collinearity). Model developed in a logical fashion including appropriate residual analysis. Correlation and regression modelling completed in a logical and comprehensive fashion taking into consideration multi‐ collinearity, dummy variables, residual analysis, etc. Analysis Task 4 (Marks: 15%) Does not use any appropriate time series techniques. Uses irrelevant or inappropriate techniques to analyse the time series and/or there are many errors in the analysis. Time series model developed but there are some errors in the analysis. Time series model and MAPE developed but there are minor errors in the analysis. Time series and MAPE model developed correctly. Time series and MAPE model developed correctly and presented in a clear and logical fashion including visualisations.Descriptive Analytics and Visualisation Assignment two – Trimester 2, 2016 Page 8 Criteria Poor Below standard Satisfactory Good Very good Excellent Report (Marks: 40%) Does not communicate any of the main findings of the analysis in an accurate or useful way. No or poor critique of approach provided. Interpretation and communication of findings is at a basic level or does not adequately explain the main findings of the analysis. Critique of approach not explained well. Explains most of the main findings of the analysis accurately and enables reader to draw some reasonable conclusions. Critique of approach provides OK guidelines for future analysis. Provides a reasonable and accurate description of the most important features of the analysis along with appropriately qualified conclusions. Critique provides overview of potential alternatives and recommendations for fur future analysis. Provides a very detailed and accurate descriptions of the most important features of the analysis in appropriate language. Appropriately qualified conclusions are very sound and critique of approach provides good insights to alternatives and future analysis. Provides an outstanding descriptions and conclusions that are carefully considered, novel, explained in clear language and insightful. Critique provides an excellent summary of alternative approaches along with recommendations.