Assignment title: Information


CIS8701 – Big Data Visualization Assessment 2 – weight 30% Due on 19 April 2017 Specification: For the purpose of this assignment, you will be focusing on open data visualization aspects. We discussed Visualization in three distinct stages – data management, data modeling and visualization. We also understood that these three stages can further be grouped into discovery, wrangling, profiling, modeling and reporting. The five groups of activities can be distilled into 1. Data collection and curation. 2. Data pre-processing. 3. Data analysis. With this context, your specific task is to discuss your experience gained in understanding the above concepts – through the reading materials provided in the lecture and workshop contents in weeks 1 – 5. For the purpose of this assessment, you have to use Tableau a data visualization software-­­‑that can be installed using the provided link on study desk. The management report will be provided to the CEO of your organization. The CEO is very keen to explore how visualization can be improved in the organization, especially in preparing executive level presentations, as there appears to be some challenges in conveying a 'uniform' message through charts, reports and graphs. The CEO read the visualization aspects and Big Data and thought this is a great area and commissioned you as a special project officer to provide strategic advice to the organization on how visualization can be handled in the organization that uses both qualitative and quantitative data, in big volumes. Please note that the organization uses cloud technology for their data storage, and the data contains numbers, texts and graphics. 1. What type of pre-processing required using the software application that you explored for the data? The qualitative data in the organization is derived from reports that contain 700 – 800 pages, and the report is produced to senior divisional managers. The CEO wants to shorten the duration for the meeting and thought a visual presentation, based on the qualitative data would achieve this objective. Note: Dummy data source for tableau is been provided.  Create a graph using Tableau that shows a trend analysis for sales by Product category over the year 2010 to 2012.  Create a geographical map presentation using Tableau showing graphically the relative size by city with each state for year 2012.  Create graph using Tableau that show Product category average profit and Total sales for each month over the year 2010 to 2012.2. In one of the lecture content, we discussed that 'executives need to be familiar with methodologies involved in the analytics'. Discuss three suit methodologies required for the data analytics. 3. Discuss various visualization techniques that would be relevant to this organization so that 'efficiency' gains can be realised, and also state three challenges which can affect organization in managing big data technology.For the purpose of the report, you can choose any format, but the report should; not exceed 3000 words, excluding appendix, references and other attachment that you would like to present. You need to systematically address these key points, and provide evidence from extant literature as how other organizations have managed similar situations. This warrants reading beyond the materials provided. Marking Criteria: You need to ensure that the document you prepared is original and is your own work. If the marker finds any similarity with other source, you will be awarded with zero marks and the case will be referred to the Faculty for further investigation. If you need to check your work through a plagiarism application, please do so on your own. Item Quality Range Score Range Demonstration in depth content understanding through a well laid out, articulated report. Poor -­­‑ High 0 -­­‑ 20 Discussion on various methodologies and justifying relevance and appropriateness of methodologies of your choice to suit the given context. Poor -­­‑ High 0 -­­‑ 20 Evidence on the appropriateness of visualization techniques considered by you, justification for consideration and relevance of techniques considered. Poor -­­‑ High 0 -­­‑ 20 Detailed discussion on how your techniques and methodologies would result in efficiency gains, and the challenges which can affect the organization in managing big data. Poor -­­‑ High 0 -­­‑ 20 Discussion on how the chosen methodologies would lead to analytics, and then visualization to minimize non uniform understanding of data. Poor -­­‑ High 0 -­­‑ 20 The above marking criteria are given to provide an initial scope to indicate where the marking concentration is going to be. The real scores will reflect the quality of evidence provided, the effort put in in developing the evidence, the quality of discussion using proper support from quality references, demonstration of your ability to source quality materials and reflection of reading these materials in providing the arguments, meaningful discussion, and a level of writing that is suitable for a 8000 level course. All assessments should be submitted via the USQ system, and a single 'pdf' file is to be used. Use of other file type is strictly prohibited as this makes distribution to markers difficult.