Module 1&2 Workshops and Assignment Copyright © Jacob L. Cybulski
M2&3 A ssignme nt Preview: SAS EM , Regress ion , NN , DTs
The following mini-case study will be used in your group assignment A2. Groups are up to 3 members (1-2-3).
An independent online business Best Iowa Buys have setup a members-only service to predict the likely value of real-estate put up for auction. They have collected sample data about property sales in Ames, Iowa (USA), in CSV format. The data consists of 2,930 records of properties sold from 2006 till late 2010 in Ames, each described with 79 variables.
You have been asked to develop an analytics solution to estimate the price of Ames properties advertised for auction. They are also interested in the property classification in terms of its affordability within its category or group, and its value for money for the potential buyer. Select the best predictive model and provide a summary of the model and its performance.
1
Software: Access SAS EMiner on Deakin Apps On Demand prior to classes Explore: Learn SAS EMiner, explore the data, tabulate and chart your findings Cluster Analysis: Perform cluster analysis and segmentation of your data to identify categories or groups of the Ames properties that could be used to guide the customer buying choices Modelling and Evaluation: Create a number of predictive models (e.g. Neural Nets, Regression and Decision Trees) based on the available data, evaluate, possibly using cross-validation techniques, and optimise their performance Model Integration: Create an analytic solution, possibly involving ensemble models, integrating and evaluating the models developed so far Submission: Present, explain and justify all results and recommendations, report your final results and final findings as a team via CloudDeakin