Assignment title: Information


A. Calculate the descriptive statistics from the data and display in a table. Be sure to comment on the central tendency, variability and shape for each variable. (1 Mark)B. Draw a graph that displays the distribution of admissions. (1 Mark)C. Create a box-and-whisker plot for the distribution of the top price and describe the shape. Is there evidence of outliers in the data? (1 Mark)D. What is the likelihood that the admissions are greater than 70 million if the real price of tickets exceeds $20.00? Are admissions statistically independent of price? Use a Contingency Table. (2 Marks)E. Estimate the 95% confidence interval for the population mean theatre capacity. (1 Mark)F. Your supervisor recently stated that theatre admissions from 2008 through 2014 (ie. last 8 years) have exceeded the admissions in Taiwan which have been a constant 84 million per year. Test her claim at the 5% level of significance. (1 Mark)G. Run a multiple linear regression using the data and show the output from Excel. (1 Mark)H. Is the coefficient estimate for the real ticket price in 2014 $ different than zero at the 5% level of significance? Set-up the correct hypothesis test using the results found in the table in Part (G) using both the critical value and p-value approach. Interpret the coefficient estimate of the slope. (2 Marks)I. Interpret the remaining slope coefficient estimates. Comment on whether the signs are what you are expecting. (2 Marks)J. Interpret the value of the Adjusted R2. Is the overall model statistically significant at the 5% level of significance? Use the p-value approach. (1 Mark)K. Do the results suggest that the data satisfy the assumptions of a linear regression: Linearity, Normality of the Errors, and Homoscedasticity of Errors? Show using scatter diagrams, normal probability plots and/or histograms and Explain. (3 Marks)L. Based on the results of the regressions, is it likely that other factors have influenced the theatre admissions? If so, provide a couple possible examples and indicate whether these would likely influence the regression results if they were included. (1 Mark)M. If a community housing organisation asked for information regarding the characteristics of housing targeting the households of native born Australians, explain whether a simple random sampling technique would provide an accurate representation of these households. (Note: This question does not use the data) (1 Mark)