Assignment title: Information
Question
Management
Q
Considering the two variables number of visits and number of adult materials in circula-
tion in the data _le Library2012New.sav, available on the course StudyDesk, assuming
that the data represents an SRS, answer the following questions.
(a) Find estimates of the mean (_) and standard deviation (_) of the (i) number of
visits and (ii) number of adult materials in circulation. [6 marks]
STA2300—Data Analysis 10
(b) Obtain a 90% confidence interval for the difference of means of the number of visits
and number of adult materials in circulation in the libraries. [4 marks]
(c) State the hypotheses to test, if the difference of means of the number of visits and
number of adult materials in circulation is significant. [3 marks]
(d) Find the value of the appropriate test statistic for the test in part (c). [5 marks]
(e) Obtain the P-value, and make an appropriate conclusion on the outcome of the
test. [4 marks]
(f) What assumptions are necessary for the inference procedure in part (b) to be valid?
[3 marks]
Note: In some parts of this question you may require to decide if a paired sample or
two independent samples procedure is appropriate. You will not be penalised if you
properly justify your choice and subsequent answers are correct.
Question 2 (13 marks)
A new surgical procedure is successful with probability p = 0:8. Assume that the
operation is performed five times and the results are independent of one another.
(a) What is the probability that all five operations are successful? [2 marks]
(b) What is the probability that less than two operations are successful? [3 marks]
(c) Find the mean and standard deviation of number of successful operation. [3 marks]
(d) If the procedure is performed 100 times, what is the probability that at least 95
operations are successful? [5 marks]
Question 3 (18 marks)
A Psychology research team was interested to study the reaction time of university
students. They took a random sample 100 students from across the university and
administered a series of tests to determine the reaction time. The observed mean and
standard deviation of the data are 27.35 seconds and 6.31 seconds respectively.
(a) What is the sampling distribution of the sample mean? Justify your answer. [4
marks]
(b) Find a 95% confidence interval for the mean reaction time of all university students.
[4 marks]
STA2300—Data Analysis 11
(c) Give the correct interpretation of the above confidence interval. [2 marks]
(d) Calculate the margin of error for a 99% confidence interval for the mean reaction
time. What is the width of the 99% confidence interval? [4 marks]
(e) If the population standard deviation is 8 seconds, what sample size would be re-
quired to produce a 95% confidence interval for the population mean reaction time
with a margin of error of 1.50 seconds? [4 marks]
Question 4 (17 marks)
From long term experience it is known that the time required to answer a set of 10
computer managed questions in the Data Analysis course follows a normal distribution
with mean _ = 15 minutes and standard deviation _ = 2 minutes. If a randomly
chosen o_-campus student answers a test of 10 computer managed questions, answer
the following questions.
(a) What is probability that she would complete the test in less than 14 minutes? [4
marks]
(b) What is the probability that she would complete the test between 15 and 19
minutes? [4 marks]
(c) Determine her completion time so that only 10% of the students doing the test
will take longer than her. [4 marks]
(d) For a set of 5 randomly selected tests (each with 10 questions), what is the prob-
ability that her mean completion time will be 14 minutes or more? [5 marks]
Question 5 (12 marks)
In January this year, 200 randomly selected voters in Australia were asked whether they
believed that the Government is doing a good job to protect the environment.
(a) If 156 of these 200 voters believe the Government is doing a good job, deter-
mine a 90% confidence interval for the true proportion of voters who believe the
Government is doing a good job to protect the environment. [6 marks]
(b) In previous years, approximately 70% of the the voters believed the Government
was doing a good job to protect the environment. Has the proportion of voters who
believe the Government is doing a good job to protect the environment changed?
Test this hypothesis at the 1% level. Show all your working. [6 marks]
STA2300—Data Analysis 12
Question 6 (15 marks)
Answer the following questions:
(a) In no more than 100 words, identify the problems associated with non-random
sampling. [2 marks]
(b) Based on a random sample of size n = 144 from a population with proportion
p = 0:52, explain the sampling distribution of the sample proportion. State the
name of the distribution, underlying parameter(s), and any assumptions required.
[3 marks]
(c) Based on a random sample of size n = 144 from a population with mean _ = 20 and
standard deviation _ = 6, explain the sampling distribution of the sample mean. In
your answer you may state the name of the distribution, underlying parameter(s),
and any assumptions required. [3 marks]
(d) State how you would describe any association between (i) two categorical variables,
(ii) one categorical and one quantitative variable, and (iii) two quantitative vari-
ables. [3 marks]
(e) State and explain the Central Limit Theorem (CLT) for the sample mean when the
population distribution is (i) symmetric and (ii) not symmetric. [2 marks]
(f) With appropriate examples, distinguish between paired samples and two indepen-
dent samples.