Assignment title: Information
1. Examining and cleaning your data.
Using appropriate methods, individually examine the following variables: load, mode, age, sex, height, weight, employ, sleep, transp, walktotal, SMed and SMedtimes to determine if there are any missing or outlier values. If necessary, delete any values as appropriate. For each variable, comment very briefly on what analysis you used, report what you found, any data edits you performed (if any) and why. (Preferably present it in a table; otherwise it must be a maximum of one sentence per variable). 2 marks
NB: You should use your cleaned data, i.e. the data with any edits performed, for the remainder of the assignment.
2. Social Media Use by students.
a) Examine the distribution of SMedtimes. Create a new variable that divides SMedtimes into intervals of 5 times/day. Produce and display a table showing the frequency, relative frequency and cumulative relative frequency. (2)
b) From this table, determine what proportion of students access social media ten (10) or more times per day. Explain how you derived your answer. (1)
c) Create and display a frequency histogram of SMedtimes that corresponds to the same groups you created in a) above. (2)
d) Summarise the distribution using appropriate statistics and describe the important features of this distribution. (Maximum 3 sentences) (2)
e) Construct a chart to display the proportion of students who report using each of the social media (SMed) platforms most often by sex. (Note: Also include those that do not use social media). Briefly describe this chart (maximum two sentences). (2) 9 marks
3. Work and study patterns of students. Your tables should not contain any superfluous information or statistics and it should be clear how your answers were calculated by referring to your tables.
a) Using employment status and study load, construct and display a two-way table that shows the number of students in each employment status category by study load. Within this table for each employment status category, show the percentage of students who are part-time and the percentage who are full-time.
What percentage of students who work full-time study full-time? (2)
b) Using employment status, study load and study mode construct and display a three-way table. Within this table show separately for each employment status category, the number of students in each study mode category by study load.
Within each employment status category, and study mode category, show the percentage of students who study part-time and the percentage who study full-time.
Among students who work full-time, what percentage of students who study part-time are studying online? (2) 4 marks
4. Physical activity of students.
Your tables should not contain any superfluous information or statistics and it should be clear how your answers were calculated by referring to your tables.
It is recommended that adults do at least 210 minutes of walking (or equivalent) per week on average.
a) Create a new variable that groups Total walk time walktotal into two categories:
1. Less than 210 minutes (does not meet recommendations),
2. 210 minutes or more (meets recommendations).
How many and what proportion of students meet the recommendation? (1)
b) Construct and display a two-way table that shows for each transportation mode (transp) the number (and proportion) of students that do, and do not, meet the recommendations for walking. (1)
c) Examine the distribution of Total walk time walktotal separated by mode of transportation group (transp) and produce ONE figure that displays all three distributions as boxplots. (1)
d) Describe and compare the distributions for each mode of transportation, commenting on achieving the recommended 210 minutes, important features of the distribution and using appropriate measures of central location and variability. (2) 5 marks
Total = 20 marks