代写assignment：Statistical Analysis and Decision Making G Assignment Answer all questions in this assignment Answers should be written or typed on A4 paper and stapled at the top left hand corner. Each page should be numbered and have your ID number on it. Do NOT place your answer in a plastic sleeve or folder. The assignments are to be the individual work of the submitting student. Submitted assignments are not to be group efforts. However this statement is not meant to discourage students from discussing mutual problems that arise in answering the assignment questions. Such discussions should be verbal and not written. All calculations and computer entries should be done individually. If two or more assignments contain the same (or marginally changed) sentences and/or the same numerical errors they will be treated as a single assignment and the total marks awarded divided among the contributing students. Your assignment should be handed to Dr. Shuang Liu on 16 Dec. Late assignments, without an acceptable reason, will be penalised at a rate of 10% per day or part of day (including weekends) late. The maximum possible penalty for late assignments is 100% (i.e. a mark of zero). Note that ‘the printers were out of paper’ or other printing issues are not acceptable reasons for a late assignment. Answer all questions in this assignment. Question 1 (22 marks) Canberrans have been concerned with the consumption of water and the factors that affect the water consumption level. In a recent study, researchers wanted to know whether Canberra's daily water consumption (ML) could be predicted by the daily maximum temperature (°C). They investigated data for January, 2011. They used Excel to analyse the results. Some of the Excel output is shown below: Figure 1: Water Usage against Daily Maximum Temperature in Canberra, January, 2011 Source: The Canberra Times Figure 2: Excel Output, Regression Analysis of Water Usage against Daily Maximum Temperature in Canberra, January, 2011 a)What are the independent and dependent variables? b)Write down the estimated equation. Y= -4.76+4.92X c)For January, 2011, the average water usage was 137.1 ML. On January 10th, the daily maximum temperature was 26.5 °C and the water usage was 111 ML. For January 10th, calculate: i)the predicted water consumption ii)the residual iii)the explained deviation from the mean Explained deviation = y^ - mean y = 125.62ML - 137.1 ML = -11.48 ML#p#分页标题#e# iv)the observed deviation from the mean Observed deviation = y – mean y = 111ML – 137.1ML = 1 26.1 ML d)Draw a diagram to show how the four values you calculated in (c) are related RSS = 13468.02987 f)The value of the correlation coefficient is missing from the Excel output. Calculate the correlation coefficient and in one or two sentences, provide an interpretation of this value. SUMMARY OUTPUT Question 2 (14 marks) An internet blogger is interested in the amount of time readers spend browsing his blog during one particular visit. For each reader who visited his blog, the blogger recorded whether the amount was ‘less than 5 minutes’, ‘between 5 and 10 minutes inclusive' or ‘more than 10 minutes’. a)What is the sample space? (‘less than 5 minutes’, ‘between 5 and 10 minutes inclusive' , ‘more than 10 minutes’) b)After recording data for 400 people, the blogger found that 283 spent ‘less than 5 minutes’, 99 spent ‘between 5 and 10 minutes inclusive' and 18 spent ‘more than 10 minutes’. Construct the probability distribution for the amount of time spent browsing the blog. less than 5 minutes = 283/ 400 = 0.7075 c)In a few words, please specify the approach to probability you have used in b). There are 3 approaches to probability. They are classical, relative frequency and formula. d)What is the probability that a visitor to the blog spends ‘less than 10 minutes (inclusive)’ browsing? The probability that a visitor to the blog spends ‘less than 10 minutes (inclusive)’ browsing is 283/400 +99/400 = 0.7075+ 0.2475 = 0.955 = 95.5% e)The internet blogger also recorded whether the customer ‘clicked on an ad’ or According to question, P (AB) = 0.00185 , P (A) = 0.0025 and P (B) = 0.045. Question 3 (14 marks)#p#分页标题#e# The University of Orkutschk has collected information on the background knowledge of its MBA students. 55% of MBA students at the University have previously studied Statistics, 45% have previously studied Management and 40% have previously studied both. a)Display the above information in a Venn or tree diagram. b)What is the probability that a randomly selected MBA student has previously studied neither Statistics nor Management? 1- P(S or M) = 1- (55%-40%)(45%-40%) - 40%= 0.5925 c)An MBA student is randomly selected. It is found that this student has studied Statistics. What is the probability that this student has also studied Management? 40% / 55% = 72.72% Next door to the University of Orkutschk, lies the University of Qapla. Whilst the University of Qapla does not offer a MBA program, it does offer a 5 day intensive management training program. In this program, 60% of the trainees are female and 40% are male. Of the female trainees, 70% have previously studied Statistics and of the male trainees, 65% have previously studied Statistics. d)Display the above probabilities in a Venn or tree diagram. e)What is the probability that a randomly selected trainee is female and has studied Statistics? 0.6*0.7 = 0.42% f)What is the probability that a randomly selected trainee has studied Statistic 0.42+0.4*0.65 = 68% g)A randomly selected trainee has studied Statistics, what is the probability that the trainee is female? |