美文网首页
讲解:STAT270、Applied Statistics、R、

讲解:STAT270、Applied Statistics、R、

作者: bingernai | 来源:发表于2020-01-10 12:53 被阅读0次

    Applied StatisticsAssignment Semester 1. Due: Thursday 23rd May 5:00pm 2019You are expected to write your assignment using R Markdown (see Lecture 6) or MS Word and submit a PDF.You are required to write your name, student ID and Unit Code on the first page. You need to submit yourassignment via the provided submission link on iLearn.You may discuss the assignment in the early stages with your fellow students. However, the assignmentsubmitted should be your own individual work.The R Markdown ‘Cheatsheet’ from the RStudio team is given here.In your answers to the questions below, produce the appropriate R output and/or explanation of the steps andresults. Don’t include any more R output than necessary and include only concise explanations.Question 1 [28 marks]Since 1979, satellites have regularly measured the extent of sea ice in the Arctic Ocean. Rapid melting ofArctic sea ice is seen both as a symptom and a cause of a changing climate. The average September SeaIce Extent (in 1, 000, 000km2) are recorded for each year and the data is available in the file seaice.dat oniLearn. The variables are defined below.Extent Sea Ice Extent (in 1, 000, 000km2)Year Calendar YearUsing the data for the years 1979 to 2002 only, answer the following questions.Hint: dat1 of the original dataset (dat) and only contains data up to the year 2002.a. [3 marks] State the statistical model for a simple linear regression of Extent explained by Year. Carefullydefine all the necessary variables and parameters in your answer.b. [3 marks] A simple linear regression seems appropriate for the 1979-2002 data. Justify the use of asimple linear regression model.c. [2 marks] Fit a simple linear regression to the 1979-2002 data. Explain why there is a linear relationship.d. [2 marks] Is this a strong linear relationship? Explain your answer in the context of this data.e. [2 marks] Predict the extent of the sea ice (in km2) for the year 2000.f. [2 marks] Compute a 95% prediction band for the Extent of Sea Ice (in km2) in the year 2000.g. [2 marks] Compute a 95% confidence band for the Extent of Sea Ice (in km2) in the year 2000.h. [2 marks] Explain clearly wha代写STAT270作业、代写Applied Statistics作业、代做R语言作业、R程序设计作业代写 代写Pythot the prediction band represents and what the confidence band represents.1Using all the data for the years 1979 to 2012, answer the following questions.i. [2 marks] Justify why a simple linear regression is inappropriate for the 1979-2012 data.j. [3 marks] Fit a second order polynomial regression model to the data and validate the model.k. [1 mark] Plot the fitted polynomial to your datal. [2 marks] Using the second order model you fitted, predict the extent of the sea ice (in km2) for theyear 2000.m. [2 marks] Compare your answers in part e) and part l). Which prediction value do you recommend andwhy?2Question 2 [25 marks]A study into the quality of Portuguese Vinho Verde red wine was conducted to examine the possiblerelationship between wine quality and the chemical composition of the wine. Overall quality scores wereobtained by combining the scores from several tasters. Information was recorded on wine bottles of VinhoVerde and is availble in the file pwine.dat on iLearn. The variables are defined below.Quality Aggregated score across the taste testersAlcohol Level of alcohol in percentDensity Density or specific gravity of the winepH Acidity level of the winea. [4 marks] State the statistical model for a multiple regression with Quality as the response using allother variables as predictors, defining any parameters as necessary.b. [2 marks] Fit this multiple regression model and write down the fitted model.c. [4 marks] What are the assumptions required for a multiple regression analysis? If possible, validatethose assumptions for the multiple regression model you fitted in part b.d. [6 marks] Conduct an F-test for the overall regression i.e. is there any relationship between the responseand the predictors. Write your answer as a formal hypothesis test and include the ANOVA table (onecombined regression SS source is sufficient.)e. [3 marks] From the analysis in part b. determine the 95% CI for the Alcohol slope parameter andcomment on its meaning in this context.f. [2 marks] Using the model selection procedures used in this course, find the best multiple regressionmodel that explains the data giving reasons for your choice(s).g. [4 marks] State the final fitted regression model and comment on its interpretation.3转自:http://www.7daixie.com/2019052031703245.html

    相关文章

      网友评论

          本文标题:讲解:STAT270、Applied Statistics、R、

          本文链接:https://www.haomeiwen.com/subject/npvpactx.html