Loading...

Messages

Proposals

Stuck in your homework and missing deadline? Get urgent help in $10/Page with 24 hours deadline

Get Urgent Writing Help In Your Essays, Assignments, Homeworks, Dissertation, Thesis Or Coursework & Achieve A+ Grades.

Privacy Guaranteed - 100% Plagiarism Free Writing - Free Turnitin Report - Professional And Experienced Writers - 24/7 Online Support

Aeb3133

17/12/2020 Client: saad24vbs Deadline: 7 Days

--- title: "Lab 6" author: "Your name here" date: "Write the date here" output: pdf_document --- The function regsubsets() in the library `leaps` can be used for regression subset selection. Thereafter, one can view the ranked models according to different scoring criteria by plotting the results of regsubsets(). Before using the function for the first time you will need to install the `leaps` package. One way to do this is to type `install.packages("leaps")` in the console window. It is probably best not to put this command in your R Markdown file since you only need to run the command once (not every time you knit your document). Recall that in R Markdown we can display R code without running it by placing `eval=FALSE` at the top of the R code chunk. ```{r eval=FALSE} install.packages("leaps") ``` If you have installed the package correctly then the following commands should load in the `leaps` and `MASS` packages. ```{r} library(leaps) library(MASS) ``` IF YOU GET AN ERROR IN THE PREVIOUS R CODE CHUNK PLEASE REREAD THE ABOVE PARAGRAPHS. ## Example 1: Best-subsets Regression The `Boston` dataset within the `MASS` package contains information collected by the U.S Census Service concerning housing in the area of Boston Massachusetts. It was obtained from the StatLib archive (http://lib.stat.cmu.edu/datasets/boston), and has been used extensively throughout the literature to benchmark algorithms. There are 14 attributes in each case of the dataset. They are: * CRIM - per capita crime rate by town * ZN - proportion of residential land zoned for lots over 25,000 sq.ft. * INDUS - proportion of non-retail business acres per town. * CHAS - Charles River dummy variable (1 if tract bounds river; 0 otherwise) * NOX - nitric oxides concentration (parts per 10 million) * RM - average number of rooms per dwelling * AGE - proportion of owner-occupied units built prior to 1940 * DIS - weighted distances to five Boston employment centres * RAD - index of accessibility to radial highways * TAX - full-value property-tax rate per $10,000 * PTRATIO - pupil-teacher ratio by town * B - 1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town * LSTAT - % lower status of the population * MEDV - Median value of owner-occupied homes in $1000's ```{r} summary(Boston) dim(Boston) ``` Use the median home value (`medv`) as the dependent variable and determine which of the five independent variables (`nox`, `rm`, `age`, `crim`, `tax`) should be included in the regression model using the`regsubsets` command to perform "Best-subsets Regression". ```{r} BSR=regsubsets(medv~nox+rm+age+crim+tax, data=Boston) ``` Once we have used the `regsubsets` command we can view a plot that shows us which variables are significant according to the adjusted R-square. ```{r} plot(BSR, scale="adjr2") ``` Here black indicates that a variable is included in the model, while white indicates that they are not. The model containing all variables minimizes the adjusted R-square criteria as can be determined by looking at the top row of the plot. We can see that the regression equation with the highest adjusted R-square includes every variable except `nox`. We can verify that the best number of variables is 4 by using the following code: ```{r} reg.summary=summary(BSR) which.max(reg.summary$adjr2) ``` The resulting best fit using 4 variables can be shown using the `coef` function: ```{r} coef(BSR,4) ``` The predict command is not set up to handle the `regsubsets` so we create a function that predicts for us. ```{r} predict.regsubsets = function(object, newdata, id, ...) { form = as.formula(object$call[[2]]) mat = model.matrix(form, newdata) coefi = coef(object, id) mat[, names(coefi)] %*% coefi } ``` Here we will make a prediction based on the original data set with the number of variables set to 4. ```{r} pred=predict.regsubsets(BSR,Boston, id=4) head(pred) ``` ## Example 2: One variable-at-a-time Models As you should recall from class going through every unique model can be computationally expensive. Therefore, we can consider some other methods that consider adding or subtracting one variable-at-a-time. The methods that consider one-variable-at-a-time are: * Backward elimination * Forward selection * Stepwise regression. The above three methods are useful when the number of dependent variables is large and it is not feasible to fit all possible models. In this case, it is more efficient to use a search algorithm (e.g., Forward selection, Backward elimination and Stepwise regression) to find the best model. The R function step() can be used to perform variable selection. To perform forward selection we need to begin by specifying a starting model and the range of models which we want to examine in the search. ```{r} null=lm(medv~1, data=Boston) null ``` ```{r} full=lm(medv~nox+rm+age+crim+tax, data=Boston) full ``` We can perform forward selection using the command: ```{r} step(null, scope=list(lower=null, upper=full), direction="forward") ``` This tells R to start with the null model and search through models lying in the range between the null and full model using the forward selection algorithm. According to this procedure, the best model is the one that includes the variables Taxes, Size, Lot and Baths. We can perform backward elimination on the same data set using the command: ```{r} step(full, data=Boston, direction="backward") ``` and stepwise regression using the command: ```{r} stepwise=step(null, scope = list(upper=full), data=Boston, direction="both") stepwise ``` All of the algorithms yield equivalent results in this simple example. Let's make a prediction based on the stepwise model. ```{r} pred_stepwise=predict(stepwise,Boston) head(pred_stepwise) ``` ## Problem 1: Carseat Sales Revisited Again Your challenge is to use the data from the inclass Kaggle challenge (https://www.kaggle.com/t/8f161e4d717f443a8bfecae2de5bf872). First read in the data and get rid of the observation IDs. ```{r} data1=read.csv("Carseats_training.csv") data1$ID=NULL ``` Now run the following command to perform best-subsets regression. ```{r} BSR1=regsubsets(Sales~., data=data1) ``` a. Make a plot of the results according to adjusted R-square. (5 points) ```{r} # Your code here ``` b. Find out how many variables are used in the best subset according to the adjusted R-square. (5 points) ```{r} # Your code here ``` c. State the coefficients of the model in the aforementioned best subset. (5 points) ```{r} # Your code here ``` d. Perform stepwise regression on this data. (5 points) ```{r} # Your code here ``` e. Compare the variables selected from stepwise regression versus the variables selected from the best subsets approach. Which model would you trust the most? Why? (5 points) *Your answer here.* f. Make predictions on your test set and upload the results to kaggle. (5 points) ```{r} test=read.csv("Carseats_testing.csv") test$Sales=rep(1,80) # Your code here ``` g. Report your RMSE as measured by Kaggle. (7 points) *Your answer here.* h. Is your submission submitted properly submitted to Kaggle? You will be graded based upon whether or not your name is on the leaderboard with a RMSE that is consistent with what you reported in the previous part. (7 points) ## Problem 2: Restaurant Revenue Prediction Revisited Your challenge is to use the data from the inclass Kaggle challenge (https://www.kaggle.com/t/b97c47b1cc7742b2b0920b078b04d88e) and make a submission. Read in the data. ```{r} data2=read.csv("RR_train.csv") ``` a. Fit a backward elimination model. (5 points) ```{r} # Your code here ``` b. Predict the test set using this model and upload to kaggle. (5 points) ```{r} # Your code here ``` c. What is your resulting RMSE from the previous part as reported by kaggle? (7 points) *Your answer here.* d. Did you make a submission properly to Kaggle? You will be graded based upon whether or not your name is on the leaderboard with a RMSE that is consistent with what you reported in the previous part. (7 points) e. Fit a model using best-subsets regression. (5 points) ```{r} # Your code here ``` f. Predict the test set using the model with the highest adjusted R-square and upload to kaggle. (8 points) ```{r} # test2$revenue=rep(1,37) # Your code here ``` g. What is your resulting RMSE from the previous part as reported by kaggle? (7 points) *Your answer here.* h. Did you make a submission properly to Kaggle? You will be graded based upon whether or not your name is on the leaderboard with a RMSE that is consistent with what you reported in the previous part. (7 points) ## Problem 3: Markdown Now you will modify your document so that it is in pristine format. Knit the document as a pdf. You will need to submit this file and the pdf you created. Submit these two files to blackboard. Partial credit will not be given for this problem. (5 points)


Applied Sciences

Architecture and Design

Biology

Business & Finance

Chemistry

Computer Science

Geography

Geology

Education

Engineering

English

Environmental science

Spanish

Government

History

Human Resource Management

Information Systems

Law

Literature

Mathematics

Nursing

Physics

Political Science

Psychology

Reading

Science

Social Science

Home

Blog

Archive

Contact

google+twitterfacebook

Copyright © 2019 HomeworkMarket.com

Homework is Completed By:

Writer Writer Name Amount Client Comments & Rating
Instant Homework Helper

ONLINE

Instant Homework Helper

$36

She helped me in last minute in a very reasonable price. She is a lifesaver, I got A+ grade in my homework, I will surely hire her again for my next assignments, Thumbs Up!

Order & Get This Solution Within 3 Hours in $25/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 3 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 6 Hours in $20/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 6 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

Order & Get This Solution Within 12 Hours in $15/Page

Custom Original Solution And Get A+ Grades

  • 100% Plagiarism Free
  • Proper APA/MLA/Harvard Referencing
  • Delivery in 12 Hours After Placing Order
  • Free Turnitin Report
  • Unlimited Revisions
  • Privacy Guaranteed

6 writers have sent their proposals to do this homework:

Best Coursework Help
Helping Hand
Homework Guru
Top Essay Tutor
University Coursework Help
Writer Writer Name Offer Chat
Best Coursework Help

ONLINE

Best Coursework Help

I am an Academic writer with 10 years of experience. As an Academic writer, my aim is to generate unique content without Plagiarism as per the client’s requirements.

$35 Chat With Writer
Helping Hand

ONLINE

Helping Hand

I am an Academic writer with 10 years of experience. As an Academic writer, my aim is to generate unique content without Plagiarism as per the client’s requirements.

$35 Chat With Writer
Homework Guru

ONLINE

Homework Guru

Hi dear, I am ready to do your homework in a reasonable price and in a timely manner.

$37 Chat With Writer
Top Essay Tutor

ONLINE

Top Essay Tutor

I have more than 12 years of experience in managing online classes, exams, and quizzes on different websites like; Connect, McGraw-Hill, and Blackboard. I always provide a guarantee to my clients for their grades.

$40 Chat With Writer
University Coursework Help

ONLINE

University Coursework Help

Hi dear, I am ready to do your homework in a reasonable price.

$37 Chat With Writer

Let our expert academic writers to help you in achieving a+ grades in your homework, assignment, quiz or exam.

Similar Homework Questions

Bsn essentials - Recent changes and challenges in international financial management - Fetch decode execute cycle - Give up and settle nigahiga - Crazy love study guide chapter 1 - Describe two cryptographic applications and how they are used in Information System Security. - Igcse french past papers 2009 - Windshield survey powerpoint presentation - Which circuit below would have the lowest voltmeter reading - Legacy of the first blade magistrate or doctor - The revenue account fees income is closed by debiting - Swatch annual report 2015 - Homework - Lumiere de vie facial cleanser - Oft agency agreement fact sheet - TAE40116 - Pool fence non-climbable zone vic - Johari window assignment - Network troubleshooting questions and answers - Signature assignment challenges of expansion to a foreign location - Twisted pair coaxial fiber optic - How do spines protect ocean stickleback fish - Engl 102 test 2 answers - Milestone 1 discussion 2 - Web analytics at quality alloys inc case solution - What are the elements of informed consent - Surface level or surface-level - Essay assignment- how Blockchain technology can be implemented to reduce ad fraud and cost while meeting consumers’ increasing expectation for excellent customer service? - Fin 534 homework set 1 - John barton looking for alibrandi - Ceo and cfo certification requirements - Describe the tone of gandhi's chapter on his child marriage - History 2A - Indicate which person made each statement based on the fotonovela video. - Math 123 quantitative reasoning ivy tech - Urgent - Mil dtl 24308 glenair - Recrystallization of aspirin method - Reefer container power supply - Hysteria in the dressmaker - Goods receiving process flow chart - Max challenge training program pdf - Str/581 - Types of chemical reactions lab report answers - 121 bus route schedule - Nursing care plan for self mutilation - Micro environment of apple company - Medical terminology crossword puzzle answers - The Body Rituals of the Nacirema - How to interpret binary logistic regression in spss - Business MBNQA - Asian united food service woolner nt - Philosophy unit 7 Discussion - Country of origin codes - 978 0 07 340326 7 - Maslow's hierarchy of needs project - 1100 u gs3 24 - Healthcare reimbursement - Algebra even and odd numbers hands on lesson 1.1 - Pre socratic philosophers were primarily concerned with - Grammar review packet 6 answers - Sho me led lights - Am i addicted to sexting quiz - How to do a chapter outline - Post-Tania - 1700 hours in 12 hour time - Building rapport in Negotiation - Complex motor stereotypy autism - Fundamentals of health HW2 - Sudarshan kriya side effects - Chapter 2 conceptual framework for financial reporting - Cushing ok wti spot price fob - What is the scholar practitioner model - Determination of the ka for a weak acid lab - Valentine carol ann duffy - Safe Drivers Dubai | Reliable Chauffeur Service - Mallard brand quick cooking rolled oats - Case Study 6 - Examples of bad hospitality in the odyssey - Excel 2019 skills approach ch 3 challenge yourself 3.3 - Jetblue airways ipo valuation excel - Jon adgemis net worth - Converting watts to amps - What is maize offal - W5 continuing - Applying Writing Strategies - SPL model - Al khwarizmi borrowed the zero to nine decimal system from - Usted: la computadora suya; la suya - Weekly tax tables 2021 - Math Questions - Mondale winery depreciates its equipment using the group method - They say i say is college the best option - Great homer street market - SOCW 6060 - Hilton hotels customizes rooms and lobbies according to location - Telemation e crm - X2 DISCUSSION QUESTIONS (200 WORDS, 1 CURRENT SOURCE) APA - Andre malraux museum without walls pdf - Manage risk