subject

The dataset ToyotaCorolla. csv contains data on used cars on sale during the late summer of 2004 in the Netherlands. It has 1436 records containing details on 38 attributes, including Price, Age, Kilometers, HP, and other specifications.

Explore the data using the data visualization capabilities of R. Which of the pairs among the variables seem to be correlated?

We plan to analyze the data using various data mining techniques described in future chapters. Prepare the data for use as follows:

The dataset has two categorical attributes, Fuel Type and Metallic. Describe how you would convert these to binary variables. Confirm this using R’s functions to transform categorical data into dummies.

Prepare the dataset (as factored into dummies) for data mining techniques of supervised learning by creating partitions in R. Select all the variables and use default values for the random seed and partitioning percentages for training (50%), validation (50%). Describe the roles that these partitions will play in modeling.

# how many total rows are there in the origDummies. df?

# randomly select row numbers for the training partition

# randomly select row numbers for the validation partition: sample from (all rows - training rows)

# Now create the train. data and valid. data dataframes

# 3) Propose three variables that could be used in a linear regression model

# 4) Create a linear regression model on the training dataset using variables
# Age, Kilometer and Manufacturer's Guarantee to predict "Price"
# use "reg" as the name of the model

# See the predicted values and actual values side by side

# plot the residuals

# see the model coefficients and their statistical significance# compute accuracy on a training set

# use the model to make a prediction using new data

# What could be your price estimate for a car 30 months old, 22000 kilometers,
# no manufacturer guarantee?

ansver
Answers: 2

Other questions on the subject: Computers and Technology

image
Computers and Technology, 22.06.2019 21:30, aesthetickait
How do you take a green screen out of the video while editing?
Answers: 2
image
Computers and Technology, 24.06.2019 00:30, sethhdoty
Use the keyword strategy to remember the meaning of the following word. the meaning for the word has been provided. write your keyword and describe the picture you would create in your mind. obfuscate: to make something so confusing that it is difficult to understand.
Answers: 2
image
Computers and Technology, 24.06.2019 05:30, roderickhinton
Someone plzz me which of these defines a social search? a. asking a search engine a question that is answered by a real person on the other sideb. modifying search results based on popularity of a web pagec. modifying search results based on a ranking of a web page
Answers: 2
image
Computers and Technology, 24.06.2019 18:30, millie54
How does the use of e-mail benefit business communications? it can be sent at any time. it is faster than regular mail. it improves writing skills. it is less expensive than using a courier. it reduces the need for proofreading.
Answers: 1
You know the right answer?
The dataset ToyotaCorolla. csv contains data on used cars on sale during the late summer of 2004 in...

Questions in other subjects:

Konu
History, 26.08.2019 08:30