In
a regression analysis involving observations, the following
estimated regression equation was obtained. Enter negative values as negative
numbers. a. Interpret , , , and in this estimated
regression equation. Assume that for each coefficient statement, the
remaining three variables are held constant (to 1 decimal). estimated change in per unit change
in estimated change in per unit change
in estimated change in per unit change
in estimated change in per unit change
in b. Predict when , , , and . (to 1 decimal)
|
Hide Feedback
Correct
Spring
is a peak time for selling houses. The file SpringHouses contains
the selling price, number of bathrooms, square footage, and number of
bedrooms of homes sold in Ft. Thomas, Kentucky, in
spring (realtor.com website) Click
on the datafile logo to reference the data. a. Choose the correct scatter plot of selling price versus number of
bathrooms.
The
correct graph is . Choose
the correct scatter plot of selling price versus square footage.
The
correct graph is . Choose
the correct scatter plot of selling price versus number of bedrooms.
The
correct graph is . Comment
on the relationship between selling price and these three variables. show
strong increasing relationships with selling price. The relationship
between and
selling price seems not as strong as the other two variables. b. Develop an estimated regression equation that can be used to predict
the selling price given the three independent variables (number of baths,
square footage, and number of bedrooms) (to decimals). Enter
negative value as negative number. c. It is argued that we do not need both number of baths and number of
bedrooms. Develop an estimated regression equation that can be used to
predict selling price given square footage and the number of bedrooms
(to decimals). Enter negative value as negative number. d. Suppose your house has four bedrooms and is square feet.
What is the predicted selling price using the model developed in part (c).
Use the regression coefficients rounded to decimals in your
calculations. Round your answer to the nearest dollar. $
|
Hide Feedback
Correct
Consider
the following estimated regression equation, based
on observations. The
values of SST and SSR are and , respectively. a. Find SSE (to 2 decimals). b. Compute (to 3 decimals). c. Compute (to 3 decimals). d. Comment on the goodness of fit. The
estimated regression equation .
|
Hide Feedback
Correct
Spring
is a peak time for selling houses. The file SpringHouses contains
the selling price, number of bathrooms, square footage, and number of
bedrooms of homes sold in Ft. Thomas, Kentucky, in
spring (realtor.com website) Click
on the datafile logo to reference the data. a. The Excel output for the estimated regression equation that can be
used to predict the selling price given the number of bathrooms, square
footage, and number of bedrooms in the house: SUMMARY
OUTPUT
ANOVA
Does
the estimated regression equation provide a good fit to the data? Explain.
Hint: If is greater than , the estimated regression equation
provides a good fit. The
estimated regression equation provide
a reasonable fit because the adjusted is (to decimals). b. The Excel output for the estimated regression equation that can be
used to predict selling price given square footage and the number of
bedrooms: SUMMARY
OUTPUT
ANOVA
Compare
the fit for this simpler model to that of the model that also includes number
of bathrooms as an independent variable. The
adjusted for the simpler model is (to decimals) that is than
the adjusted of the model in part a.
|
Hide Feedback
Correct
Hide Feedback Correct The
following estimated regression equation was developed for a model involving
two independent variables. After was
dropped from the model, the least squares method was used to obtain an
estimated regression equation involving only as an independent variable. a. Give an interpretation of the coefficient of in both
models. In
the two independent variable case, the coefficient represents the
expected change in corresponding
to a one unit increase in when is
held constant. In
the single independent variable case, the coefficient represents
the expected change in corresponding
to a one unit increase in . b. Could multicollinearity explain why the coefficient
of differs in the two models? If so, how? Assume
that and are correlated.
|
Hide Feedback
Correct
The
Honda Accord was named the best midsized car for resale value
for by the Kelley Blue Book (Kelley Blue Book website). The
file AutoResale contains mileage, age, and selling price for
a sample of Honda Accords. Click
on the datafile logo to reference the data. a. Develop an estimated regression equation that predicts the selling
price of a used Honda Accord given the mileage and age of the car
(to decimals). Enter negative value as negative number. b. Is multicollinearity an issue for this model? Find the correlation
between the independent variables to answer this question
(to decimals). The
correlation between age and mileage is .
I.
Since the correlation between the
independent variables is less than , we conclude that multicollinearity
is an issue.
II.
Since the correlation between the
independent variables is less than , we conclude that multicollinearity
is not an issue.
III.
Since the correlation between the
independent variables is greater than , we conclude that
multicollinearity is an issue.
IV.
Since the correlation between the
independent variables is greater than , we conclude that
multicollinearity is not an issue. c. Use the test to determine the overall significance of the
relationship (to decimals). What is your conclusion at
the level of significance? Use F table.
d. Use the test to determine the significance of each
independent variable (to decimals). What is your conclusion at
the level of significance? Use t table. Enter negative value as negative number.
|
Hide Feedback
Correct
Consider
the following estimated regression equation based on observations. a. Develop a point estimate of the mean value
of when and (to 3 decimals). b. Predict an individual value
of when and (to 3 decimals).
|
Hide Feedback
Correct
The
Honda Accord was named the best midsized car for resale value
for by the Kelley Blue Book (Kelley Blue Book website). The
file AutoResale contains mileage, age, and selling price for
a sample of Honda Accords. Click
on the datafile logo to reference the data. The
estimated regression equation is Round
your answers to the nearest dollar. a. Estimate the selling price of a four-year-old Honda Accord with
mileage of miles. $ b. Develop a confidence interval for the selling price of a
car with the data in part (a). ( , ) c. Develop a prediction interval for the selling price of a
car with the data in part (a). ( , )
|
Hide Feedback
Correct
Consider
a regression study involving a dependent variable , a quantitative
independent variable , and a categorical independent variable with three
possible levels (level 1, level 2, and level 3). a. How many dummy variables are required to represent the categorical variable? b. Write a multiple regression equation relating and the
categorical variable to .
Enter
the values of dummy variables and that are used to
indicate the three levels of the categorical variable.
c. Interpret the parameters in your regression equation. is
the change in for a unit change in holding
the categorical variable constant.
|
Hide Feedback
Correct
Johnson
Filtration, Inc. provides maintenance service for water-filtration systems.
Suppose that in addition to information on the number of months since the
machine was serviced and whether a mechanical or an electrical repair was
necessary, the managers obtained a list showing which repairperson performed
the service. The revised data follow. Click
on the datafile logo to reference the data.
a. Ignore for now the months since the last maintenance service ( )
and the repairperson who performed the service. Develop the estimated simple
linear regression equation to predict the repair time () given the type of
repair ( ). Recall that if the type of repair is mechanical
and if the type of repair is electrical (to 2 decimals). Time = + Type b. Does the equation that you developed in part (a) provide a good fit
for the observed data? Explain. (to 4 decimals) ,
because the -value of shows that the relationship is for
any reasonable value of α. c. Ignore for now the months since the last maintenance service and the
type of repair associated with the machine. Develop the estimated simple
linear regression equation to predict the repair time given the repairperson
who performed the service. Let if Bob Jones performed the service
and if Dave Newton performed the service (to 2 decimals). Enter
negative value as negative number. Time = + Person d. Does the equation that you developed in part (c) provide a good fit
for the observed data? Explain.
|
Hide Feedback
Correct
Data
for two variables, and , follow. Excel
File: data15-39.xlsx a. Develop the estimated regression equation for these data (to 1
decimal). + b. Select the correct plot for the residuals against .
Does
the residual plot support the assumptions about ε? Explain. With
only observations, it difficult
to determine if there are any violations in the assumptions. c. Select the correct plot for the standardized residuals
against .
Do
any outliers appear in these data? Explain. Because of
the standard residuals is less than or greater than , we
would conclude that there outliers
in the data set.
|
Hide Feedback
Correct
Hide Feedback Correct The
personnel director for Electronics Associates developed the following
estimated regression equation relating an employee's score on a job
satisfaction test to his or her length of service and wage rate. where
Round
your answers to 2 decimal places. a. Interpret the coefficients in this estimated regression equation. If
the wage rate does not change, a one year increase in length of service is
associated with in
job satisfaction score by units. If the length of service does not change, a dollar
increase in wage results in in
job satisfaction score by units. b. Predict the job satisfaction test score for an employee who has four
years of service and makes per hour.
|
Hide Feedback
Correct
The
admissions officer for Clearwater College developed the following estimated
regression equation relating the final college GPA to the student's SAT
mathematics score and high-school GPA. where
Round
test statistic values to 2 decimal places and all other values to 4 decimal
places. Do not round your intermediate calculations. a. Complete the missing entries in this Excel Regression tool output.
Enter negative values as negative numbers.
b. Using , test for overall significance. c. Did the estimated regression equation provide a good fit to the data?
Explain. d. Use the test and to test and .
Use t table. For ,
the -value is ,
so . For ,
the -value is ,
so .
|
Hide Feedback
Correct
Week 16 Asst
Consider
the following data for two variables, and . Excel
File: data16-03.xls
a. Choose the correct scatter diagram with and .
The
correct scatter diagram is . Does
there appear to be a linear relationship between and ?
Explain. b. Develop the estimated regression equation
relating and . Save "predicted" and
"residuals" (to decimals). c. Choose the correct scatter diagram of the standardized residuals
versus for the estimated regression equation developed in part (b).
The
correct scatter diagram is . Do
the model assumptions appear to be satisfied? Explain. d. Perform a logarithmic transformation ( under Data/Transform
Data/Log10) on the dependent variable . Develop an estimated
regression equation using the transformed dependent variable
(to decimals).
|
Hide Feedback
Correct
A
highway department is studying the relationship between traffic flow and
speed. The following model has been hypothesized. where traffic flow in vehicles per hour The
following data were collected during rush hour for six highways leading out of
the city.
a. Use the data to compute the coefficients of this estimated
regression equation (to decimals). Enter negative value as
negative number. (Create the variable first using Data/Transform
Data/Square.) b. Using , test for a significant relationship. (to decimals) -value (to decimals) The
relationship significant. c. Estimate the traffic flow in vehicles per hour at a speed
of miles per hour (to decimals). vehicles per hour
|
Hide Feedback
Correct
Home
Depot, a nationwide home improvement retailer, sells several brands of
washing machines. A sample of models of full-size washing machines
sold by Home Depot and corresponding capacity (in cubic feet) and list price
(in ) follow (Home Depot website). Click
on the datafile logo to reference the data.
a. Which of the following scatter diagrams represents the data,
treating cubic feet as the independent variable?
Does
a simple linear regression model appear to be appropriate? b. Use a simple regression model to develop an estimated regression
equation to predict the list price given the cubic feet
(to decimals). Enter negative value as negative number. + Choose
a standardized residual plot.
Based
upon the standardized residual plot, does a simple linear regression model
appear to be appropriate? c. Using a second-order model, develop an estimated regression
equation to predict the list price given the cubic feet (to decimals).
Enter negative value as negative number. + + d. Do you prefer the estimated regression equation developed in
part (b) or part (c)? e. Are there other factors that should be considered for inclusion
as independent variables in this regression?
|
Hide Feedback
Correct
As
of September , , the film Suicide Squad had an
average rating of out of based on viewer
ratings (Rotten Tomatoes website). How are the viewer ratings
of Suicide Squad related to the viewer age and the viewer
ratings of The Secret Life of Pets? The file RottenTomatoes contains
a sample of data containing viewer ages and their ratings of Suicide
Squad and The Secret Life of Pets. Click
on the datafile logo to reference the data. a. Select a scatter diagram for these data with the viewers' ages as the
independent variable and their rating of Suicide Squad as
the dependent variable.
Does
a simple linear regression model appear to be appropriate? b. Use the data provided to develop the regression equation for
estimating the viewer ratings of Suicide Squad that is
suggested by the scatter diagram in part (a) (to decimals). c. Include the viewer rating of The Secret Life of Pets as
an independent variable in the regression model developed in part (b).
Interpret the regression coefficient for the viewer rating of The
Secret Life of Pets (to decimals). Enter negative values
as negative numbers. Holding
the viewer's age constant, a one point increase in the viewer's rating of The
Secret Life of Pets coincides with an estimated of points in the viewer's rating of Suicide Squad. d. Is the regression equation developed in part (b) or the regression
equation developed in part (c) superior? Use the equation that
explains more variation. e. Suppose a -year-old viewer gave The Secret Life of Pets a
rating of . Use the model you selected in part (d) to predict
that viewer's rating of Suicide Squad (to decimals).
|
Hide Feedback
Correct
The
Ladies Professional Golfers Association of America (LPGA) maintains
statistics on performance and earnings for members of the LPGA Tour. Year-end
performance statistics for golfers for appear in the
file LPGA2014Stats (LPGA website). Earnings () is the total
earnings in thousands of dollars; Scoring Avg. is the average score for all
events; Greens in Reg. is the percentage of time a player is able to hit the
greens in regulation; Putting Avg. is the average number of putts taken on
greens hit in regulation; and Drive Accuracy is the percentage of times a tee
shot comes to rest in the fairway. A green is considered hit in regulation if
any part of the ball is touching the putting surface and the difference
between the value of par for the hole and the number of strokes taken to hit
the green is at least . Click
on the datafile logo to reference the data. If
not otherwise stated, round your answers to the nearest whole number. Enter negative
values as negative numbers. a. Develop an estimated regression equation that can be used to predict
the total earnings for all events given the average number of putts taken on
greens hit in regulation. b. Develop an estimated regression equation that can be used to predict
the total earnings for all events given the average number of putts taken on
greens hit in regulation, the percentage of time a player is able to hit the
greens in regulation, and the percentage of times a player's tee shot comes
to rest in the fairway. c. At the level of significance, test whether the two
independent variables added in part (b), the percentage of time a
player is able to hit the greens in regulation and the percentage of times a
player's tee shot comes to rest in the fairway, contribute significantly to
the estimated regression equation developed in part (a). Use Table 4 in Appendix B. (to decimals) The -value
associated with is . What
is your conclusion? The
addition of the two independent variables statistically
significant. d. In general, lower scores should lead to higher earnings. To
investigate this option for predicting total earnings, develop an estimated
regression equation that can be used to predict total earnings for all events
given the average score for all events. Would
you prefer to use this equation to predict total earnings, or would you
prefer to use the estimated regression equation developed in part (b)?
Explain.
I.
Although the equation developed in
part (b) provides a better fit, the equation developed in part (d)
is a simpler model. Therefore, we cannot determine which equation is better.
II.
Because the equation developed in
part (b) provides a better fit, it is preferred over the equation
developed in part (d).
III.
Because the equation developed in
part (d) provides a better fit and is a simpler model, it is preferred
over the equation developed in part (b).
|
Hide Feedback
Partially Correct
The
average monthly residential gas bill for Black Hills Energy customers in
Cheyenne, Wyoming is (Wyoming Public Service Commission website).
How is the average monthly gas bill for a Cheyenne residence related to the
square footage, number of rooms, and age of the residence? The following data
show the average monthly gas bill for last year, square footage, number of
rooms, and age for typical Cheyenne residences.
a. Develop an estimated regression equation that can be used to predict a
residence's average monthly gas bill for last year given its age. Round your
answers to four decimals. b. Develop an estimated regression equation that can be used to predict a
residence's average monthly gas bill for last year given its age, square
footage, and number of rooms. Round your answers to four decimals. Enter
negative value as negative number. c. At the level of significance, test whether the two
independent variables added in part (b), the square footage and the
number of rooms, contribute significantly to the estimated regression
equation developed in part (a). If not stated otherwise, round your
answers to four decimals. Use Table 4 from Appendix B.
The p-value
associated with F is .
Therefore, the addition of the two independent variables statistically
significant.
|
Hide Feedback
Partially Correct
Consider
a completely randomized design involving four
treatments: , , , and . Select a correct multiple
regression equation that can be used to analyze these data. Define all
variables.
I.
,
II.
,
III.
,
|
Hide Feedback
Correct
Select
a correct multiple regression equation that can be used to analyze the data
for a two-factorial design with two levels for factor and three
levels for factor . Define all variables.
I.
,
II.
,
III.
,
IV.
,
V.
,
|
Hide Feedback
Correct
Four
different paints are advertised as having the same drying time. To check the
manufacturers' claims, five samples were tested for each of the paints. The
time in minutes until the paint was dry enough for a second coat to be
applied was recorded for each sample. The data obtained follow. Excel
File: data16-21.xls
a. Use to test for any significant differences in mean drying
time among the paints. If your answer is zero, enter "0". Lets
define the dummy variables as follows:
At
the level of significance, . b. What is your estimate of mean drying time for paint (to
whole number)?
|
Hide Feedback
Correct
Mbuy
is a media consulting firm that provides advice to companies on how to
allocate their advertising budgets. Mbuy designed a factorial experiment to
test the effect of the size of a banner ad on a website and the ad design on
the number (in thousands) of product inquiries received. Three advertising
designs and two sizes of advertisements were considered. The following data
were obtained. Test for any significant effects due to type of design, size
of advertisement, or interaction. Use . If your answer is zero enter
"". Excel
File: data16-23.xls
Let
define:
The
analysis shows that has
a significant effect.
|
Hide Feedback
Correct
Refer
to the following Cravens data set. Click
on the datafile logo to reference the data.
The
estimated regression equation involving Accounts, AdvExp, Poten, and Share
had an adjusted coefficient of determination of . Use
the level of significance and apply the Durbin-Watson test to
determine whether positive autocorrelation is present. Use Table 16.10.
|
Hide Feedback
Correct
Consumer
Reports tested different
brands and models of road, fitness, and comfort bikes. Road bikes are
designed for long road trips; fitness bikes are designed for regular workouts
or daily commutes; and comfort bikes are designed for leisure rides on typically
flat roads. The following data show the type, weight (), and price () for
the bicycles tested. Click
on the datafile logo to reference the data.
a. Select an appropriate scatter diagram with weight as the independent
variable and price as the dependent variable.
Select
the correct scatter diagram from the options above. Does
a simple linear regression model appear to be appropriate? Round
your answers to four decimal places. b. Develop an estimated multiple regression equation
with and as the two independent variables. c. Use the following dummy variables to develop an estimated regression
equation that can be used to predict the price given the type of
bike: if the bike is a fitness bike, otherwise;
and if the bike is a comfort bike; otherwise. Compare
the results obtained to the results obtained in part (b). Type
of bike appears to be a(n) factor
in predicting price. But, the estimated regression equation developed in part
(b) appears to provide a slightly fit. d. To account for possible interaction between the type of bike and the
weight of the bike, develop a new estimated regression equation that can be
used to predict the price of the bike given the type, the weight of the bike,
and any interaction between weight and each of the dummy variables defined in
part (c). What estimated regression equation appears to be the best
predictor of price? Please round to four decimal places.
|
Hide Feedback
Correct
A
study investigated the relationship between audit delay and variables that
describe the client and the auditor. The file Audit contains
data from a sample of companies on the following set of variables:
Click
on the datafile logo to reference the data. Consider
a model in which only Industry is used to predict Delay. At
a level of significance, test for any positive autocorrelation in
the data. Use Table 16.10.
|
Hide Feedback
Correct
A
study was conducted to investigate browsing activity by shoppers. Shoppers
were classified as nonbrowsers, light browsers, and heavy browsers. For each
shopper in the study a measure was obtained to determine how comfortable the
shopper was in the store. Higher scores indicated greater comfort . Assume
that the following data are from this study.
Use
a level of significance to test for differences in comfort levels
among the three types of browsers.
|
Hide Feedback
Correct
Comments
Post a Comment