
3/12/2018 Unit VII Homework-Nastasskia Sy

Student: Nastasskia Sy
Date: 03/12/18
Instructor: Sheana Mullen, Adam Wilson, Matthew Napier, Chris Marsh
Course: MTH2023-W15Fc-4B18-S1
Assignment: Unit VII Homework

1. Describe the range of values for the correlation coefficient.

Choose the correct answer below.

A. The range of values for the correlation coefficient is 0 to 1, inclusive.
B. The range of values for the correlation coefficient is 0 to 1, not inclusive.
C. The range of values for the correlation coefficient is −1 to 1, inclusive.
D. The range of values for the correlation coefficient is −1 to 1, not inclusive.

2. Which value of r indicates a stronger correlation: r = 0.831 or r = −0.941? Explain your reasoning.

Choose the correct answer below.

A. r = 0.831 represents a stronger correlation because |−0.941| > |0.831|.
B. r = −0.941 represents a stronger correlation because 0.831 > −0.941.
C. r = −0.941 represents a stronger correlation because |−0.941| > |0.831|.
D. r = 0.831 represents a stronger correlation because 0.831 > −0.941.

3. Explain how to determine whether a sample correlation coefficient indicates that the population correlation coefficient is
significant.

Choose the correct answer below.

A. A scatterplot can be created to see if the sample values are accurate to within a critical value of
the population values.
B. The sample correlation coefficient r can be directly compared to the population coefficient.
C. A table can be used to compare the absolute value of r with a critical value, or a hypothesis test
can be performed using a t-test.
D. A table can be used to compare the population coefficient with a critical value, or a hypothesis
test can be performed using a t-test.

4. What does it mean to say "correlation does not imply causation"?

Choose the correct answer below.

A. The fact that two variables are strongly correlated implies a cause-and-effect relationship
between the variables.
B. The fact that two variables are strongly correlated does not in itself imply a cause-and-effect
relationship between the variables.
C. Two variables can only be strongly correlated if there existed a cause-and-effect relationship
between the variables.
D. Two variables that have a cause-and-effect relationship are never correlated.

https://xlitemprod.pearsoncmg.com/api/v1/print/math 1/14

5. The scatter plot of a paired data set is shown. Determine whether there is a perfect positive linear
correlation, a strong positive linear correlation, a perfect negative linear correlation, a strong
negative linear correlation, or no linear correlation between the variables.

Choose the correct answer below.

no linear correlation
perfect positive linear correlation
strong negative linear correlation
strong positive linear correlation
perfect negative linear correlation

6. The scatter plot of a paired data set is shown. Determine whether there is a perfect positive
linear correlation, a strong positive linear correlation, a perfect negative linear correlation, a
strong negative linear correlation, or no linear correlation between the variables.

Choose the correct answer below.

perfect positive linear correlation


no linear correlation
strong positive linear correlation
perfect negative linear correlation
strong negative linear correlation

7. Suppose the scatter plot shows the results of a survey of 37 randomly selected males ages
24 to 35. Using age as the explanatory variable, choose the appropriate description for the
graph. Explain your reasoning.

(a) Age and body temperature
(b) Age and balance on student loans
(c) Age and income
(d) Age and height

[Scatter plot: x-axis Age, 24 to 36; y-axis in thousands of units, 0 to 100]

The response variable is income because you would expect this variable and age to have
a positive correlation and high variation for adult males.


8. Identify the explanatory variable and the response variable.

A golfer wants to determine if the type of equipment used every year can be used to predict the
amount of improvement in his game.

The explanatory variable is the type of equipment used.

The response variable is the amount of improvement in his game.

9. For the following data (a) display the data in a scatter plot, (b) calculate the correlation coefficient r, and (c) make a
conclusion about the type of correlation.
The ages (in years) of 6 children and the number of words in their vocabulary
Age, x               1     2      3      4      5      6
Vocabulary size, y   450   1050   1200   1450   2300   2600

(a) Choose the correct scatter plot below.

[Scatter plot choices A–D: age (0 to 7) and vocabulary size (0 to 2650); choice A puts vocabulary on the horizontal axis, choices B, C, and D put age on the horizontal axis]

(b) The correlation coefficient r is 0.978.
(Round to three decimal places as needed.)

(c) Which of the following best describes the type of correlation that exists between age and vocabulary size?

A. Strong negative linear correlation


B. Strong positive linear correlation
C. Weak negative linear correlation
D. No linear correlation
E. Weak positive linear correlation
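
The value in part (b) can be checked directly from the definition of the sample correlation coefficient. The following is a minimal sketch using only the standard library; the helper name `pearson_r` is an illustrative choice, not something the assignment prescribes.

```python
import math

def pearson_r(xs, ys):
    """Sample correlation coefficient r for paired data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations about the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

ages = [1, 2, 3, 4, 5, 6]
vocabulary = [450, 1050, 1200, 1450, 2300, 2600]

r = pearson_r(ages, vocabulary)
print(round(r, 3))  # 0.978
```

A value this close to 1 is consistent with a strong positive linear correlation.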


10. For the following data (a) display the data in a scatter plot, (b) calculate the sample correlation coefficient r, and (c) make a
conclusion about the type of correlation. Use technology.
The earnings per share (in dollars) and the dividends per share (in dollars) for 6 medical supplies companies in a recent
year are shown in the data set below.

Earnings per share, x    2.73   5.07   4.56   3.06   3.76   2.25
Dividends per share, y   0.56   2.44   1.45   0.82   1.01   0.22

(a) Choose the correct scatter plot below. Use technology.

[Scatter plot choices A–D: earnings/share (in $, 0 to 6) and dividends/share (in $, 0 to 3); choices A, B, and C put earnings on the horizontal axis, choice D puts dividends on the horizontal axis]

(b) The correlation coefficient r is 0.957.
(Round to three decimal places as needed.)

(c) Which of the following best describes the type of correlation that exists between earnings per share and dividends per
share?

A. Strong positive linear correlation


B. Strong negative linear correlation
C. Weak negative linear correlation
D. Weak positive linear correlation
E. No linear correlation


11. The weights (in pounds) of 6 vehicles and the variability of their braking distances (in feet) when stopping on a dry surface
are shown in the table. Can you conclude that there is a significant linear correlation between vehicle weight and variability
in braking distance on a dry surface? Use α = 0.01.
Weight, x                            5950   5380   6500   5100   5870   4800
Variability in braking distance, y   1.78   1.91   1.87   1.58   1.64   1.50

Critical values for Student's t-distribution are listed in table 1.

Set up the hypotheses for the test.

H0 : ρ = 0
Ha : ρ ≠ 0

Identify the critical value(s). Select the correct choice below and fill in any answer boxes within your choice.
(Round to three decimal places as needed.)
A. The critical values are −t0 = −4.604 and t0 = 4.604.
B. The critical value is ____.

Calculate the test statistic.

t = 1.746 (Round to three decimal places as needed.)

What is your conclusion?

There is not enough evidence at the 1% level of significance to conclude that there is a significant linear correlation
between vehicle weight and variability in braking distance on a dry surface.
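
The test statistic for this problem follows from t = r / sqrt((1 − r²) / (n − 2)) with n − 2 = 4 degrees of freedom. The sketch below recomputes it from the data; the critical value 4.604 is read from the t-distribution table (d.f. = 4, two tails, α = 0.01), and the variable names are illustrative.

```python
import math

weights = [5950, 5380, 6500, 5100, 5870, 4800]
variability = [1.78, 1.91, 1.87, 1.58, 1.64, 1.50]

def pearson_r(xs, ys):
    """Sample correlation coefficient r for paired data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

n = len(weights)
r = pearson_r(weights, variability)

# Standardized test statistic for H0: rho = 0, with n - 2 degrees of freedom.
t_stat = r / math.sqrt((1 - r ** 2) / (n - 2))

# Critical value taken from the table: d.f. = 4, two tails, alpha = 0.01.
t_crit = 4.604
reject_h0 = abs(t_stat) > t_crit

print(f"r = {r:.3f}, t = {t_stat:.3f}, reject H0: {reject_h0}")
```

Because |t| falls well inside the interval from −4.604 to 4.604, the null hypothesis is not rejected.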

1: Data Table


Level of confidence, c   0.50   0.80   0.90   0.95    0.98    0.99
One tail, α              0.25   0.10   0.05   0.025   0.01    0.005
Two tails, α             0.50   0.20   0.10   0.05    0.02    0.01
d.f.
1 1.000 3.078 6.314 12.706 31.821 63.657
2 0.816 1.886 2.920 4.303 6.965 9.925
3 0.765 1.638 2.353 3.182 4.541 5.841
4 0.741 1.533 2.132 2.776 3.747 4.604
5 0.727 1.476 2.015 2.571 3.365 4.032
6 0.718 1.440 1.943 2.447 3.143 3.707
7 0.711 1.415 1.895 2.365 2.998 3.499
8 0.706 1.397 1.860 2.306 2.896 3.355
9 0.703 1.383 1.833 2.262 2.821 3.250
10 0.700 1.372 1.812 2.228 2.764 3.169
11 0.697 1.363 1.796 2.201 2.718 3.106
12 0.695 1.356 1.782 2.179 2.681 3.055
13 0.694 1.350 1.771 2.160 2.650 3.012
14 0.692 1.345 1.761 2.145 2.624 2.977
15 0.691 1.341 1.753 2.131 2.602 2.947
16 0.690 1.337 1.746 2.120 2.583 2.921
17 0.689 1.333 1.740 2.110 2.567 2.898
18 0.688 1.330 1.734 2.101 2.552 2.878
19 0.688 1.328 1.729 2.093 2.539 2.861
20 0.687 1.325 1.725 2.086 2.528 2.845
21 0.686 1.323 1.721 2.080 2.518 2.831
22 0.686 1.321 1.717 2.074 2.508 2.819
23 0.685 1.319 1.714 2.069 2.500 2.807
24 0.685 1.318 1.711 2.064 2.492 2.797
25 0.684 1.316 1.708 2.060 2.485 2.787
26 0.684 1.315 1.706 2.056 2.479 2.779
27 0.684 1.314 1.703 2.052 2.473 2.771
28 0.683 1.313 1.701 2.048 2.467 2.763
29 0.683 1.311 1.699 2.045 2.462 2.756
∞ 0.674 1.282 1.645 1.960 2.326 2.576

12. Two variables have a positive linear correlation. Is the slope of the regression line for the variables positive or negative?

A. The slope is negative. As the independent variable increases the dependent variable also
tends to increase.
B. The slope is negative. As the independent variable increases the dependent variable tends to
decrease.
C. The slope is positive. As the independent variable increases the dependent variable also
tends to increase.
D. The slope is positive. As the independent variable increases the dependent variable tends to
decrease.


13. Explain how to predict y-values using the equation of a regression line.

Choose the correct answer below.

A. Substitute a value of y into the equation of a regression line and solve for x.
B. Substitute a value of x into the equation of a regression line and solve for y.

C. Use the graph of the regression line to determine the x-value that corresponds to the y-value
for which you are solving.
D. Substitute the correlation coefficient into the equation and solve for y.

14. Match this description with a description below.

The y-value of a data point corresponding to xi

Choose the correct answer below.

A. yi
B. b
C. m
D. ŷi

15. Match this description with a description below.

Slope

Choose the correct answer below.

A. m
B. b
C. yi
D. ŷi


16. Find the equation of the regression line for the given data. Then construct a scatter plot of the data and draw the
regression line. (The pair of variables have a significant correlation.) Then use the regression equation to predict the value
of y for each of the given x-values, if meaningful. The table below shows the heights (in feet) and the number of stories of
six notable buildings in a city.
Height, x    774   625   521   508   497   477
Stories, y   51    47    52    25    39    33

(a) x = 503 feet (b) x = 646 feet (c) x = 318 feet (d) x = 727 feet

Find the regression equation.

y = 0.057x + 8.657
(Round the slope to three decimal places as needed. Round the y-intercept to two decimal places as needed.)

Choose the correct graph below.

[Graph choices A–D: Stories (0 to 60) versus Height in feet (0 to 800), each with a regression line]

(a) Predict the value of y for x = 503. Choose the correct answer below.

A. 37
B. 50
C. 45
D. not meaningful

(b) Predict the value of y for x = 646. Choose the correct answer below.

A. 45
B. 27
C. 37
D. not meaningful

(c) Predict the value of y for x = 318. Choose the correct answer below.

A. 45
B. 27
C. 50
D. not meaningful

(d) Predict the value of y for x = 727. Choose the correct answer below.

A. 37
B. 27
C. 50
D. not meaningful


17. Find the equation of the regression line for the given data. Then construct a scatter plot of the data and draw the
regression line. (The pair of variables have a significant correlation.) Then use the regression equation to predict the value
of y for each of the given x-values, if meaningful. The number of hours 6 students spent studying for a test and their
scores on that test are shown below.

Hours spent studying, x   0    1    2    3    5    5
Test score, y             37   43   52   51   62   70

(a) x = 2 hours (b) x = 3.5 hours (c) x = 15 hours (d) x = 1.5 hours

Find the regression equation.

y = 5.625x + 37.500
(Round the slope to three decimal places as needed. Round the y-intercept to two decimal places as needed.)

Choose the correct graph below.

[Graph choices A–D: Test score (0 to 80) versus Hours studying (0 to 8), each with a regression line]

(a) Predict the value of y for x = 2. Choose the correct answer below.

A. 48.8
B. 45.9
C. 57.2
D. not meaningful

(b) Predict the value of y for x = 3.5. Choose the correct answer below.

A. 57.2
B. 45.9
C. 121.9
D. not meaningful

(c) Predict the value of y for x = 15. Choose the correct answer below.

A. 57.2
B. 121.9
C. 48.8
D. not meaningful

(d) Predict the value of y for x = 1.5. Choose the correct answer below.

A. 48.8
B. 45.9
C. 121.9
D. not meaningful
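
The regression equation and the four predictions in this problem can be reproduced with a short least-squares fit. This is a sketch under one stated assumption: a prediction is treated as "not meaningful" whenever x falls outside the range of the observed x-values. The helper names `fit_line` and `predict` are illustrative.

```python
hours = [0, 1, 2, 3, 5, 5]
scores = [37, 43, 52, 51, 62, 70]

def fit_line(xs, ys):
    """Least-squares slope and y-intercept for paired data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def predict(x, slope, intercept, xs):
    """Predicted y, or None when x lies outside the observed x-range (assumed rule)."""
    if x < min(xs) or x > max(xs):
        return None
    return slope * x + intercept

slope, intercept = fit_line(hours, scores)
print(f"y = {slope:.3f}x + {intercept:.3f}")

predictions = {x: predict(x, slope, intercept, hours) for x in (2, 3.5, 15, 1.5)}
for x, y_hat in predictions.items():
    print(x, "not meaningful" if y_hat is None else f"{y_hat:.2f}")
```

Substituting x into the fitted line is the same procedure described in problem 13; the range check only guards against extrapolating to x = 15 hours, far beyond the largest observed value of 5.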


18. The accompanying data are the number of wins and the earned run averages (mean number of earned runs allowed per
nine innings pitched) for eight baseball pitchers in a recent season. Find the equation of the regression line. Then
construct a scatter plot of the data and draw the regression line. Then use the regression equation to predict the value of y
for each of the given x-values, if meaningful. If the x-value is not meaningful to predict the value of y, explain why not.
(a) x = 5 wins (b) x = 10 wins (c) x = 21 wins (d) x = 15 wins
The numbers of wins and earned run averages are listed in table 2, Wins and ERA.

The equation of the regression line is y = −0.19x + 6.49.
(Round to two decimal places as needed.)

Construct a scatter plot of the data and draw the regression line. Choose the correct graph below.

[Graph choices A–D: ERA (0 to 6) versus Wins (0 to 24), each with a regression line]

(a) Predict the ERA for 5 wins, if it is meaningful. Select the correct choice below and, if necessary, fill in the answer box
within your choice.

A. y = (Round to two decimal places as needed.)


B. It is not meaningful to predict this value of y because x = 5 is well outside the range of the
original data.
C. It is not meaningful to predict this value of y because x = 5 is not an x-value in the original
data.

(b) Predict the ERA for 10 wins, if it is meaningful. Select the correct choice below and, if necessary, fill in the answer box
within your choice.

A. y = (Round to two decimal places as needed.)


B. It is not meaningful to predict this value of y because x = 10 is not an x-value in the original
data.
C. It is not meaningful to predict this value of y because x = 10 is inside the range of the original
data.

(c) Predict the ERA for 21 wins, if it is meaningful. Select the correct choice below and, if necessary, fill in the answer box
within your choice.

A. y = (Round to two decimal places as needed.)


B. It is not meaningful to predict this value of y because x = 21 is not an x-value in the original
data.
C. It is not meaningful to predict this value of y because x = 21 is well outside the range of the
original data.

(d) Predict the ERA for 15 wins, if it is meaningful. Select the correct choice below and, if necessary, fill in the answer box
within your choice.

A. y = (Round to two decimal places as needed.)


B. It is not meaningful to predict this value of y because x = 15 is inside the range of the original
data.
C. It is not meaningful to predict this value of y because x = 15 is not an x-value in the original
data.

2: Wins and ERA

Wins, x    Earned run average, y
20 2.81
18 3.28
17 2.73
16 3.71
14 3.87
12 4.29
11 3.83
9 5.18
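
A least-squares fit of this table can be checked with the computational formulas m = (nΣxy − ΣxΣy) / (nΣx² − (Σx)²) and b = ȳ − m·x̄. The sketch below applies them to the eight pitchers; since ERA tends to fall as wins rise, a negative slope is expected. Variable names are illustrative, and only x-values inside the observed range of 9 to 20 wins are evaluated.

```python
wins = [20, 18, 17, 16, 14, 12, 11, 9]
era = [2.81, 3.28, 2.73, 3.71, 3.87, 4.29, 3.83, 5.18]

n = len(wins)
sum_x = sum(wins)
sum_y = sum(era)
sum_xy = sum(x * y for x, y in zip(wins, era))
sum_x2 = sum(x * x for x in wins)

# Computational formulas for the least-squares slope and intercept.
slope = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
intercept = sum_y / n - slope * sum_x / n

print(f"y = {slope:.2f}x + {intercept:.2f}")

# Predictions for x-values that lie inside the observed range of wins.
era_10 = slope * 10 + intercept
era_15 = slope * 15 + intercept
print(f"10 wins: {era_10:.2f}, 15 wins: {era_15:.2f}")
```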


19. Use the data in the table below to complete parts (a) through (c).

x 5 6 7 11 13 17 19 47
y 30 32 21 28 19 22 25 9
The steps for finding influential points are listed in note 3.

(a) Construct a scatterplot of the data. Choose the correct graph below.

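[Scatterplot choices A–D: A has x from 0 to 50 and y from 0 to 40; B has x from 0 to 40 and y from 0 to 50; C has x from 0 to 50 and y from −12 to 12; D has x from 0 to 40 and y from −12 to 12]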

(b) Identify any possible outliers.

A. The point (6,32) may be an outlier.


B. The point (19,25) may be an outlier.
C. The point (5,30) may be an outlier.
D. The point (47,9) may be an outlier.
E. There are no outliers.

(c) Determine if the point is influential. The change in slope or intercept is significant if it is larger than 10%.

The point (1) ____ an influential point because the slopes with the point included and without the point included
(2) ____ significantly different, and the intercepts (3) ____ significantly different.

3: Steps for Finding Influential Points


An influential point is a point in a data set that can greatly affect a regression line. An outlier may or may not be an
influential point. To determine if a point is influential, find the regression lines including all the points in the data set, and
excluding the possible influential point. If the slope or y-intercept of the regression line shows significant changes, the
point can be considered influential. An influential point can be removed from a data set only if there is proper justification.

Choices for the blanks:
(1) is / is not
(2) are not / are
(3) are not / are
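
The steps for finding influential points can be sketched as two fits, one with and one without the candidate point, followed by a comparison against the 10% threshold. Two assumptions are made here and are not stated in the problem: the candidate is taken to be (47, 9), the point whose x-value lies far from the rest, and the percent change is measured relative to the fit that includes every point.

```python
xs = [5, 6, 7, 11, 13, 17, 19, 47]
ys = [30, 32, 21, 28, 19, 22, 25, 9]
candidate = (47, 9)  # assumed candidate point

def fit_line(x_vals, y_vals):
    """Least-squares slope and y-intercept for paired data."""
    n = len(x_vals)
    mean_x = sum(x_vals) / n
    mean_y = sum(y_vals) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(x_vals, y_vals))
    sxx = sum((x - mean_x) ** 2 for x in x_vals)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def relative_change(with_point, without_point):
    """Change relative to the value obtained with all points (assumed baseline)."""
    return abs(without_point - with_point) / abs(with_point)

slope_all, intercept_all = fit_line(xs, ys)

kept = [(x, y) for x, y in zip(xs, ys) if (x, y) != candidate]
slope_wo, intercept_wo = fit_line([x for x, _ in kept], [y for _, y in kept])

slope_change = relative_change(slope_all, slope_wo)
intercept_change = relative_change(intercept_all, intercept_wo)
influential = slope_change > 0.10 or intercept_change > 0.10

print(f"slope: {slope_all:.3f} -> {slope_wo:.3f} ({slope_change:.1%})")
print(f"intercept: {intercept_all:.3f} -> {intercept_wo:.3f} ({intercept_change:.1%})")
print("influential:", influential)
```

Under these assumptions both changes stay below 10%, which illustrates the note's remark that an outlier may or may not be an influential point.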


20. Use the value of the linear correlation coefficient to calculate the coefficient of determination. What does this tell you about
the explained variation of the data about the regression line? About the unexplained variation?

r = −0.631

Calculate the coefficient of determination.

0.398
(Round to three decimal places as needed.)

What does this tell you about the explained variation of the data about the regression line?

39.8% of the variation can be explained by the regression line.
(Round to one decimal place as needed.)

About the unexplained variation?

60.2% of the variation is unexplained and is due to other factors or to sampling error.
(Round to one decimal place as needed.)
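
The arithmetic behind these answers is a single squaring step: the coefficient of determination is r², and the unexplained share is 1 − r². A minimal sketch, with the helper name `variation_split` chosen for illustration:

```python
def variation_split(r):
    """Return (explained, unexplained) shares of variation for correlation r."""
    r_squared = r ** 2
    return r_squared, 1 - r_squared

explained, unexplained = variation_split(-0.631)
print(f"r^2 = {explained:.3f}")  # r^2 = 0.398
print(f"explained: {explained:.1%}, unexplained: {unexplained:.1%}")
```

The sign of r drops out when squaring, so a negative correlation explains the same share of variation as a positive one of equal magnitude.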

21. Use the value of the linear correlation coefficient to calculate the coefficient of determination. What does this tell you about
the explained variation of the data about the regression line? About the unexplained variation?

r = 0.594

Calculate the coefficient of determination.

(Round to three decimal places as needed.)

What does this tell you about the explained variation of the data about the regression line?

____% of the variation can be explained by the regression line.
(Round to one decimal place as needed.)

About the unexplained variation?

____% of the variation is unexplained and is due to other factors or to sampling error.
(Round to one decimal place as needed.)

22. The table below shows the average weekly wages (in dollars) for state government employees and federal government
employees for 10 years. Construct and interpret a 99% prediction interval for the average weekly wages of federal
government employees when the average weekly wages of state government employees is $806. The equation of the
regression line is y = 1.572x − 151.032.
Wages (state), x     740     776     800     807     832     895     918     924     943     976
Wages (federal), y   1,002   1,037   1,108   1,147   1,193   1,248   1,274   1,304   1,325   1,387

Construct and interpret a 99% prediction interval for the average weekly wages of federal government employees when
the average weekly wages of state government employees is $806. Select the correct choice below and fill in the answer
boxes to complete your choice.
(Round to the nearest cent as needed.)
A. There is a 99% chance that the predicted average weekly wages of federal government
employees is between $ and $ , given a state average weekly
wage of $806.
B. We can be 99% confident that when the average weekly wages of state government
employees is $806, the average weekly wages of federal government employees will be
between $ and $ .
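
A prediction interval of this kind is ŷ ± E, where E = t_c · s_e · sqrt(1 + 1/n + (x0 − x̄)² / Σ(x − x̄)²) and s_e is the standard error of estimate with n − 2 degrees of freedom. The sketch below applies that formula using the regression line given in the problem. The critical value 3.355 is read from the t-distribution table (d.f. = 8, two tails, α = 0.01); variable names are illustrative.

```python
import math

state = [740, 776, 800, 807, 832, 895, 918, 924, 943, 976]
federal = [1002, 1037, 1108, 1147, 1193, 1248, 1274, 1304, 1325, 1387]

slope, intercept = 1.572, -151.032  # regression line given in the problem
x0 = 806                            # state wage at which to predict
t_crit = 3.355                      # table value: d.f. = 8, two tails, alpha = 0.01

n = len(state)
y_hat = slope * x0 + intercept

# Standard error of estimate from the residuals about the given line.
residual_ss = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(state, federal))
s_e = math.sqrt(residual_ss / (n - 2))

mean_x = sum(state) / n
sxx = sum((x - mean_x) ** 2 for x in state)

# Margin of error for predicting a single y-value at x0.
margin = t_crit * s_e * math.sqrt(1 + 1 / n + (x0 - mean_x) ** 2 / sxx)
lower, upper = y_hat - margin, y_hat + margin

print(f"y_hat = {y_hat:.2f}")
print(f"99% prediction interval: {lower:.2f} to {upper:.2f}")
```

Because the residuals here are computed from the rounded coefficients in the problem statement, the endpoints may differ slightly from those obtained with an unrounded fit.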


23. Find the equation of the regression line for the given data. Then construct a scatter plot of the data and draw the
regression line. The table shows the shoe size and heights (in.) for 6 men.

Shoe size, x   6.0    10.0   10.5   12.0   13.0   13.5
Height, y      65.0   69.0   72.0   70.0   74.0   73.0

Find the regression equation.

y = ____x + ____
(Round to three decimal places as needed.)

Choose the correct graph below.

[Graph choices A–D: Height in inches (65 to 75) versus Shoe size (6 to 14), each with a regression line]
