- Analysis of Variance(ANOVA) is a statistical method of comparing the ________ of several populations.
- Standard deviations
- Variances
- Means
- None of the above
- If H0: b1= b2 = b3 =0 is rejected, then we can conclude that there is no linear relationship between the dependent (y) and independent (x) variables in the model.
- True
- False
- In an ANOVA, when testing the significance at α= 0.05, the p-value is 0.003, it can be concluded that:
- there is no statistical evidence that any population mean is different from any other
- no two population means are equal
- no two variances are equal
- there is strong statistical evidence that not all the population means are equal
- When independent variables are correlated with each other, multicollinearity is present.
- True
- False
- The F-ratio used to test for the existence of a linear relationship between the dependent variable and any independent variable is:
- MSR/MSE
- MSR/MST
- MSE/MSR
- None of the above
Use the information below to answer Questions 6 to 9.
Students were randomly selected from TSU undergraduate ECON 3050-01 Class. Their undergraduate performance was compared to their high school GPA and scores on standardized tests. The multiple regression analysis output was as follows:
Predictor | Coefficients | Standard Error | t-ratio | p-value |
Intercept | 1.1066 | 0.2059 | 5.37 | 0.003 |
High School GPA | 0.4755 | 0.1630 | 2.93 | 0.033 |
Standard score | 0.0013392 | 0.0006693 | 2.00 | 0.102 |
Analysis of Variance (ANOVA)
Source | df | SS | MS | F | p-value |
Regression | 2 | 1.02751 | 0.51375 | 170.77 | 0.000 |
Residual | 5 | 0.01504 | 0.00301 | ||
Total | 7 | 1.04255 |
- Using degrees of freedom (df), the total number of students (n) in this sample was:
- 7 students
- 8 students
- 5 students
- Cannot be determined
- Assuming that high school GPA is (x1) and standard score (x2), the regression equation to predict the independent variable (y) is:
- y = 0.4775 x1+ 0.0013392×2
- y = 0.2059 + 0.1630×1 + 0.0006693×2
- y = 1.1066 + 0.4775×1+ 0.0013392×2
- Not enough information given
- At the 5% level of significance, are high school GPA scores and standard scores significant?
- Both are significant
- Neither are significant
- Only high school GPA is significant
- Only standard scores are significant
- What is the value of R2using the information from the above table?
- 4%
- 6%
- 8%
- Insufficient information to determine
- When all members of every block are randomly assigned to all factors/treatments, the design is called:
- Repeated measures design
- Tukey design
- One-way ANOVA
- Randomized complete block design
- Dummy variables are used when:
- Qualitative variables are involved in the model
- Quantitative variables are involved in the model
- Making transformations of quantitative variables
- None of the above
- Multicollinearity may cause the signs of some estimated regression parameters to be the opposite of what we would expect.
- True
- False
- As more independent variables are added to a multiple regression model, ___________ will increase; this is not always so with ___________, which will only increase if the additional variables add substantial explanatory power to the model.
- R2; adjusted R2
- adjusted R2; R2
- Both a) and b)
- None of the above
- TSU wanted to see if there is a relationship between installation of security cameras and crime rate in hostels. A randomly select 5 hostels had security cameras set up at TSU. If we see that crime has decreased in all 5 hostels, we can conclude that the security cameras caused the decrease in crime rate.
- True
- False
- When the null hypothesis, H0: b1= b2 = b3 = 0, is not rejected, the interpretation should be:
- There is no linear relationship between y and any of the three independent variables
- There is a regression relationship between y and at least one of the three independent variables
- All three independent variables have equal slopes
- There is a regression relationship between y and all three independent variables
- A variance inflation factor (VIF) of ______ means there is no correlation between independent variables while a VIF exceeds __________ shows that there is enough correlation
- 5 and 0
- 5 and 1
- 1 and 5
- None of the above
- If the purpose of the regression model is to provide a prediction for the dependent variable (y), the presence of multicollinearity is not necessarily a problem in using the model.
- True
- False
- When One-Way ANOVA F-test is found to be significant, which statistical method is used as a follow-up procedure to determine means that are statistically different?
- t-test for related mean
- Tukey-Kramer test
- t-test for differences between independent means
- None of the above
- In a one-Way ANOVA, if the computed F-statistic exceeds the critical F value, we may reject the null hypothesis since there is evidence that at least one of the means differs.
- True
- False
Consider the one-way ANOVA table below and answer Questions 20 and 21
Source | df | Sum of Squares |
Regression | 3 | 213.88 |
Residual/Error | 20 | 11.21 |
Total | 23 | 225.09 |
- What is the mean square error (MSE)?
- 56
- 88
- 70
- None of the above
- Assuming there are equal number of observations in each factor (treatment), then the facto consists _________ observations
- 3
- 4
- 6
- 23
- Stepwise regression is one of the ways to prevent the problem of multicollinearity.
- True
- False
- The range of feasible values for the multiple coefficient of determination is from
- 0 to 1
- – 1 to + 1
- – 1 to 0
- None of the above
- In order to test the significance of a single independent variable, we use:
- t-test
- The overall F-test
- Adjusted R2
- All of the above
For Questions 25 and 26 use the table below. In the table are absolute differences in pairs of population means for x1, x2 and x3 and their associate critical ranges (CRs).
Absolute Means | Absolute Means | Critical Range (CR) |
1 – | | |5.0 – 7.0| = 2.0 | 1.67 |
1 – | |5.0 – 5.4| = 0.2 | 1.67 |
2 – | | |7.0 – 5.4| = 1.6 | 1.67 |
- From the table above and, using the decision rule for comparing pairs of population means, which of the following statement is correct?
- Population means for are different
- Population means for
- Population means for
- Neither of the above statements are correct
- From the above table, we can conclude that population means for pairs
- True
- False
- No matter how many groups are being compared, the F test from the one-way ANOVA uses only one significance test.
- True
- False
- Qualitative data cannot be incorporated into linear regression models.
- True
- False
- Which of the following iterative search procedures for model-building in a multiple regression analysis adds variables to model as it proceeds?
- a) Backward elimination
- b) Stepwise regression
- c) Forward selection
- d) All possible regressions
- A one-way ANOVA uses 5 factors/treatments and a total of 40 observations. This means there are 35 degrees of freedom (df) within group variation
- True
- False
- The ______ sum of squares measures the variability of the observed values around their respective treatment means.
- Factor/treatment
- Residual/error
- Interaction
- Total
- One-way ANOVA partitions the total variation into “between” and “within” groups.
- True
- False
- It is possible to test the effect of each factor in a two-way ANOVA
- True
- False
- The table below shows correlation coefficients for variables in a multiple regression analysis. If the correlation coefficient was the chosen criterion to build regression model using forward selection procedure. The fist variable to be selected is
y | x1 | x2 | x3 | x4 | x5 | |
y | 1 | |||||
x1 | 0.854168 | 1 | ||||
x2 | -0.11828 | -0.00383 | 1 | |||
x3 | -0.12003 | -0.08499 | -0.14523 | 1 | ||
x5 | -0.18105 | -0.07371 | 0.995886 | -0.14151 | -0.16934 | 1 |
- x1
- x2
- x3
- x4
- The table below shows correlation coefficients for variables in a multiple regression. The analysis reveals potential multicollinearity between which variables?
y | x1 | x2 | x3 | x4 | x5 | |
y | 1 | |||||
x1 | -0.0857 | 1 | ||||
x2 | -0.20246 | 0.868358 | 1 | |||
x3 | -0.22631 | -0.10604 | -0.14853 | 1 | ||
x4 | -0.28175 | -0.0685 | 0.41468 | -0.14151 | 1 | |
x5 | 0.271105 | 0.150796 | 0.129388 | -0.15243 | 0.00821 | 1 |
- x4and x5
- x1and x2
- x1 and x4
- x4and x3
- The null hypothesis for conducting a one-way ANOVA is that “not all the means are equal.”
- True
- False
- A two-way ANOVA examines the simultaneous effect that two main factors have on the observed data
- True
- False
For Questions 35 to 39, use the ANOVA summary table below
Source |
Sum of Squares (SS) |
Degrees of Freedom
(df) |
Mean Sum of Squares |
F |
Between | (a) | 4 | (b) | (e) |
Within | 60 | (c) | (d) | |
Total | 76 | 24 |
- The value of sum squares between (a) is:
- 76
- 16
- 60
- 24
- What is the total number of observations in this ANOVA analysis?
- 24
- 25
- 20
- None of the above
- The mean sum of squares between(MSB) as represented by (b) is:
- 15
- 19
- 4
- 6
- The mean sum squares within (MSW) shown as (d) is:
- 20
- 3
- 8
- Cannot be determined
- What is the value of F-calculated?
- 33
- 1
- 3
- None of the above
- The one-way ANOVA partitions the total variance into four components. These include variance attributable to (i) Factor A, (ii) Factor B, (iii) interaction (Factor A & B), and (iv) that which is unaccounted for.
- True
- False
- How is the degree of association between a set of independent variables and a dependent variable measured?
- Confidence intervals.
- Autocorrelation
- Coefficient of multiple determination
- Standard error of estimate
- The best example of a null hypothesis for testing an individual regression coefficient is:
- None of the above
- The Mean Square Error (MSE) is a biased estimator for the variance of the population, denoted by s2.
- True
- False
- In testing for interaction between two factors (A and B) under a two-way ANOVA, the null hypothesis statement reads — H0: Factors A and B do not interact. If we fail to reject the null hypothesis we can proceed testing Factors A and B.
- True
- False
- Suppose that in a multiple regression the F is significant, but none of the t-ratios for independent coefficients are significant. This means that:
- Multicollinearity may be present
- The regression is good
- Either a) or b)
- None of the above
- A factor in ANOVA describes the cause of variation in the data
- True
- False
- The process of deciding which independent variable should be part of the final regression is known as
- Model building
- Residual analysis
- Multicollinearity
- None of the above