Which Of The Following Is True Regarding The Sampling Distribution Of
Q1. Which of the following is true regarding the sampling distribution of the mean for a large sample size? a. It has the same shape, mean, and standard deviation as the population. b. It has a normal distribution with the same mean and standard deviation as the population. c. It has the same shape and mean as the population, but has a smaller standard deviation. d. It has a normal distribution with the same mean as the population but with a smaller standard deviation. Q2. According to a survey of American households, the probability that the residents own 2 cars if annual household income is over $25,000 is 80%. Of the households surveyed, 60% had incomes over $25,000 and 70% had 2 cars. The probability that annual household income is over $25,000 if the residents of a household do not own 2 cars is: a. 0.12 b. 0.18 c. 0.40 d. 0.55 Q3. The collection and summarization of the socioeconomic and physical characteristics of the employees of a particular firm is an example of: a. inferential statistics. b. descriptive statistics. c. a parameter. d. a statistic. Q4. The head librarian at the Library of Congress has asked her assistant for an interval estimate of the mean number of books checked out each day. The assistant provides the following interval estimate: from 740 to 920 books per day. If the head librarian knows that the population standard deviation is 150 books checked out per day, and she asked her assistant to use 25 days of data to construct the interval estimate, what confidence level can she attach to the interval estimate? a. 99.7% b. 99.0% c. 98.0% d. 95.4% Q5. Referring to the histogram, how many graduating seniors attended the luncheon? a. 4 b. 152 c. 275 d. 388 Q6. The following are the durations in minutes of a sample of long-distance phone calls made within the continental United States reported by one long-distance carrier. Time (in Minutes) Relative Frequency 0 but less than 5 0.37 5 but less than 10 0.22 10 but less than 15 0.15 15 but less than 20 0.10 20 but less than 25 0.07 25 but less than 30 0.07 30 or more 0.02 Referring to the table, if 10 calls lasted 30 minutes or more, how many calls lasted less than 5 minutes? a. 10 b. 185 c. 295 d. 500 Q7. The collection of all possible events is called: a. a simple probability. b. a sample space. c. a joint probability. d. the null set. Q8. A lab orders 100 rats a week for each of the 52 weeks in the year for experiments that the lab conducts. Prices for 100 rats follow the following distribution: Price: $10.00 $12.50 $15.00 Probability: 0.35 0.40 0.25 How much should the lab budget for next year's rat orders be, assuming this distribution does not change? a. $520 b. $637 c. $650 d. $780 Q9. A professor of economics at a small Texas university wanted to determine what year in school students were taking his tough economics course. Shown below is a pie chart of the results. What percentage of the class took the course prior to reaching their senior year? a. 14% b. 44% c. 54% d. 86% Q10. Since a _______ is not a randomly selected probability sample, there is no way to know how well it represents the overall population. a. simple random sample b. quota sample c. stratified sample d. cluster sample Q11. A study is under way in Yosemite National Forest to determine the adult height of American pine trees. Specifically, the study is attempting to determine what factors aid a tree in reaching heights greater than 60 feet tall. It is estimated that the forest contains 25,000 adult American pines. The study involves collecting heights from 250 randomly selected adult American pine trees and analyzing the results. Identify the population from which the study was sampled. a. The 250 randomly selected adult American pine trees in the forest. b. The 25,000 adult American pine trees in the forest. c. All the adult American pine trees taller than 60 feet in the world. d. All American pine trees, of any age, in the forest. Q12. A population frame for a survey contains a listing of 72,345 names. Using a table of random numbers, how many digits will the code numbers for each member of your population contain? a. 3 b. 4 c. 5 d. 6 Q13. The chancellor of a major university was concerned about alcohol abuse on her campus and wanted to find out the portion of students at her university who visited campus bars every weekend. Her advisor took a random sample of 250 students. The portion of students in the sample who visited campus bars every weekend is an example of a __________. a. categorical random variable b. discrete random variable c. parameter d. statistic Q14. The following are the durations in minutes of a sample of long-distance phone calls made within the continental United States reported by one long-distance carrier. Time (in Minutes) Relative Frequency 0 but less than 5 0.37 5 but less than 10 0.22 10 but less than 15 0.15 15 but less than 20 0.10 20 but less than 25 0.07 25 but less than 30 0.07 30 or more 0.02 Referring to Table 2-5, if 100 calls were randomly sampled, how many calls lasted 15 minutes or longer? a. 10 b. 14 c. 26 d. 74 Q15. If two events are mutually exclusive, what is the probability that one or the other occurs? a. 0 b. 0.50 c. 1.00 d. Cannot be determined from the information given. Q16. According to a survey of American households, the probability that the residents own 2 cars if annual household income is over $25,000 is 80%. Of the households surveyed, 60% had incomes over $25,000 and 70% had 2 cars. The probability that the residents of a household do not own 2 cars and have an income over $25,000 a year is: a. 0.12 b. 0.18 c. 0.22 d. 0.48 Q17. If two events are mutually exclusive and collectively exhaustive, what is the probability that one or the other occurs? a. 0. b. 0.50. c. 1.00. d. Cannot be determined from the information given. Q18. Why is the Central Limit Theorem so important to the study of sampling distributions? a. It allows us to disregard the size of the sample selected when the population is not normal. b. It allows us to disregard the shape of the sampling distribution when the size of the population is large. c. It allows us to disregard the size of the population we are sampling from. d. It allows us to disregard the shape of the population whennis large. Q19. A sample of 200 students at a Big-Ten university was taken after the midterm to ask them whether they went bar hopping the weekend before the midterm or spent the weekend studying, and whether they did well or poorly on the midterm. The following table contains the result. Did Well on Midterm Did Poorly on Midterm Studying for Exam 80 20 Went Bar Hopping 30 70 Referring to the table, of those who did well on the midterm in the sample, _______ percent of them went bar hopping the weekend before the midterm. a. 15 b. 27.27 c. 30 d. 50 Q20. Tim was planning for a meeting with his boss to discuss a raise in his annual salary. In preparation, he wanted to use the Consumer Price Index to determine the percentage increase in his salary in terms of real income over the last three years. Which method of data collection was involved when he used the Consumer Price Index? a. Published sources b. Experimentation c. Surveying d. Observation Q21. A survey was conducted to determine how people rated the quality of programming available on television. Respondents were asked to rate the overall quality from 0 (no quality at all) to 100 (extremely good quality). The stem-and-leaf display of the data is shown below. Stem Leaves 3 24 4 3478999 5 112345 6 12566 7 1 8 9 2 Referring to the table, what percentage of the respondents rated overall television quality with a rating between 50 and 75? a. 0.11 b. 0.40 c. 0.44 d. 0.56 Q22. The process of using sample statistics to draw conclusions about true population parameters is called: a. statistical inference. b. the scientific method. c. sampling. d. descriptive statistics. Q23. If two equally likely events A and B are mutually exclusive and collectively exhaustive, what is the probability that event A occurs? a. 0 b. 0.50 c. 1.00 d. Cannot be determined from the information given. Q24. Which of the following is most likely a parameter as opposed to a statistic? a. The average score of the first five students completing an assignment. b. The proportion of females registered to vote in a county. c. The average height of people randomly selected from a database. d. The proportion of trucks stopped yesterday that were cited for bad brakes. Q25. Selection of raffle tickets from a large bowl is an example of: a. sampling with replacement. b. sampling without replacement. c. subjective probability. d. None of the above. Q26. The portfolio expected return of two investments a. will be higher when the covariance is zero. b. will be higher when the covariance is negative. c. will be higher when the covariance is positive. d. does not depend on the covariance. Q27. Given the numbers: 1, 3, 5, 7, 8 what are the average and the median? a. Average = 4.8; Median = 5.0 b. Average = 5.0; Median = 5.0 c. Average = 4.8; Median = 4.8 d. Average = 5.0; Median = 4.8 Q28. The employees of a company were surveyed on questions regarding their educational background and marital status. Of the 600 employees, 400 had college degrees, 100 were single, and 60 were single college graduates. The probability that an employee of the company is single or has a college degree is: a. 0.10 b. 0.25 c. 0.667 d. 0.733 Q29. A catalog company that receives the majority of its orders by telephone conducted a study to determine how long customers were willing to wait on hold before ordering a product. The length of time was found to be a random variable best approximated by an exponential distribution with a mean equal to 3 minutes. What proportion of customers having to hold more than 1.5 minutes will hang up before placing an order? a. 0.86466 b. 0.60653 c. 0.39347 d. 0.13534 Q30. The following are the durations in minutes of a sample of long-distance phone calls made within the continental United States reported by one long-distance carrier. Time (in Minutes) Relative Frequency 0 but less than 5 0.37 5 but less than 10 0.22 10 but less than 15 0.15 15 but less than 20 0.10 20 but less than 25 0.07 25 but less than 30 0.07 30 or more 0.02 Referring to the table, what is the width of each class? a. 1 minute b. 5 minutes c. 2% d. 100% Q31. The probability that house sales will increase in the next 6 months is estimated to be 0.25. the interest rates on housing loans will go up in the same period is estimated to be 0.74 house sales or interest rates will go up during the next 6 months is estimated to be 0.89 The probability that house sales will increase but interest rates will not during the next 6 months is: a. 0.065 b. 0.15 c. 0.51 d. 0.89 Q32. Which of the mean, median, mode, and geometric mean are resistant measures of central tendency? a. The mean and median. b. The median and mode. c. The mode and geometric mean. d. The mean and mode. Q33. The probability that house sales will increase in the next 6 months is estimated to be 0.25 the interest rates on housing loans will go up in the same period is estimated to be 0.74 house sales or interest rates will go up during the next 6 months is estimated to be 0.89 The probability that neither house sales nor interest rates will increase during the next 6 months is: a. 0.11 b. 0.195 c. 0.89 d. 0.90 Q34. Which of the following statistics is not a measure of central tendency? a. Mean. b. Median. c. Mode. d. Q3. Q35. The width of each bar in a histogram corresponds to the: a. differences between the boundaries of the class. b. number of observations in each class. c. midpoint of each class. d. percentage of observations in each class. Q36. In left-skewed distributions, which of the following is the correct statement? a. The distance from Q1 to Q2 is smaller than the distance from Q2 to Q3. b. The distance from the smallest observation to Q1 is larger than the distance from Q3 to the largest observation. c. The distance from the smallest observation to Q2 is smaller than the distance from Q2 to the largest observation. d. The distance from Q1 to Q3 is twice the distance from Q1 to Q2. Q37. The Central Limit Theorem is important in statistics because: a. for a largen, it says the population is approximately normal. b. for any population, it says the sampling distribution of the sample mean is approximately normal, regardless of the sample size. c. for a largen, it says the sampling distribution of the sample mean is approximately normal, regardless of the shape of the population. d. for any sized sample, it says the sampling distribution of the sample mean is approximately normal. Q38. A sample of 200 students at a Big-Ten university was taken after the midterm to ask them whether they went bar hopping the weekend before the midterm or spent the weekend studying, and whether they did well or poorly on the midterm. The following table contains the result. Did Well on the midterm Did Poorly on Midterm Studying for Exam 80 20 Went Bar Hopping 30 70 Referring to the table, _______ percent of the students in the sample went bar hopping the weekend before the midterm and did well on the midterm. a. 15 b. 27.27 c. 30 d. 50 Q39. Which of the following statements about the median is not true? a. It is more affected by extreme values than the mean. b. It is a measure of central tendency. c. It is equal to Q2. d. It is equal to the mode in bell-shaped "normal" distributions. Q40. A confidence interval was used to estimate the proportion of statistics students that are females. A random sample of 72 statistics students generated the following 90% confidence interval: (0.438, 0.642). Based on the interval above, is the population proportion of females equal to 0.60? a. No, and we are 90% sure of it. b. No. The proportion is 54.17%. c. Maybe. 0.60 is a believable value of the population proportion based on the information above. Q1. The sample correlation coefficient between X and Y is 0.375. It has been found out that the p-value is 0.256 when testing H0: ρ = 0 against the two-sided alternative H1: ρ ≠ 0. To test H0: ρ = 0 against the one-sided alternative H1: ρ > 0 at a significance level of 0.193, the p-value is a. 0.256/2 b. 0.256 c. 1 - 0.256 d. 1 - 0.256/2 Q2. A researcher randomly sampled 30 graduates of an MBA program and recorded data concerning their starting salaries. Of primary interest to the researcher was the effect of gender on starting salaries. Analysis of the mean salaries of the females and males in the sample is given below. Size Mean Std Dev Females 18 48,266.7 13,577.63 Males 12 55,000 11,741.29 Std Error = 4,764.82 Means Diff = -6,733.3 Z = -1.4528 2-tailed p value = 0.1463 T = -1.4221 2-tailed p value = 0.1574 Referring to the table, the researcher was attempting to show statistically that the female MBA graduates have a significantly lower mean starting salary than the male MBA graduates. What assumptions were necessary to conduct this hypothesis test? a. Both populations of salaries (male and female) must have approximate normal distributions. b. The population variances are approximately equal. c. The samples were randomly and independently selected. d. All of the above assumptions were necessary. Q3. The Y-intercept (b0) represents the: a. predicted value of Y when X = 0. b. change in estimated average Y per unit change in X. c. predicted value of Y. d. variation around the sample regression line. Q4. A local real estate appraiser analyzed the sales prices of homes in 2 neighborhoods to the corresponding appraised values of the homes. The goal of the analysis was to compare the distribution of sale-to-appraised ratios from homes in the 2 neighborhoods. Random and independent samples were selected from the 2 neighborhoods from last year's homes sales, 8 from each of the 2 neighborhoods. Identify the nonparametric method that would be used to analyze the data. a. the Wilcoxon Signed-Ranks Test, using the test statistic Z b. the Wilcoxon Signed-Ranks Test, using the test statistic W c. the Wilcoxon Rank Sum Test, using the test statistic T1 d. the Wilcoxon Rank Sum Test, using the test statistic Z Q5. A survey claims that 9 out of 10 doctors recommend aspirin for their patients with headaches. To test this claim against the alternative that the actual proportion of doctors who recommend aspirin is less than 0.90, a random sample of 100 doctors results in 83 who indicate that they recommend aspirin. The value of the test statistic in this problem is approximately equal to: a. -4.12 b. -2.33 c. -1.86 d. -0.07 Q6. A realtor wants to compare the average sales-to-appraisal ratios of residential properties sold in four neighborhoods (W, X, Y, and Z). Four properties are randomly selected from each neighborhood and the ratios recorded for each, as shown below. W: 1.2, 1.1, 0.9, 0.4 X: 2.5, 2.1, 1.9, 1.6 Y: 1.0, 1.5, 1.1, 1.3 Z: 0.8, 1.3, 1.1, 0.7 Interpret the results of the analysis summarized in the following table: Source df SS MS F PR > F Neighborhoods 2.97 0.990 8.31 0.0260 Error 12 Total 4.40 Referring to the table, the within group mean squares is a. 0.119 b. 0.990 c. 1.109 d. 8.31 Q7. If a group of independent variables are not significant individually but are significant as a group at a specified level of significance, this is most likely due to: a. autocorrelation. b. the presence of dummy variables. c. the absence of dummy variables. d. collinearity. Q8. In a multiple regression model, the adjusted r2 a. cannot be negative. b. can sometimes be negative. c. can sometimes be greater than +1. d. has to fall between 0 and +1. Q9. Why would you use the Tukey-Kramer procedure? a. To test for normality. b. To test for homogeneity of variance. c. To test independence of errors. d. To test for differences in pairwise means. Q10. A campus researcher wanted to investigate the factors that affect visitor travel time in a complex, multilevel building on campus. Specifically, he wanted to determine whether different building signs (building maps versus wall signage) affect the total amount of time visitors require to reach their destination and whether that time depends on whether the starting location is inside or outside the building. Three subjects were assigned to each of the combinations of signs and starting locations, and travel time in seconds from beginning to destination was recorded. How should the data be analyzed? Starting Room Interior Exterior Wall Signs 141 224 119 339 238 139 Map 85 226 94 129 126 130 a. Completely randomized design b. Randomized block design c. 2 x 2 factorial design d. Kruskal-Wallis rank test Q11. As a business statistics project, a student examined the factors that determine parking meter rates throughout the campus and downtown area. The campus is a group of buildings located in the center of downtown, with an open central quadrangle. Data were collected for the price of parking per hour and the number of blocks to the quadrangle. In addition, two dummy variables were coded to indicate the location of the parking meter (See below). The population regression model hypothesized is Yi = ß0 + ß1x1i + ß2x2i + ß3x3i + ei where Y is the price per hour x1 is a numerical variable = the number of blocks to the quadrangle (Note that if x1 is less than 2, then the meter is on campus; if x1 is less than 3, then the meter is downtown) x2 is a dummy variable = 1 if inside downtown and off campus, 0 otherwise x3 is a dummy variable = 1 if outside downtown and off campus, 0 otherwise The following Excel results are obtained. Regression Statistics Multiple R 9.9659 R Square 0.9331 Adjusted R Square 0.9294 Standard Error 0.0327 Observations 58 ANOVA df SS MS F Signif F Regression 3 0.8094 0.2698 251.1995 1.0964E-31 Residual 54 0.0580 0.0010 Total 57 0.8675 Coeff StdError t Stat P-value Intercept 0.5118 0.0136 37.4675 2.4904 X1 -0.0045 0.0034 -1.3275 0.1898 X2 -0.2392 0.0123 -19.3942 5.3581E-26 X3 -0.0002 0.0123 -0.0214 0.9829 Referring to the tables, predict the meter rate per hour if one parks outside of downtown and off campus, 3 blocks from the quad. a. $-0.0139 b. $0.2589 c. $0.2604 d. $0.4981 Q12. If we use the chi-squared method of analysis to test for the differences among 4 proportions, the degrees of freedom are equal to: a. 3 b. 4 c. 5 d. 1 Q13. An economist is interested to see how consumption for an economy (in $ billions) is influenced by gross domestic product ($ billions) and aggregate price (consumer price index). The Microsoft Excel output of this regression is partially reproduced below. SUMMARY OUTPUT Regression Statistics Multiple R 0.991 R Square 0.982 Adjusted R Square 0.976 Standard Error 0.299 Observations 10 ANOVA df SS MS F Signif F Regression 2 33.4163 16.7082 186.325 0.0001 Residual 7 0.6277 0.0897 Total 9 34.0440 Coeff StdError t Stat P-value Intercept -0.0861 0.5674 -0.152 0.8837 GDP 0.7654 0.0574 13.340 0.0001 Price -0.0006 0.0028 -0.219 0.8330 Referring to the tables, one economy in the sample had an aggregate consumption level of $4 billion, a GDP of $6 billion, and an aggregate price level of 200. What is the residual for this data point? a. $4.39 billion b. $0.39 billion c. -$0.39 billion d. -$1.33 billion Q14. Testing for the existence of correlation is equivalent to a. testing for the existence of the slope (β1). b. testing for the existence of the Y-intercept (β0). c. the confidence interval estimate for predicting Y. d. testing for the existence of the slope (β10). Q15. As a business statistics project, a student examined the factors that determine parking meter rates throughout the campus and downtown area. The campus is a group of buildings located in the center of downtown, with an open central quadrangle. Data were collected for the price of parking per hour and the number of blocks to the quadrangle. In addition, two dummy variables were coded to indicate the location of the parking meter (See below). The population regression model hypothesized is Yi = ß0 + ß1x1i + ß2x2i + ß3x3i + ei where Y is the price per hour x1 is a numerical variable = the number of blocks to the quadrangle (Note that if x1 is less than 2, then the meter is on campus; if x1 is less than 3, then the meter is downtown) x2 is a dummy variable = 1 if inside downtown and off campus, 0 otherwise x3 is a dummy variable = 1 if outside downtown and off campus, 0 otherwise The following Excel results are obtained. Regression Statistics Multiple R 9.9659 R Square 0.9331 Adjusted R Square 0.9294 Standard Error 0.0327 Observations 58 ANOVA df SS MS F Signif F Regression 3 0.8094 0.2698 251.1995 1.0964E-31 Residual 54 0.0580 0.0010 Total 57 0.8675 Coeff StdError t Stat P-value Intercept 0.5118 0.0136 37.4675 2.4904 X1 -0.0045 0.0034 -1.3275 0.1898 X2 -0.2392 0.0123 -19.3942 5.3581E-26 X3 -0.0002 0.0123 -0.0214 0.9829 Referring to the tables, if one is already outside of downtown and off campus but decides to park an additional 3 blocks from the quadrangle, the estimated average parking meter rate will: a. decrease by 0.0045. b. decrease by 0.0135. c. decrease by 0.0139. d. decrease by 0.4979. Q16. The following EXCEL output contains the results of a test to determine if the proportions of satisfied guests at two resorts are the same or different. Hypothesized Difference0 Level of Significance 0.05 Group 1 Number of Successes163 Sample Size227 Group 2 Number of Successes154 Sample Size262 Group 1 Proportion0.718061674 Group 2 Proportion0.58778626 Difference in Two Proportions 0.130275414 Average Proportion0.648261759 Test Statistic3.00875353 Two-Tailed Test Lower Critical Value -1.959961082 Upper Critical Value 1.959961082 p-Value 0.002623357 Referring to the data above, if you want to test the claim that "Resort 1 (Group 1) has a higher proportion of satisfied guests compared to Resort 2 (Group 2)," the p-value of the test will be a. 0.00262 b. 0.00262/2 c. 2*(0.00262) d. 1 - (0.00262/2) Q17. A real estate builder wishes to determine how house size (House) is influenced by family income (Income), family size (Size), and education of the head of household (School). House size is measured in hundreds of square feet, income is measured in thousands of dollars, and education is in years. The builder randomly selected 50 families and ran the µltiple regression. Microsoft Excel output is provided below: SUMMARY OUTPUT Regression Statistics Multiple R 0.865 R Square 0.748 Adjusted R Square 0.726 Standard Error 5.195 Observations 50 ANOVA df SS MS F Signif F Regression 3605.7736 901.4434 0.0001 Residual 1214.2264 26.9828 Total 49 4820.0000 Coeff StdError t Stat P-value Intercept -1.6335 5.8078 -0.281 0.7798 Income 0.4485 0.1137 3.9545 0.0003 Size 4.2615 0.8062 5.286 0.0001 School -0.6517 0.4319 -1.509 0.1383 Referring to the tables, one individual in the sample had an annual income of $10,000, a family size of 1, and an education of 8 years. This individual owned a home with an area of 1,000 square feet (House = 10.00). What is the residual (in hundreds of square feet) for this data point? a. 8.10 b. 5.40 c. -5.40 d. -8.10 Q18. A manager of a product sales group believes the number of sales made by an employee (Y) depends on how many years that employee has been with the company (X1) and how he/she scored on a business aptitude test (X2). A random sample of 8 employees provides the following: Employee Y X1 X2 1 100 10 7 2 90 3 10 3 80 8 9 4 70 5 4 5 60 5 8 6 50 7 5 7 40 1 4 8 30 1 1 Referring to the table, for these data, what is the estimated coefficient for the variable representing years an employee has been with the company, b1? a. 0.998 b. 3.103 c. 4.698 d. 21.293 Q19. In testing for differences between the means of 2 independent populations, the null hypothesis is: a. H0: μ1 - μ2 = 2. b. H0: μ1 - μ2 = 0. c. H0: μ1 - μ2 > 0. d. H0: μ1 - μ2 [removed] F Subject visibility 1 1380.24 1380.24 4.26 0.043 Test taker success 1 1325.16 1325.16 4.09 0.050 Interaction 1 3385.80 3385.80 10.45 0.002 Error 36 11,664.00 324.00 Total 39 17,755.20 Referring to the table, in the context of this study, interpret the statement: "Subject visibility and test taker success interact." a. The difference between the mean feedback time for visible and nonvisible subjects depends on the success of the test taker. b. The difference between the mean feedback time for test takers scoring in the top 20% and bottom 20% depends on the visibility of the subject. c. The relationship between feedback time and subject visibility depends on the success of the test taker. d. All of the above are correct interpretations. Q25. A candy bar manufacturer is interested in trying to estimate how sales are influenced by the price of their product. To do this, the company randomly chooses 6 small cities and offers the candy bar at different prices. Using candy bar sales as the dependent variable, the company will conduct a simple linear regression on the data below: City Price ($) Sales River Falls 1.30 100 Hudson 1.60 90 Ellsworth 1.80 90 Prescott 2.00 40 Rock Elm 2.40 38 Stillwater 2.90 32 Referring to the table, what is the coefficient of correlation for these data? a. -0.8854 b. -0.7839 c. 0.7839 d. 0.8854 Q26. A ____________ is a numerical quantity computed from the data of a sample and is used in reaching a decision on whether or not to reject the null hypothesis. a. significance level b. critical value c. test statistic d. parameter Q27. An investment specialist claims that if one holds a portfolio that moves in opposite direction to the market index like the S&P 500, then it is possible to reduce the variability of the portfolio's return. In other words, one can create a portfolio with positive returns but less exposure to risk. A sample of 26 years of S&P 500 index and a portfolio consisting of stocks of private prisons, which are believed to be negatively related to the S&P 500 index, is collected. A regression analysis was performed by regressing the returns of the prison stocks portfolio (Y) on the returns of S&P 500 index (X) to prove that the private prisons stock portfolio is negatively related to the S&P 500 index at a 5% level of significance. The results are given in the following EXCEL output. Coefficients Standard Error T Stat P-value Intercept 4.866004258 0.35743609 13.61363441 8.7932E-13 S&P -0.502513506 0.071597152 -7.01862425 2.94942E-07 Referring to the table, which of the following will be a correct conclusion? a. We cannot reject the null hypothesis and, therefore, conclude that there is sufficient evidence to show that the prison stock portfolio and S&P 500 index are negatively related. b. We can reject the null hypothesis and, therefore, conclude that there is sufficient evidence to show that the prison stock portfolio and S&P 500 index are negatively related. c. We cannot reject the null hypothesis and, therefore, conclude that there is not sufficient evidence to show that the prison stock portfolio and S&P 500 index are negatively related. d. We can reject the null hypothesis and conclude that there is not sufficient evidence to show that the prison stock portfolio and S&P 500 index are negatively related. Q28. One criterion used to evaluate employees in the assembly section of a large factory is the number of defective pieces per 1,000 parts produced. The quality control department wants to find out whether there is a relationship between years of experience and defect rate. Since the job is repetitious, after the initial training period any improvement due to a learning effect might be offset by a loss of motivation. A defect rate is calculated for each worker in a yearly evaluation. The results for 100 workers are given in the table below. Years Since Training Period [removed] 16.919 b. Reject H0 if chi-square > 15.507 c. Reject H0 if chi-square > 11.143 d. Reject H0 if chi-square > 9.488 Q29. A campus researcher wanted to investigate the factors that affect visitor travel time in a complex, multilevel building on campus. Specifically, he wanted to determine whether different building signs (building maps versus wall signage) affect the total amount of time visitors require to reach their destination and whether that time depends on whether the starting location is inside or outside the building. Three subjects were assigned to each of the combinations of signs and starting locations, and travel time in seconds from beginning to destination was recorded. An Excel output of the appropriate analysis is given below: ANOVA Source of Variation SS df MS F P-value F crit Signs 14008.33 14008.33 0.11267 5.317645 Starting Location 12288 2.784395 0.13374 5.317645 Interaction 48 48 0.919506 5.317645 Within 35305.33 4413.167 Total 61649.67 11 Referring to the table, the within (error) degrees of freedom is a. 1 b. 4 c. 8 d. 11 Q30. If we wish to determine whether there is evidence that the proportion of successes is higher in group 1 than in group 2, the appropriate test to use is a. the Z test. b. the chi-squared test. c. the W test. d. the X test. Q31. A researcher randomly sampled 30 graduates of an MBA program and recorded data concerning their starting salaries. Of primary interest to the researcher was the effect of gender on starting salaries. Analysis of the mean salaries of the females and males in the sample is given below. Size Mean Std Dev Females 18 48,266.7 13,577.63 Males 12 55,000 11,741.29 Std Error = 4,764.82 Means Diff = -6,733.3 Z = -1.4528 2-tailed p value = 0.1463 T = -1.4221 2-tailed p value = 0.1574 Referring to the table, the researcher was attempting to show statistically that the female MBA graduates have a significantly lower mean starting salary than the male MBA graduates. From the analysis in the table, the correct test statistic is: a. 4,634.72 b. -1.4221 c. -1.4528 d. -6,733.33 Q32. If we are performing a two-tailed test of whether μ = 100, the probability of detecting a shift of the mean to 105 will be ________ the probability of detecting a shift of the mean to 110. a. less than b. greater than c. equal to d. not comparable to Q33. Parents complain that children read too few storybooks and watch too µch television nowadays. A survey of 1,000 children reveals the following information on average time spent watching TV and average time spent reading storybooks Average time spent reading storybooks Average time spent watching TV Less than 1 hour Between 1 and 2 hours More than 2 hours Less than 2 hours 90 85 130 More than 2 hours 655 32 8 Referring to the table, to test whether there is any relationship between average time spent watching TV and average time spent reading storybooks, the value of the measured test statistic is: a. -12.59 b. 1.61 c. 481.49 d. 1,368.06 Q34. A large national bank charges local companies for using their services. A bank official reported the results of a regression analysis designed to predict the bank's charges (Y) -- measured in dollars per month -- for services rendered to local companies. One independent variable used to predict service charge to a company is the company's sales revenue (X) -- measured in millions of dollars. Data for 21 companies who use the bank's services were used to fit the model: E(Y) = ß0 + ß1X The results of the simple linear regression are provided below. Y = -2,700+20X, syx = 65, two-tailed p value = 0.034 (for testing ß1) Referring to Table 13-1, interpret the p value for testing whether ß1 exceeds 0. a. There is sufficient evidence (at the α = 0.05) to conclude that sales revenue (X) is a useful linear predictor of service charge (Y). b. There is insufficient evidence (at the α = 0.10) to conclude that sales revenue (X) is a useful linear predictor of service charge (Y). c. Sales revenue (X) is a poor predictor of service charge (Y). d. For every $1 million increase in sales revenue, we expect a service charge to increase $0.034. Q35. A study published in the American Journal of Public Health was conducted to determine whether the use of seat belts in motor vehicles depends on ethnic status in San Diego County. A sample of 792 children treated for injuries sustained from motor vehicle accidents was obtained, and each child was classified according to (1) ethnic status (Hispanic or non-Hispanic) and (2) seat belt usage (worn or not worn) during the accident. The number of children in each category is given in the table below. Hispanic Non-Hispanic Seat belts worn 31 148 Seat belts not worn 283 330 Referring to the table, which test would be used to properly analyze the data in this experiment? a. chi-square test for independence in a two-way contingency table. b. chi-square test for equal proportions in a one-way table. c. ANOVA F test for interaction in a 2 x 2 factorial design. d. chi-square test for a 2 x 2 factorial design. Q36. The Journal of Business Venturing reported on the activities of entrepreneurs during the organization creation process. As part of a designed study, a total of 71 entrepreneurs were interviewed and divided into 3 groups: those that were successful in founding a new firm (n1 = 34), those still actively trying to establish a firm (n2 = 21), and those who tried to start a new firm but eventually gave up (n3 = 16). The total number of activities undertaken (e.g., developed a business plan, sought funding, looked for facilities) by each group over a specified time period during organization creation was measured. The objective is to compare the mean number of activities of the 3 groups of entrepreneurs. Because of concerns over necessary assumption of the parametric analysis, it was decided to use a nonparametric analysis. Identify the nonparametric method that would be used to analyze the data. a. Wilcoxon Rank Sums Test b. Wilcoxon Signed Rank Test c. Kruskal-Wallis Rank Test for Differences in Medians d. One-way ANOVA F test Q37. A researcher randomly sampled 30 graduates of an MBA program and recorded data concerning their starting salaries. Of primary interest to the researcher was the effect of gender on starting salaries. Analysis of the mean salaries of the females and males in the sample is given below. Size Mean Std Dev Females 18 48,266.7 13,577.63 Males 12 55,000 11,741.29 Std Error = 4,764.82 Means Diff = -6,733.3 Z = -1.4528 2-tailed p value = 0.1463 T = -1.4221 2-tailed p value = 0.1574 Referring to the table, the researcher was attempting to show statistically that the female MBA graduates have a significantly lower mean starting salary than the male MBA graduates. According to the test run, which of the following is an appropriate alternative hypothesis? a. H1: μfemales > μmales b. H1: μfemales < μmales c. H1: μfemales ≠ μmales d. H1: μfemales = μmales Q38. A computer used by a 24-hour banking service is supposed to randomly assign each transaction to one of 5 memory locations. A check at the end of a day's transactions gave the counts shown in the table to each of the 5 memory locations, along with the number of reported errors. Memory Location: 1 2 3 4 5 Number of Transactions: 82 100 74 92 102 Number of Reported Errors 11 12 6 9 10 The bank manager wanted to test whether the proportion of errors in transactions assigned to each of the 5 memory locations differ. Referring to the table, which test would be used to properly analyze the data in this experiment? a. chi-square test for independence in a two-way contingency table b. chi-square test for equal proportions in a one-way table c. ANOVA F test for main treatment effect d. Z test for the difference in two proportions Q39. If a test of hypothesis has a Type I error probability (α) of 0.01, we mean a. if the null hypothesis is true, we don't reject it 1% of the time. b. if the null hypothesis is true, we reject it 1% of the time. c. if the null hypothesis is false, we don't reject it 1% of the time. d. if the null hypothesis is false, we reject it 1% of the time. Q40. The following EXCEL output contains the results of a test to determine if the proportions of satisfied guests at two resorts are the same or different. Hypothesized Difference 0 Level of Significance 0.05 Group 1 Number of Successes 163 Sample Size 227 Group 2 Number of Successes 154 Sample Size 262 Group 1 Proportion 0.718061674 Group 2 Proportion 0.58778626 Difference in Two Proportions 0.130275414 Average Proportion 0.648261759 Test Statistic 3.00875353 Two-Tailed Test Lower Critical Value -1.959961082 Upper Critical Value 1.959961082 p-Value 0.002623357 Referring to the table, if you want to test the claim that "Resort 1 (Group 1) has a higher proportion of satisfied guests compared to Resort 2 (Group 2)," the p-value of the test will be a. 0.00262 b. 0.00262/2 c. 2*(0.00262) d. 1 - (0.00262/2)