Module 1 – SL1
REGRESSION AND STATISTICAL EFFECTS
In this exercise, you’ll have a chance to practice some techniques and procedures related to regression analysis, using data (more or less) from the Darr and Johns (2003) academic politics study used as part of this module’s case.
Demographics
Age, Years in dept, Rank, Sex, salary, level, tch_rating, minority, admin_resp, Science, degree_tier, pub_prestige
Attitudes
Role ambiguity, Role conflict, Task conflict, Relationship conflict, Political perceptions
Respondent perceptions of responsibility in current crisis:
Fault, blame
codebook leads to a full codebook (Module 1 SLP Coding). You can find the data in SPSS format or Excel .
Please consult the Regression Presentation for specific guidance on how to run various tests.
For this exercise, we’ll be using the demographic variables. We’ll use the others later.
1. Open the excel file in SPSS and save it in SPSS format.
2. Prepare appropriate descriptive statistics – remember, means and standard deviations (and possibly histograms) for interval variables, frequency tables for categorical and dummy variables. Are there any things of interest in these descriptive statistics?
3. Construct three scatterplots looking at the relationships among some of the interval variables that interest you. What if anything do you discover this way?
4. Set up a regression model to predict salary (DV) from teaching ratings (IV). Be sure to request appropriate residuals plots. What are your results? What do you learn from the residuals plots? [HINT: here is a good guide on interpreting residuals plots .]
5. Since neither salary nor teaching ratings is particularly normally distributed (i.e., both are skewed), we might do better with the log transforms of these data (if you're not familiar with log transforms of data, here is a good general guide to the subject (Hopkins). Try your regression model with these transformed variables. Any better? How do you know?
6. Now let’s try a multiple predictor model. Use salary as DV, and Age, Years in dept, Sex, level, tch_rating, minority, admin_resp, Science, degree_tier, and pub_prestige as IV’s. How good is your prediction now? What are the best predictors? Anything in the residuals?
7. Your prediction is good, but it’s too complicated. See if you can reduce complexity by using a stepwise procedure on the same model. What is the efficiency of prediction of your final model here? What predictors are left? Which have been excluded? Of those left, which are best? Anything in the residuals?
8. Any overall comments on using regression techniques?
SLP Assignment Expectations
Your assignment will be assessed against the following expectations:
1. Your assignment should be 10-15 pages in total. Make sure you only report the important tables that support your answers to each question.
2. You are expected to show facility with regression analysis procedures and vocabulary, to use terminology correctly, and to correctly and concisely carry out the assignment steps.
3. It is expected that your assignments will be presented in good academic form, including tables formatted in approximate APA style rather than being simply pasted in from SPSS printout, statistical tests summarized in narrative text containing the information necessary, and appropriate analyses and comments included, not just a dump of the results of a test.
Notes:
13 pgs
REGRESSION AND STATISTICAL EFFECTS
Required Reading
Porter, A., Connolly, T., Heikes, R.G., & Park, C.Y. (1981). Misleading indicators: The limitations of multiple linear regression in formulation of policy recommendations . Policy Sciences, 13, 397-418.
Phau, I., & Teah, M. (2009). Devil wears (counterfeit) Prada: A study of antecedents and outcomes of attitudes towards counterfeits of luxury brands. The Journal of Consumer Marketing, 26(1), 15-27.
Evans. A.L. (2008). Portfolio manager ownership and mutual fund performance. Financial Management, 37(3), 513-535.
Banker, Rajiv; Hu, Nan; Pavlou, Paul A.; and Luftman, Jerry. 2011. "CIO Reporting Structure, Strategic Positioning, and Firm Performance," MIS Quarterly, (35: 2) pp.487-504.
Guzman, I. R., & Stanton, J. M. (2009). IT Occupational Culture: The Cultural Fit and Commitment of New Information Technologists. Information Technology & People, 22(2), 157-187.
Optional Reading
Decision 411 (n.d.). Interpreting plots of predicted values and residuals. Duke University. Retrieved from http://www.duke.edu/~rnau/regnotes.htm#plots
Hopkins, W. (2000). log transformation for better fits. A New View of Statistics. Retrieved from http://www.sportsci.org/resource/stats/logtrans.html
Lane, D. (n.d.). Prediction. HyperStat Online. Retrieved from http://davidmlane.com/hyperstat/prediction.html
Trochim, W. (2006). Regression. Research Methods Knowledge Base. Retrieved from http://www.socialresearchmethods.net/kb/genlin.php