Problem Description:
In this Statistical Analysis homework, we explore two different regression analysis scenarios: Multiple Regression and Logistic Regression. The first part aims to understand the predictors of depression levels among low-income women, while the second part investigates whether smoking status predicts health status after controlling for age and BMI. Below are the details and results for each part:
Part 1: Multiple Regression Analysis
- Research Question: What variables predict the level of depression among low-income women?
- Variables:
- CESD: Depression Score
- AGE: Chronological Age
- EDUCATN: Educational Attainment
- INCOME: Family Income Prior Month
- WORKNOW: Current Employment Status
- SF12PHYS: SF-12 Physical Health Component Score
- SF12MENT: SF-12 Mental Health Component Score
- Analysis: Multiple regression analysis (with 'exclude cases PAIRWISE option').
- Results:
Table 1: Regression Model Predicting Depression in Low-Income Women
Predictor Variable | b | Standard Error | Beta | t | p-value |
---|---|---|---|---|---|
Age | 0.01 | 0.05 | 0.01 | 0.19 | 0.85 |
Education | -1.19 | 0.53 | -0.06 | -2.23 | 0.03 |
Income | 0.00 | 0.00 | -0.08 | -2.71 | 0.01 |
Employment Status | -1.45 | 0.66 | -0.06 | -2.19 | 0.03 |
Physical Health | -0.15 | 0.03 | -0.14 | -5.17 | 0.00 |
Mental Health | -0.66 | 0.03 | -0.61 | -22.69 | 0.00 |
- R2 = 0.465
- Adjusted R2 = 0.461
Summary:
In this sample of low-income women, five of the six predictor variables examined were significant predictors of depression levels (F = 114.7, p < 0.001). Education (Beta = -0.06, p = 0.03), income (Beta = -0.08, p = 0.01), employment status (Beta = -0.06, p = 0.03), physical health score (Beta = -0.14, p < 0.01), and mental health score (Beta = -0.61, p < 0.01) were significant predictors of women’s depression levels. Higher education, higher income, being employed, and having better physical and mental health were associated with lower levels of depression. Age was not a significant predictor in this sample. Women's mental health score was the strongest predictor of depression. Slightly less than half of the variance (adjusted R2 = 0.461, p < 0.01) in depression levels was explained by this set of predictor variables.
Part 2: Logistic Regression Analysis
- Research Question: Does smoking status predict health status, after controlling for age and BMI?
- Variables:
- Dependent Variable: HEALTH (0 = fair to poor health, 1 = good to excellent health)
- Independent Variables: SMOKER (current smoker or not), BMI (respondent's BMI), AGE (chronological age)
- Results:
- Cases Included: 933
- Null Model (Block 0): 70.4% correctly classified.
- Full Model (Block 1): 71.8% correctly classified (improved when 3 variables were included).
- Misclassified Cases: 263
- Classification Table:
- Predicted to be in fair/poor health but were observed in good/excellent health: 20 cases
- Predicted to be in good/excellent health but were observed in fair/poor health: 243 cases
- Most common misclassification: Women in poor health classified as good/excellent health.
Interpretation:
The logistic regression analysis aimed to determine if smoking predicted health status, after controlling for age and BMI. The results showed that the odds ratio for smoking was 0.55 (95% CI: [0.40, 0.74]), indicating a 45% reduction in the odds of being in good/excellent health for smokers. This reduction ranged from 60% at worst to 26% at best within the 95% CI, demonstrating statistical significance. Among the three predictor variables, smoking had the weakest effect on health status, with BMI (OR = 0.96) and AGE (OR = 0.92) having smaller effects. Each unit increase in BMI was associated with a 4% reduction in the odds of being in good health, while each year of advancing age resulted in an 8% reduction in the odds of being in good health.