FIND DATA: By Author | Journal | Sites   ANALYZE DATA: Help with R | SPSS | Stata | Excel   WHAT'S NEW? US Politics | Int'l Relations | Law & Courts
   FIND DATA: By Author | Journal | Sites   WHAT'S NEW? US Politics | IR | Law & Courts
If this link is broken, please report as broken. You can also submit updates (will be reviewed).
Dropped Groups Explain Why Logit and OLS Diverge
Insights from the Field
fixed effects
grouped data
logit
OLS
binary outcome
Methodology
Pol. An.
1 R files
1 Stata files
4 text files
1 datasets
Dataverse
Estimating Grouped Data Models With a Binary Dependent Variable and Fixed Effect Via Logit vs OLS: The Impact of Dropped Units was authored by Nathaniel Beck. It was published by Cambridge in Pol. An. in 2020.

🔍 What’s the problem?

When estimating grouped data with a binary dependent variable and fixed effects, linear (OLS) and logit specifications can produce different results. The apparent reason is that logit excludes groups where the outcome is all zeros or all ones, while the linear specification retains all groups.

📊 How each model handles homogeneous groups

  • The linear specification averages slope estimates across all groups, including homogeneous-outcome groups.
  • Homogeneous groups (all zeros or all ones) have slope coefficients of zero by definition.
  • As a result, the linear estimate is a weighted average of zero slopes from homogeneous groups and the nonzero slopes from groups with mixed outcomes.

âś… What to compare for a fair test

  • The correct apples-to-apples comparison between linear and logit estimates restricts analysis to groups that exhibit within-group variation in the dependent variable (i.e., groups with some zeros and some ones).

đź§ľ Reporting recommendations for researchers

  • Report OLS results using all groups and also report OLS results limited to groups where the dependent variable varies.
  • Report logit results alongside the restricted OLS results to show whether differences stem from dropped groups or from functional-form differences.

⚠️ Why this matters

  • Differences between OLS and logit estimates may reflect which groups are included, not only differences in model form.
  • The substantive interpretation of any difference between the full-sample and restricted-sample results depends on assumptions about the excluded homogeneous groups—assumptions that cannot be verified empirically.
data
Find on Google Scholar
Find on JSTOR
Find on CUP
Political Analysis
Podcast host Ryan