FIND DATA: By Author | Journal | Sites   ANALYZE DATA: Help with R | SPSS | Stata | Excel   WHAT'S NEW? US Politics | Int'l Relations | Law & Courts
   FIND DATA: By Author | Journal | Sites   WHAT'S NEW? US Politics | IR | Law & Courts
If this link is broken, please report as broken. You can also submit updates (will be reviewed).
Insights from the Field

How Party Labels Improve Polarization Measurement in UK Commons Speeches (1811โ€“2015)


polarization
text analysis
supervised methods
House of Commons
party affiliation
Methodology
Pol. An.
1 R files
9 other files
Dataverse
Measuring Polarisation With Text Analysis: Evidence from the UK House of Commons, 1811-2015 was authored by Niels D. Goet. It was published by Cambridge in Pol. An. in 2019.

๐Ÿงพ What Was Studied

Political text methods are used to estimate ideology and polarization from speech, but unsupervised approaches often assume that the dominant variation in text equals the quantity of interest. This assumption frequently fails in real speech data. The paper assesses how supervised approaches that incorporate party affiliation perform compared with unsupervised models when measuring polarization in parliamentary speech.

๐Ÿ”Ž How Measurement Was Evaluated

A validation framework is introduced to compare supervised and unsupervised text-scaling methods directly. The framework is applied to a very large historical corpus to test whether including party information yields more meaningful polarization estimates.

๐Ÿ“š Data and Scope

  • 6.2 million records of parliamentary speeches from the UK House of Commons
  • Time period covered: 1811โ€“2015
  • Several adjustments to existing estimation techniques were implemented before comparison

๐Ÿ“ˆ Key Findings

  • Unsupervised methods often fail because the strongest sources of textual variation are unrelated to the target concept (polarization)
  • Supervised approaches that include party affiliation produce more interpretable and meaningful measures of polarization in speech data
  • The validation framework makes it possible to assess when supervised methods are necessary and how much improvement they deliver over unsupervised alternatives

๐Ÿ’ก Why It Matters

  • Demonstrates crucial limits of unsupervised text analysis for speech and provides a practical alternative
  • Offers a reproducible way to evaluate text-scaling choices and to justify the use of party information in polarization measurement
  • Contributes methodologically by outlining the specific challenges of speech-based unsupervised estimation and by proposing concrete adjustments and validation steps for more reliable inference
data
Find on Google Scholar
Find on JSTOR
Find on CUP
Political Analysis
Podcast host Ryan