
🔧 How the split-sample procedure works
Researchers send their dataset to an independent third party that randomly creates a training sample and a withheld testing sample. All model building, hypothesis selection, and revisions occur using the training sample, allowing feedback from colleagues, editors, and referees. Once the paper is accepted, the pre-specified analysis is applied to the testing sample, and those testing-sample results are the ones published.
📊 What the simulations show
⚖️ When this approach is most and least appropriate
🔍 How to interpret the method
🛠️ Practical considerations for implementation
Why it matters
This split-sample protocol offers a pragmatic middle ground between exploratory work and strict preanalysis plans: it preserves opportunities for refinement and feedback while producing published results that come from an independent test, improving credibility and—under many realistic conditions—statistical power.

| Using Split Samples to Improve Inference on Causal Effects was authored by Marcel Fachamps and Julien Labonne. It was published by Cambridge in Pol. An. in 2017. |
