Research Unit Tests

Research Unit Tests

Structured quality checks for academic research papers — analogous to unit tests in software engineering.

Tests range from deterministic checks — does the replication package run? — to judgment calls — is the contribution interesting? Each test specifies what to check, how an agent should reason about it, and what constitutes a pass.

29 tests · Quick Start · GitHub

Difference-in-Differences (3)

TestSeverityClarityScope
DiD: Pre-trends visualization shown and plausibleblockerheuristicpaper
DiD: Placebo/falsification test reportedwarningheuristicpaper
DiD: Staggered adoption uses heterogeneity-robust estimatorblockerheuristicpaper

Regression Discontinuity (3)

TestSeverityClarityScope
RDD: Estimates robust to bandwidth choiceblockerheuristicpaper
RDD: Pre-determined covariates smooth at cutoffblockerheuristicpaper
RDD: No manipulation of running variable (density test)blockerheuristicpaper

Instrumental Variables (3)

TestSeverityClarityScope
IV: Exclusion restriction explicitly arguedblockerjudgmentpaper, proposal
IV: First-stage F-statistic reported and sufficientblockerdeterministicpaper
IV: Reduced form reported alongside IV estimateswarningdeterministicpaper

Synthetic Control (3)

TestSeverityClarityScope
Synth: Donor pool selection justifiedblockerjudgmentpaper
Synth: In-space and/or in-time placebo tests reportedblockerheuristicpaper
Synth: Single-unit design limitations acknowledgedwarningheuristicpaper

Lab & Online Experiments (3)

TestSeverityClarityScope
Experiment: Attrition and differential attrition testedblockerheuristicpaper
Experiment: Baseline covariate balance table reportedblockerdeterministicpaper
Experiment: Power calculation reported or MDE statedwarningheuristicpaper, proposal

Field Experiments (4)

TestSeverityClarityScope
Field experiment: Spillover effects addressedblockerjudgmentpaper
Experiment: Attrition and differential attrition testedblockerheuristicpaper
Experiment: Baseline covariate balance table reportedblockerdeterministicpaper
Experiment: Power calculation reported or MDE statedwarningheuristicpaper, proposal