Contributing

Contributing to Research Unit Tests

Research unit tests improve through community refinement. Anyone can contribute a new test or improve an existing one.

Read SPEC.md fully before writing a test.
Choose the right directory:
- core/ — proposed for the curated core set (reviewed by maintainers)
- community/{your-github-username}/ — your own tests, merged with minimal review
Create a file named {id}.md where id follows the convention {methodology}-{what-is-checked}.
Fill in all required frontmatter fields and body sections.
Run python scripts/validate.py and fix all errors.
Update registry.yaml with your test’s entry (or the CI will reject the PR).
Open a pull request. Title: add: {test-id}.

Econometrics and statistics papers are a source of new unit tests. The workflow:

Identify a paper that introduces a new diagnostic, estimator, or check (e.g., a new test for pre-trends, a new validity condition for IV).
Read the paper and identify the specific check it recommends.
Write a test file that operationalizes that check for agents.
Cite the paper in ## References.

Open an issue titled paper: {citation} to flag papers that need to be distilled. Others can claim them.

A test is accepted if it is:

Actionable: an agent following the “How to Check” instructions would reach the same verdict as a careful human reviewer ≥80% of the time.
Scoped: the test checks one thing, not many.
Referenced: claims about best practices are backed by citations.
Calibrated: the severity level matches the actual cost of failure.

Tests in core/ face higher scrutiny than community/. Core tests are intended to reflect near-consensus best practices.