How Propagate collects, verifies, and labels data. Every claim in this dataset is either directly documented with a primary source or explicitly labeled as analytical inference. There is no middle ground.
Every bill record in Propagate must be verified against at least one primary source before inclusion. "Primary source" means an official government database, official legislature website, or a direct legal analysis citing the official record.
The verified_by field records the specific sources used to confirm each bill record. The verification_date records when the verification was last performed. Contributors must update both fields when correcting or updating a record.
Propagate distinguishes between two categories of claims: documented fact (directly sourced from a primary record) and analytical inference (derived from patterns, analogies, or indirect evidence). Both are valuable — but they must never be conflated.
Directly sourced from a primary record. Can be independently verified by following the cited URL.
Derived from patterns, analogies, or indirect evidence. Cannot be independently verified from a single primary source.
The opposition_salience field is particularly susceptible to inference because formal opposition testimony records are not always publicly accessible. The following rules apply:
inference_flag = falseinference_flag = trueinference_flag = trueEvery bill record includes three verification fields that document the provenance of the data:
verified_byFree-text description of the sources used to verify this record. Should name specific databases and URLs.
verification_dateDate the record was last verified against primary sources. Must be updated whenever the record is modified.
verification_notesOptional extended notes about the verification process, corrections made, or fields that still require verification.
Propagate is a static, open-source dataset. Contributions are made via GitHub Pull Request against the canonical YAML files. This model ensures all changes are peer-reviewed, versioned, and auditable.
Propagate is a research tool, not a comprehensive legislative database. The following limitations are known and should be considered when using this data:
The ADMT family currently tracks 7 bills across 5 states. Many states with ADMT activity are not yet represented. 'No activity' on the Diffusion Map means 'no activity in this dataset,' not 'no activity in that state.'
4 of 7 ADMT bill records have inference_flag=true for opposition_salience. Formal opposition testimony records are not always publicly accessible. Scores should be treated as hypotheses pending primary source verification.
Primary sponsors are verified; cosponsor lists may be incomplete for some bills. The verification_notes field documents which fields still require additional verification.
The genealogy tree represents Propagate's analytical judgment about bill relationships, not an official categorization. 'Direct model' vs. 'parallel development' distinctions involve interpretive judgment.
Propagate is updated via GitHub PR. Bills that change status (e.g., a vetoed bill that is re-introduced) will not be reflected until a contributor submits a PR with the update.