Aging
Interventions: Calibrated Verdict Index
A continuously updated, evidence-weighted map of aging interventions.
Verdicts follow a locked methodology (methodology.md) and
are triangulated against five calibration anchors
(CALIBRATION_ANCHORS.md).
Last index update: 2026-04-25. Total pages:
42 (38 interventions + 4 interactions).
๐ Paper: Where the
Longevity Discourse is Systematically Wrong โ synthesis paper
covering the methodology + the eight findings the popular discourse gets
wrong + what deserves more attention. HTML
version ยท Machine-readable verdict
table (CSV)
Verdicts use these bands (lowest evidence โ highest):
- Insufficient evidence โ too little data to form a
verdict
- Mostly hype โ popular intervention; tested
rigorously and failed, or never had higher-tier support
- Mixed โ tier-appropriate evidence exists but
replication has failed or population dependence is severe
- Suggestive โ T3-T4 evidence with replication; human
data absent or null/early
- Probable โ T3 evidence with at least suggestive
human data, OR T1 evidence in defined population
- Strong โ T1 / massive cohort + RCT evidence on hard
endpoints; mechanism understood
If you do nothing else from this database, do exercise
(aerobic + resistance), prioritize sleep
(~7h), and do caloric moderation without specific
timing fixation. These represent the bulk of the
population-level longevity benefit.
Strong (in humans)
The interventions with the most overwhelming evidence base. These are
the non-negotiables.
- Aerobic
exercise โ anchor; 20-40% all-cause mortality reduction at
moderate doses across millions of person-years.
- Resistance
training โ 10-20% all-cause mortality + unique sarcopenia /
functional-capacity protection.
- Sleep โ Strong
observational / Probable causal; ~7 hours nightly is the population
optimum.
- Statins (secondary
prevention) โ overwhelming evidence in established
CVD.
Strong (in mice
โ anchor band for animal evidence)
- Caloric
restriction โ anchor for mouse-Strong; Suggestive in humans
(CALERIE-2 biomarker results, no mortality data).
Probable
Solid evidence at the relevant evidence tier and population.
Suggestive
Real biology, replication or human translation incomplete.
Mixed
Tier-appropriate evidence but replication failures or severe
population dependence.
- Metformin
โ ITP null in mice; observational human data confounded; MET-PREVENT
2025 shows exercise blunting.
- Time-restricted
eating โ Liu 2022 NEJM showed null vs matched-calorie
comparator.
- Vitamin D
โ VITAL primary null; modest cancer-mortality signal real; hip-fracture
signal in women.
- Hyperbaric
oxygen โ single-group + COI; replication missing.
- Statins (>75 healthy
primary) โ USPSTF: insufficient evidence; STAREE
pending.
Mostly hype
Popular interventions where the evidence has failed or never reached
higher tiers.
Insufficient evidence
Not enough data to form a verdict, generally because the field is
preclinical-only.
Interactions /
Combinations / Antagonisms
Synergies and important antagonisms between interventions.
How to use this index
- For decision-making: prioritize Strong-band
interventions; treat Probable as supportable; treat Suggestive as worth
tracking but not committing to; treat Mostly hype as supplement-industry
signal rather than evidence.
- For staying current: verdict changes are logged in
the repository commit history. Open issues on the repo for new evidence
or contested verdicts.
- For the methodology behind verdicts: read
methodology.md and CALIBRATION_ANCHORS.md.
Both are locked; revisions require explicit changelog entries.
- For unresolved questions: open a GitHub issue.
What this database is not
- Not personal medical advice. Verdicts are population-level evidence
summaries.
- Not exhaustive. New interventions and verdict updates appear via
commits to the repository.
- Not free of judgment calls โ every verdict is a calibration claim
defensible against the methodology and anchors. Disagreements should be
raised via the question-resolver process, not silent edits.