Methodology

Methodology

Status: LOCKED. Date locked: 2026-04-24.

This document defines how every intervention page is produced. It is written before any specific intervention is assessed so that future-me cannot move goalposts to fit a preferred conclusion. Changes are logged as commits to this repository with explicit "methodology revision" reasoning, and force re-review of every existing page.


1. Scope

This project synthesizes the evidence for healthspan and lifespan interventions in:

Endpoints we evaluate: median lifespan, maximum lifespan, healthspan markers (frailty, grip strength, gait, cognition), validated biomarkers of aging (epigenetic clocks, GlycanAge, etc.), all-cause mortality where available.

Endpoints we do not evaluate: subjective wellness, single-domain outcomes (e.g., "improves sleep") unless tied to an aging mechanism, marketing claims.

We do not issue supplement or treatment recommendations. The output is a calibration aid for readers who already plan to make their own decisions.


2. Evidence tiers

Every claim is anchored to one of these tiers. Higher tiers dominate lower tiers in conflict.

Tier Description
T1 Human RCT, pre-registered, n ≥ 100, hard endpoint (mortality, validated aging biomarker), independently replicated
T2 Human RCT, pre-registered, single trial OR surrogate endpoint OR n < 100; OR large prospective cohort with strong confounder control
T3 ITP-replicated mouse lifespan result (any sex), or RP2-replicated, or ≥ 2 independent labs with consistent direction in mice
T4 Single-lab mouse lifespan extension, or NHP biomarker improvement, or healthspan-only mouse data
T5 Invertebrate lifespan extension (C. elegans, fly, yeast); mechanistic plausibility only
T0 Anecdote, n-of-1, uncontrolled human observation, biohacker self-report, in vitro cell line only

Cross-species translation discount: results in tier T5 do not transfer to mammals without independent confirmation. T4 → human translation is discounted by 1 tier when applied to human verdict.


3. Verdict bands

Every intervention receives one of:

A verdict band must be defensible against the calibration anchors. If a new intervention I'm rating "Probable" has weaker evidence than rapamycin (the anchor for Probable), I have miscalibrated.


4. Required fields per intervention page

Every page must populate ALL of the following. Missing data is recorded as "no data found" — never silently omitted.

  1. TL;DR verdict (one sentence + verdict band)
  2. What it is (chemical class, dose ranges, route)
  3. Proposed mechanism with confidence level (established / plausible / hypothetical)
  4. Evidence ladder — separate subsections for invertebrate, mouse, NHP, human; each with study count, effect size range, replication status
  5. Sex, strain, dose dependence — required for mouse data; if not specified in the literature, that is itself a flag
  6. Confounds — control diet adequacy, baseline mortality of the strain (short-lived strains inflate apparent gains), publication bias signal
  7. Conflict of interest scan — industry funding, author equity, supplement industry ties
  8. Human translation — what RCTs exist, what they actually measured, what they actually showed
  9. Calibrated verdict — band + 2-3 sentence rationale + comparison to nearest calibration anchor
  10. Confidence interval on verdict — "could plausibly move to X if Y replicates"
  11. Open questions — things that would change the verdict if resolved
  12. Last reviewed — date stamp
  13. Sources — every citation with link if available

5. Conflict-of-interest discounts


6. Replication standards


7. Verdict change protocol

Verdicts can move. The thresholds:

This is the anti-drift mechanism. If I find myself wanting to upgrade NMN because a new podcast made a compelling case, the methodology says no without new evidence at the right tier.


8. When the methodology is silent

Some questions cannot be resolved from these rules alone (e.g., "is pre-print evidence admissible?", "how do we handle a retracted paper that's been re-published?"). Decision protocol:

  1. First pass: triangulate from the published positions of the calibration authorities — Matt Kaeberlein, the ITP team, Cochrane reviewers if relevant. What would they do?
  2. Second pass: open a GitHub issue on this repository for broader review.
  3. Third pass: leave the question open; flag it explicitly on the affected intervention page.

Never silently invent a rule. If a new rule is needed, it goes through the methodology-revision process (section header).


9. What this methodology deliberately excludes


10. Methodology revision log

(none yet — methodology locked 2026-04-24)