Skip to main content

Population-Level Estimation

Population-Level Estimation (PLE) enables comparative effectiveness and safety research using causal inference methods applied to observational OMOP CDM data. PLE goes beyond descriptive analyses (characterization, incidence rates) to estimate the causal effect of a treatment on an outcome while accounting for confounding by indication.

Parthenon implements two major causal inference approaches: the New-User Cohort Design (CohortMethod) for between-patient comparisons and Self-Controlled Case Series (SCCS) for within-patient comparisons. Both are available as fully configurable analysis types with design forms, diagnostics, and result visualization.


New-User Cohort Design (CohortMethod)

The CohortMethod approach compares outcomes between a target cohort (treatment) and a comparator cohort (alternative treatment or no treatment) of new users, adjusting for baseline confounders.

Study Design Overview

Target Cohort:      New users of Drug A (with washout)
Comparator Cohort: New users of Drug B (with washout)
Outcome Cohort: Clinical event of interest (e.g., MI, stroke)

Index Outcome?
|<---- Time at Risk ----------->|
| |
Baseline covariates |
(pre-index features) Follow-up end

Creating a CohortMethod Analysis

  1. Navigate to Analyses and select the Estimations tab.
  2. Click New Analysis and select Estimation.
  3. Configure the design:

Cohort Selection:

SettingDescription
Target cohortTreatment cohort (e.g., new users of Drug A)
Comparator cohortAlternative treatment cohort (e.g., new users of Drug B)
Outcome cohortsOne or more outcomes to estimate effects for

Model Settings:

SettingDescriptionDefault
Model typeCox proportional hazards regressioncox
Time at risk startDays after index to begin observation0
Time at risk endDays after index/cohort end to stop observation0 (at cohort end)
End anchorWhether end offset is from cohort start or cohort endcohort end

Propensity Score Settings:

The propensity score (PS) is the estimated probability of receiving the target treatment given baseline covariates. Parthenon supports three PS-based adjustment methods:

MethodHow it worksConfiguration
PS MatchingMatch each target patient to one or more comparator patients with similar PSratio: match ratio (1:1, 1:4); caliper: maximum PS distance for a valid match (default 0.2)
PS StratificationDivide patients into strata (quintiles) by PS and compare within stratastrata: number of strata (default 5)
IPTWWeight each patient by inverse of their PS to create a pseudo-populationtrimming: truncate extreme PS values (default 0.05)
SettingDescriptionDefault
Enable PSWhether to use propensity score adjustmentEnabled
TrimmingRemove patients with extreme PS values (PS < trim or > 1-trim)0.05

Covariate Settings:

Covariates are the baseline features used to build the propensity score model. Parthenon uses large-scale regularized regression (LASSO) over thousands of covariates automatically extracted from the CDM.

SettingDescriptionDefault
DemographicsAge, gender, race, ethnicity, year of birthEnabled
Condition occurrenceAll condition diagnoses in the time windowEnabled
Drug exposureAll drug exposures in the time windowEnabled
Procedure occurrenceAll procedures in the time windowDisabled
MeasurementLab values and vitalsDisabled
ObservationClinical observationsDisabled
Time windowsLookback period for covariates[-365, 0]

Negative Control Outcomes:

Negative controls are outcomes with no plausible causal relationship to the exposure. They are used to calibrate effect estimates and detect residual systematic bias.

  • Add concept IDs for negative control outcomes (e.g., appendicitis, ingrown toenail for a cardiovascular drug study).
  • After execution, the negative control distribution is used to empirically calibrate the true outcome effect estimates.
Negative controls are essential

Including 50--100 negative control outcomes is considered best practice for OHDSI studies. If the negative control effect estimates cluster around HR = 1.0 with appropriate coverage probability, the analysis is well-calibrated. If negative controls show systematic bias (e.g., most HRs < 1.0), confounding adjustment is insufficient.


Self-Controlled Case Series (SCCS)

SCCS uses each patient as their own control, comparing event rates during exposed time windows to unexposed time within the same patient. This design inherently controls for all time-invariant confounders (genetics, chronic conditions, socioeconomic status) because each patient serves as their own baseline.

When to Use SCCS

SCCS is particularly suited for:

  • Vaccine safety studies: Comparing adverse event rates in the days following vaccination vs. baseline periods
  • Acute adverse event detection: Short-term risks after drug initiation
  • Drug-event associations: When the exposure is transient and the outcome is acute
  • Situations with strong unmeasured confounding: Since patients serve as their own controls, time-invariant confounders are eliminated by design

Creating an SCCS Analysis

  1. Navigate to Analyses and select the SCCS tab.
  2. Click New Analysis and select SCCS.
  3. Configure the design:

Cohort Selection:

SettingDescription
Exposure cohortThe drug or vaccine exposure to study
Outcome cohortThe adverse event or clinical event of interest

Risk Windows:

Risk windows define the time periods during which you hypothesize the exposure increases (or decreases) the event rate. Multiple risk windows can be defined:

SettingDescriptionDefault
StartDays after era start (or era end) to begin the risk window1
EndDays after era start (or era end) to end the risk window30
Start anchorera_start or era_endera_start
End anchorera_start or era_endera_start
LabelDescriptive name for this risk window"Risk window 1"

Example risk windows for a vaccine safety study:

  • Window 1: Days 1--7 after vaccination ("acute risk")
  • Window 2: Days 8--28 after vaccination ("sub-acute risk")
  • Window 3: Days 29--42 after vaccination ("extended risk")

Model Settings:

SettingDescriptionDefault
Model typesimple (standard SCCS) or with additional adjustmentssimple

Study Population:

SettingDescriptionDefault
Naive periodDays at the start of observation excluded from analysis180
First outcome onlyWhether to count only the first event per patientEnabled

Effect Estimates and Results

CohortMethod Results

After execution, the estimation results page shows:

MetricDescription
Hazard Ratio (HR)Estimated relative hazard of the outcome in the target vs. comparator cohort
95% Confidence IntervalPrecision of the HR estimate
P-valueStatistical significance (null hypothesis: HR = 1.0)
Target eventsNumber of outcome events in the target cohort
Comparator eventsNumber of outcome events in the comparator cohort
Target personsNumber of patients in the adjusted target cohort
Comparator personsNumber of patients in the adjusted comparator cohort

Interpretation Guide

HRInterpretation
HR = 1.0No difference between target and comparator
HR < 1.0Target treatment is associated with lower risk
HR > 1.0Target treatment is associated with higher risk
HR = 0.75 (0.65--0.86)25% risk reduction, statistically significant
HR = 1.42 (0.98--2.05)42% risk increase, not statistically significant

SCCS Results

SCCS produces an Incidence Rate Ratio (IRR) comparing event rates during risk windows to unexposed time:

IRRInterpretation
IRR = 1.0No change in event rate during risk window
IRR = 3.2 (2.1--4.8)3.2x higher event rate during exposure, significant
IRR = 0.8 (0.5--1.3)No significant change in event rate

Diagnostics

Proper interpretation of PLE results requires checking diagnostic outputs:

Propensity Score Diagnostics (CohortMethod)

  • PS distribution plot: Shows the overlap between target and comparator PS distributions. Good overlap (clinical equipoise) is required for valid estimation.
  • Covariate balance: Table of SMDs before and after PS adjustment. All SMDs should be < 0.1 after adjustment.
  • Preference score: A rescaled PS that accounts for different cohort sizes.

Negative Control Calibration

  • Null distribution: Plot of negative control effect estimates. Should center around 1.0.
  • Calibrated estimates: True outcome estimates adjusted for empirical bias observed in negative controls.
Check diagnostics before interpreting results

An HR that appears statistically significant may be entirely driven by residual confounding if diagnostics show poor covariate balance or biased negative controls. Always review diagnostics before drawing causal conclusions.


Evidence Synthesis

Parthenon includes an Evidence Synthesis analysis type (accessible from the Evidence Synthesis tab on the Analyses page) for combining PLE results across multiple data sources using meta-analytic methods.

Available Methods

MethodDescription
Fixed-effectsAssumes a common true effect across all sources; weights by precision
Random-effectsAllows the true effect to vary across sources; accounts for heterogeneity
BayesianBayesian hierarchical model with configurable chain length and burn-in

Configuration

SettingDescriptionDefault
EstimatesSelect individual PLE results to combine(required)
MethodMeta-analysis methodbayesian
Chain lengthMCMC chain length (Bayesian only)1,100,000
Burn-inMCMC burn-in iterations (Bayesian only)100,000
Sub-sampleMCMC thinning interval (Bayesian only)1,000

Results include a forest plot showing individual source estimates and the combined meta-analytic estimate with confidence/credible intervals.


Best Practices

  1. Use active comparators: Comparing Drug A to Drug B (active comparator) is far more robust than comparing Drug A to "no treatment." Active comparator designs reduce confounding by indication.

  2. Require new-user designs: Both target and comparator cohorts should be incident (new-user) cohorts with adequate washout periods to avoid prevalent-user bias.

  3. Include comprehensive covariates: Enable demographics, conditions, and drugs at minimum. The LASSO regularization handles the dimensionality --- more covariates generally improve confounding adjustment.

  4. Always use negative controls: 50--100 negative control outcomes provide the empirical calibration needed to assess and correct for residual systematic bias.

  5. Check all diagnostics before interpreting: PS overlap, covariate balance, and negative control calibration are not optional quality checks --- they are required for valid causal inference.

  6. Consider SCCS for self-controlled questions: When time-invariant confounding is a major concern and the exposure is transient, SCCS provides stronger causal evidence than CohortMethod.

  7. Synthesize across sources: Single-database results may reflect database-specific biases. Meta-analysis across multiple databases provides more robust evidence.

R runtime connection

PLE analyses are designed to leverage the OHDSI HADES R packages (CohortMethod, SelfControlledCaseSeries, EvidenceSynthesis) running in the R runtime container. The configuration UI is fully functional for specifying analysis designs. Execution against the R runtime is being connected. In the interim, designs can be exported as R-ready configuration objects for manual execution.