Skip to main content

Appendix B — OMOP Domains

The OMOP CDM v5.4 organizes clinical data into domains, each stored in one or more tables. Understanding domains is essential for building correct cohort criteria, analyses, and for mapping extended data types (genomics, imaging) into the standardized model.

Domain Reference

DomainPrimary TableStandard VocabularyDescription
Conditioncondition_occurrenceSNOMED CTDiagnoses, problems, clinical findings
Drugdrug_exposureRxNormDrug prescriptions, dispensings, administrations
MeasurementmeasurementLOINCLab tests, vital signs, clinical assessments with numeric or categorical values
Procedureprocedure_occurrenceSNOMED CT, CPT4, HCPCSMedical procedures performed on patients
ObservationobservationSNOMED CT, LOINCClinical facts, survey responses, social determinants of health
Visitvisit_occurrenceOMOP VisitHealthcare encounters (inpatient, outpatient, ED, telehealth)
Visit Detailvisit_detailOMOP VisitSub-encounters within a visit (e.g. individual department stays)
Devicedevice_exposureSNOMED CTMedical devices used (implants, prosthetics, monitors)
DeathdeathSNOMED CT, ICD10CMCause of death and mortality status
SpecimenspecimenSNOMED CTBiological specimens collected
Notenote---Unstructured clinical notes (text)
Note NLPnote_nlp---NLP-extracted entities from clinical notes

Genomics and the Measurement Domain

Parthenon's Genomics module maps genomic variant data into the OMOP CDM using the Measurement domain. This follows the OHDSI Genomic CDM Working Group conventions:

Genomic Data ElementOMOP MappingDetails
Genetic variant (SNV, indel)measurement rowmeasurement_concept_id maps to a genomic measurement concept; value_as_concept_id encodes the variant allele
Gene namemeasurement.measurement_source_valueHUGO gene symbol (e.g., BRCA1, TP53) stored as source value
Variant classificationmeasurement.value_as_concept_idPathogenic, Likely Pathogenic, VUS, Benign, etc. mapped to OMOP concepts
Zygositymeasurement.qualifier_concept_idHeterozygous, Homozygous, Hemizygous
Allele frequencymeasurement.value_as_numberVariant allele frequency (0.0 -- 1.0)
VCF to OMOP Pipeline

When you upload a VCF file in the Genomics module, Parthenon parses variants and creates corresponding measurement rows linked to the patient via person_id. ClinVar annotations are stored as additional measurement attributes. This allows genomic variants to participate in standard OMOP cohort definitions and analyses alongside clinical data.

Imaging and the Procedure Domain

DICOM imaging studies are mapped into the OMOP CDM using the Procedure domain, following radiology procedure coding conventions:

Imaging Data ElementOMOP MappingDetails
Imaging study (e.g., CT Chest)procedure_occurrence rowprocedure_concept_id maps to the CPT4/SNOMED procedure concept for the imaging modality
Study dateprocedure_dateDate the imaging study was performed
Modality (CT, MR, US, XR)procedure_source_valueDICOM modality code stored as source value
Imaging report findingsnote rowRadiology report text linked to the procedure via visit_occurrence_id
NLP-extracted findingsnote_nlp + condition_occurrenceStructured findings extracted from radiology reports may also create condition rows
DICOM Metadata

Parthenon stores the full DICOM metadata (study UID, series UID, SOP instance UID, pixel data references) in its own imaging tables (imaging_studies, imaging_series, imaging_instances). The procedure_occurrence row links the OMOP clinical record to the imaging metadata via procedure_occurrence_id, enabling cohort definitions that combine imaging criteria with standard clinical criteria.

Standard Vocabularies by Domain

Condition Domain

  • Standard: SNOMED CT (OMOP concept class: Clinical Finding, Disorder)
  • Source: ICD-10-CM, ICD-10, ICD-9-CM, MedDRA, Read codes

Drug Domain

  • Standard: RxNorm (Ingredient, Clinical Drug, Clinical Drug Form)
  • Source: NDC, NDF-RT, GPI, AMIS, Multilex

Measurement Domain

  • Standard: LOINC
  • Source: LOINC (local labs may use LOINC directly), custom lab codes
  • Genomic extension: Genomic variant measurements use LOINC panel codes (e.g., LOINC 81247-9 for master HL7 genetic variant reporting)

Procedure Domain

  • Standard: SNOMED CT, CPT4, HCPCS
  • Source: ICD-10-PCS, ICD-9-CM procedures, local procedure codes
  • Imaging extension: CPT4 radiology codes (70000--79999 range) map imaging studies to procedures

CDM v5.4 Key Tables

Person Table

ColumnTypeDescription
person_idBIGINTUnique person identifier
gender_concept_idINTOMOP gender concept (8507=Male, 8532=Female)
year_of_birthINTBirth year
race_concept_idINTOMOP race concept
ethnicity_concept_idINTOMOP ethnicity concept

Condition Occurrence Table

ColumnTypeDescription
condition_occurrence_idBIGINTRow identifier
person_idBIGINTForeign key to person
condition_concept_idINTStandard SNOMED concept
condition_start_dateDATEOnset date
condition_end_dateDATEResolution date (optional)
condition_type_concept_idINTSource of record (EHR, claims, etc.)
condition_source_valueVARCHAROriginal ICD code
condition_source_concept_idINTConcept mapped from source code

Measurement Table (Including Genomic Variants)

ColumnTypeDescription
measurement_idBIGINTRow identifier
person_idBIGINTForeign key to person
measurement_concept_idINTStandard LOINC concept
measurement_dateDATEDate of measurement
value_as_numberNUMERICNumeric result (lab value, allele frequency)
value_as_concept_idINTCategorical result concept (variant classification)
unit_concept_idINTUnit of measure concept
measurement_source_valueVARCHAROriginal code or gene symbol

Procedure Occurrence Table (Including Imaging)

ColumnTypeDescription
procedure_occurrence_idBIGINTRow identifier
person_idBIGINTForeign key to person
procedure_concept_idINTStandard CPT4/SNOMED concept
procedure_dateDATEDate procedure was performed
procedure_type_concept_idINTSource of record
procedure_source_valueVARCHAROriginal code or DICOM modality

Observation Period

The observation_period table defines the windows during which a person's data is expected to be complete. OMOP-based analyses restrict to events within observation periods to avoid bias from incomplete records.

ColumnDescription
observation_period_start_dateStart of continuous observation
observation_period_end_dateEnd of continuous observation
period_type_concept_idHow the period was defined (insurance enrollment, EHR registration, etc.)
Genomic and Imaging Data Outside Observation Periods

Genomic test results and imaging studies must fall within a patient's observation period to be included in cohort definitions and analyses. If your genomic or imaging data has dates outside the observation period, those records will be excluded from OMOP-standard analyses. Ensure observation periods are updated when integrating new data sources.