Day 15: ADaM Architecture & Admiral Core Engine

Week 3, Day 15: Understanding ADaM Structures, Admiral Philosophy, and Core Derivation Patterns

1 Welcome to Week 3: ADaM Deep Dive & Admiral Mastery

Week 3 Overview: After mastering SDTM creation and validation in Weeks 1-2, we now transition to ADaM (Analysis Data Model) - the analysis-ready datasets that feed directly into regulatory Tables, Listings, and Figures (TLFs).

This week is intensive - we’ll use the {admiral} package to build every major ADaM dataset type. Admiral was co-developed by multiple pharmaceutical companies as a tidyverse-based, modular, non-black-box toolbox for ADaM creation, and is now maintained by 50+ pharma company programmers across the pharmaverse community.

Day 15 is foundational - today we learn the architecture before building datasets tomorrow.

2 1 Learning Objectives

By the end of Day 15, you will be able to:

Understand the three ADaM data structures: ADSL (subject-level), BDS (Basic Data Structure), and OCCDS (Occurrence Data Structure)
Explain Admiral’s design philosophy: Modular, transparent, tidyverse-native, and cross-company collaborative
Master Admiral’s four core derivation function categories: derive_vars_*, derive_var_*, derive_param_*, and derive_extreme_*
Map each of 8 key ADaM datasets to their source SDTM domain, ADaM structure type, and key Admiral functions
Navigate Admiral’s function taxonomy to quickly find the right function for any derivation task
Understand Admiral’s input-output contract and how functions compose sequentially

Clinical Context: Today is conceptual but critical. Understanding ADaM architecture prevents hours of debugging later. Knowing which Admiral functions to use for which patterns makes you 10x more efficient in Days 16-30.

3 2 What is ADaM? (Deep Dive)

3.1 ADaM in the Regulatory Workflow

Clinical Trial → EDC Data → SDTM (Tabulation) → ADaM (Analysis) → TLFs → Regulatory Submission
                                                      ↑
                                              Admiral creates this

ADaM (Analysis Data Model) is the CDISC standard for analysis-ready datasets used in regulatory submissions to FDA, EMA, and other health authorities.

3.2 Why ADaM Exists

SDTM limitations for analysis: - One record per observation (not per analysis unit) - No derived variables (change from baseline, treatment-emergent flags) - No baseline or analysis visit structure - Not optimized for statistical procedures

ADaM solves this by: - Structuring data for analysis (one row per subject/parameter/timepoint) - Adding derived variables (flags, categories, calculated values) - Ensuring traceability back to SDTM - Enabling direct input to SAS PROC or R statistical functions

3.3 ADaM Fundamental Principles (CDISC ADaM IG)

Analysis-ready: Data can be directly input to statistical procedures
Self-contained: All variables needed for analysis are in one dataset
Traceable: Every value traces back to SDTM source
One record per analysis unit: Structure depends on analysis type
Standardized naming: Consistent variable names across studies

Clinical Example:

In SDTM (LB - Laboratory):

USUBJID  LBTESTCD  LBSTRESN  LBDTC        VISIT
001      ALT       45        2024-01-15   Baseline
001      ALT       52        2024-02-15   Week 4
001      ALT       48        2024-03-15   Week 8

In ADaM (ADLB - BDS structure):

USUBJID  PARAMCD  AVISIT    AVAL  ABLFL  BASE  CHG  ANRIND
001      ALT      Baseline  45    Y      45    NA   NORMAL
001      ALT      Week 4    52    N      45    7    NORMAL
001      ALT      Week 8    48    N      45    3    NORMAL

Analysis benefit: - Baseline flag (ABLFL) clearly identifies reference measurement - Change from baseline (CHG) pre-calculated - Reference range indicator (ANRIND) ready for shift tables - Structure enables lm(CHG ~ AVISIT + BASE, data = adlb_filtered)

4 3 The Three ADaM Data Structures

CDISC defines three primary ADaM structures. Understanding which structure to use is the first decision in any ADaM creation.

4.1 3.1 ADSL: Subject-Level Analysis Dataset

Structure: One record per subject

Purpose: - Demographics and baseline characteristics - Treatment assignment (planned and actual) - Population flags (Safety, ITT, Per-Protocol) - Study dates and disposition - Foundation for all other ADaM datasets

Key Variables: - Identifiers: USUBJID, SUBJID, SITEID - Demographics: AGE, SEX, RACE, ETHNIC - Treatment: TRT01P, TRT01A, TRTSDT, TRTEDT - Flags: SAFFL, ITTFL, COMPLFL - Disposition: EOSSTT, DCSREAS

SDTM Sources: - Primary: DM (Demographics) - Secondary: EX (Exposure for treatment dates), DS (Disposition), VS/LB (baseline values)

Clinical Use: - Demographic tables (Table 14.1.1 in submissions) - Population definitions for all analyses - Subgroup definitions (age groups, sex, baseline severity)

Example Record:

USUBJID: "01-701-1015"
AGE: 75, SEX: "F", RACE: "WHITE"
TRT01P: "Xanomeline High Dose", TRT01A: "Xanomeline High Dose"
TRTSDT: 2024-01-20, TRTEDT: 2024-07-15, TRTDURD: 178
SAFFL: "Y", ITTFL: "Y", COMPLFL: "Y"
EOSSTT: "COMPLETED"

4.2 3.2 BDS: Basic Data Structure

Structure: One record per subject per parameter per analysis timepoint

Purpose: - Repeated measures over time (labs, vital signs, efficacy scales) - Baseline identification and change from baseline - Reference range and shift analysis

Key Variables: - PARAMCD/PARAM: Parameter identifier (e.g., “ALT”, “DIABP”) - AVAL/AVALC: Analysis value (numeric/character) - AVISIT/AVISITN: Analysis visit - ABLFL: Baseline flag (Y for baseline record) - BASE: Baseline value (carried forward to all post-baseline) - CHG: Change from baseline (AVAL - BASE) - ANRIND: Analysis reference range indicator (LOW/NORMAL/HIGH)

SDTM Sources: - LB → ADLB (Lab results) - VS → ADVS (Vital signs) - QS → ADQS (Questionnaires) - EG → ADEG (ECG results)

BDS Formula:

Number of Records = N subjects × N parameters × N timepoints

Example: ADLB

USUBJID  PARAMCD  AVISIT    AVAL  ABLFL  BASE  CHG   PCHG   ANRIND
001      ALT      Baseline  45    Y      45    NA    NA     NORMAL
001      ALT      Week 4    52    N      45    7     15.6   NORMAL
001      ALT      Week 8    48    N      45    3     6.7    NORMAL
001      ALKPH    Baseline  78    Y      78    NA    NA     NORMAL
001      ALKPH    Week 4    82    N      78    4     5.1    NORMAL

Clinical Use: - Change from baseline analysis - Shift tables (baseline NORMAL → Week 4 HIGH) - Time-course plots - Repeated measures ANCOVA

4.3 3.3 OCCDS: Occurrence Data Structure

Structure: One record per subject per event occurrence

Purpose: - Adverse events - Medications - Medical history events - Any event that can occur 0, 1, or multiple times per subject

Key Variables: - ASTDT/AENDT: Analysis start/end dates - ADURN: Analysis duration - TRTEMFL: Treatment-emergent flag - AOCCFL: First occurrence flag - ANL01FL: Primary analysis flag - Event-specific variables (e.g., severity, causality for AE)

SDTM Sources: - AE → ADAE (Adverse events) - CM → ADCM (Concomitant medications) - MH → ADMH (Medical history)

OCCDS Formula:

Number of Records = Variable (0 to N events per subject)

Example: ADAE

USUBJID  AESEQ  AETERM      ASTDT       AENDT       TRTEMFL  AOCCFL  ASEV
001      1      Headache    2024-01-22  2024-01-23  Y        Y       MILD
001      2      Nausea      2024-02-10  2024-02-12  Y        N       MODERATE
001      3      Dizziness   2024-03-05  2024-03-05  Y        N       MILD
002      1      Headache    2024-01-18  2024-01-19  N        N       MILD

Clinical Use: - Adverse event frequency tables - Time-to-first-event analysis - Treatment-emergent vs pre-existing events - Causality and severity analysis

5 4 Admiral Philosophy & Design Principles

5.1 4.1 Why Admiral Was Created

Pre-Admiral Era (2010-2020): - Every pharma company had proprietary SAS macro libraries - Black-box functions (couldn’t see or modify code) - Duplicated effort (50 companies solving the same problems) - Difficult to validate (macros hidden in compiled code) - Knowledge siloed within companies

Admiral Vision (2020-present): - Open-source: Code is fully transparent and inspectable - Cross-company collaboration: Multiple pharmaceutical companies co-develop - Tidyverse-native: Leverages dplyr, tidyr, lubridate - familiar to R users - Modular: Small, composable functions instead of monolithic macros - Non-black-box: Every derivation is explicit in your script - Validated: Extensive test coverage and regulatory use

5.2 4.2 Admiral Core Design Principles

5.2.1 Principle 1: Modularity

Bad (monolithic):

create_adsl(dm, ex, ds, ae, lb)  # What does this do? Can't see inside!

Good (modular Admiral):

adsl <- dm %>%
  derive_vars_merged(dataset_add = ex, ...) %>%  # Clear: merging EX
  derive_var_trtdurd() %>%                       # Clear: deriving duration
  derive_var_merged_exist_flag(dataset_add = ex, new_var = SAFFL, ...)

Every step is visible and modifiable.

5.2.2 Principle 2: Tidyverse Integration

Admiral functions work seamlessly with %>% pipes and dplyr verbs:

adsl <- dm %>%
  filter(ARMCD != "SCRNFAIL") %>%          # dplyr
  derive_vars_merged(ex, ...) %>%          # admiral
  mutate(AGEGR1 = case_when(...)) %>%      # dplyr
  derive_var_trtdurd()                     # admiral

5.2.3 Principle 3: Transparency

Every Admiral function: - Returns a modified data frame (never a hidden state) - Has explicit input and output - Can be inspected mid-pipeline with glimpse() or View()

adsl_step1 <- dm %>% derive_vars_merged(ex, ...)
glimpse(adsl_step1)  # Inspect intermediate result
adsl_step2 <- adsl_step1 %>% derive_var_trtdurd()

5.2.4 Principle 4: Reusability

Admiral functions are parameterized - same function, different arguments:

# Derive first dose date
derive_vars_merged(dataset_add = ex, mode = "first", ...)

# Derive last dose date (same function, different mode)
derive_vars_merged(dataset_add = ex, mode = "last", ...)

5.2.5 Principle 5: Traceability

Admiral encourages inline documentation:

adsl <- dm %>%
  # Treatment start date: first dose from EX
  derive_vars_merged(
    dataset_add = ex,
    by_vars = exprs(STUDYID, USUBJID),
    new_vars = exprs(TRTSDTM = EXSTDTM),
    mode = "first",
    filter_add = EXDOSE > 0
  )

5.3 4.3 Admiral vs Traditional SAS Macros

Aspect	SAS Macros (Legacy)	Admiral (Modern)
Source code	Hidden in compiled `.sas7bcat`	Fully open on GitHub
Collaboration	Company-specific	Cross-company
Debugging	Difficult (macro expansion errors)	Easy (standard R debugging)
Testing	Manual	Automated with `testthat`
Community	Internal only	1000+ GitHub stars, Slack
Learning curve	Steep (macro language)	Moderate (tidyverse knowledge)
Validation	Company validates internally	Shared validation evidence

6 5 Admiral’s Four Core Function Categories

Admiral has 170+ functions, but they fall into 4 main categories. Learning these categories is like learning grammar - it lets you construct any derivation.

6.1 5.1 Category 1: `derive_vars_*` - Add Multiple Variables

Pattern: derive_vars_*(dataset, dataset_add, by_vars, new_vars, ...)

Purpose: Add multiple variables to a dataset by merging from another dataset

Key Functions:

6.1.1 `derive_vars_merged()`

Merge variables from another dataset (most versatile function in Admiral)

# Pseudo-code example
adsl <- dm %>%
  derive_vars_merged(
    dataset_add = ex,
    by_vars = exprs(USUBJID),
    new_vars = exprs(TRTSDTM = EXSTDTM),
    mode = "first",       # or "last"
    order = exprs(EXSTDTM)
  )

Use cases: - Treatment dates from EX - Disposition reasons from DS - Baseline values from LB/VS

6.1.2 `derive_vars_dt()` and `derive_vars_dtm()`

Convert character dates to Date or POSIXct

adsl <- adsl %>%
  derive_vars_dt(dtc = RFSTDTC, new_vars_prefix = "RFST")
  # Creates RFSTDT

Use cases: - All date conversions in ADSL, ADAE, ADLB, etc. - Handles partial dates with imputation

6.1.3 `derive_vars_merged_lookup()`

Add variables from a lookup table (e.g., parameter metadata)

adlb <- lb %>%
  derive_vars_merged_lookup(
    dataset_add = param_lookup,
    by_vars = exprs(PARAMCD),
    new_vars = exprs(PARAM, PARAMN)
  )

Use cases: - Parameter labels in BDS datasets - Decode values from reference tables

6.2 5.2 Category 2: `derive_var_*` - Add One Variable

Pattern: derive_var_*(dataset, ...)

Purpose: Add a single derived variable to a dataset

Key Functions:

6.2.1 `derive_var_trtdurd()`

Treatment duration (days)

adsl <- adsl %>%
  derive_var_trtdurd()
  # Creates TRTDURD = TRTEDT - TRTSDT + 1

6.2.2 `derive_var_merged_exist_flag()`

Flag if a condition exists in another dataset

adsl <- adsl %>%
  derive_var_merged_exist_flag(
    dataset_add = ex,
    by_vars = exprs(USUBJID),
    new_var = SAFFL,
    condition = EXDOSE > 0
  )
  # SAFFL = "Y" if subject has any EX record with dose > 0

Use cases: - Population flags (SAFFL, ITTFL) - Event flags (had SAE, had dose interruption)

6.2.3 `derive_var_extreme_flag()`

Flag first/last/min/max record within a group

adlb <- adlb %>%
  derive_var_extreme_flag(
    by_vars = exprs(USUBJID, PARAMCD),
    order = exprs(ADT),
    new_var = ABLFL,
    mode = "last",
    filter = ADT < TRTSDT
  )
  # Flags last pre-treatment lab as baseline

Use cases: - Baseline flags (ABLFL) - First occurrence flags (AOCCFL)

6.3 5.3 Category 3: `derive_param_*` - Add Computed Parameters

Pattern: derive_param_*(dataset, ...)

Purpose: Add new rows with computed parameters (BDS-specific)

Key Functions:

6.3.1 `derive_param_computed()`

Create a parameter computed from other parameters

advs <- advs %>%
  derive_param_computed(
    by_vars = exprs(USUBJID, AVISIT),
    parameters = c("SYSBP", "DIABP"),
    set_values_to = exprs(
      PARAMCD = "MAP",
      PARAM = "Mean Arterial Pressure",
      AVAL = (SYSBP + 2 * DIABP) / 3
    )
  )
  # Adds new rows with PARAMCD = "MAP"

Use cases: - Derived parameters (MAP from BP, BMI from height/weight) - Ratios (LDL/HDL)

6.3.2 `derive_param_exist_flag()`

Create a parameter indicating if an event occurred

adlb <- adlb %>%
  derive_param_exist_flag(
    condition = ANRIND == "HIGH",
    set_values_to = exprs(
      PARAMCD = "ANYHIGH",
      PARAM = "Any High Lab Value"
    )
  )
  # Adds new rows flagging subjects with any high lab

6.4 5.4 Category 4: `derive_extreme_records()` and Utilities

Pattern: derive_extreme_records(dataset, dataset_add, ...)

Purpose: Compute records (e.g., LOCF, average across visits)

Key Functions:

6.4.1 `derive_extreme_records()`

Create records by selecting extremes (first, last, min, max)

adlb_locf <- adlb %>%
  derive_extreme_records(
    dataset_add = adlb,
    by_vars = exprs(USUBJID, PARAMCD, AVISITN),
    order = exprs(ADT),
    mode = "last",
    filter_add = !is.na(AVAL)
  )
  # LOCF imputation: carry forward last observed value

Use cases: - LOCF/WOCF imputation - Summary records (average of multiple readings)

6.4.2 Higher-Order Functions (HOFs)

call_derivation() Apply the same derivation with different parameters

adsl <- call_derivation(
  dataset = adsl,
  derivation = derive_var_extreme_flag,
  variable_params = list(
    params(new_var = AOCCFL, filter = !is.na(ASTDT)),
    params(new_var = AOCCSFL, filter = AESER == "Y")
  ),
  by_vars = exprs(USUBJID)
)
# Derives two flags with one call

restrict_derivation() Apply derivation only to subset

adae <- restrict_derivation(
  dataset = adae,
  derivation = derive_var_extreme_flag,
  args = params(...),
  filter = TRTEMFL == "Y"
)
# Only applies to treatment-emergent AEs

slice_derivation() Apply different derivations to different slices

adlb <- slice_derivation(
  dataset = adlb,
  derivation = derive_var_base,
  args = params(by_vars = exprs(USUBJID, PARAMCD)),
  slices = list(
    slice(filter = PARAMCD %in% c("ALT", "AST"), args = params(source_var = AVAL)),
    slice(filter = PARAMCD == "BILI", args = params(source_var = AVALC))
  )
)

7 6 Mapping ADaM Datasets to Admiral Patterns

This table is your rosetta stone for ADaM creation. Memorize this mapping.

ADaM Dataset	SDTM Source(s)	Structure	Key Admiral Functions	Primary Use
ADSL	DM, EX, DS, AE, VS, LB	Subject-level (1 row/subject)	`derive_vars_merged()`, `derive_var_merged_exist_flag()`, `derive_var_trtdurd()`	Demographics, population flags, treatment dates
ADAE	AE, ADSL	OCCDS (1 row/event)	`derive_vars_merged()` (ADSL), `derive_var_extreme_flag()` (TRTEMFL, AOCCFL), `derive_vars_dt()`	Safety tables, AE listings, time-to-first-AE
ADLB	LB, ADSL	BDS (1 row/subject/param/visit)	`derive_var_extreme_flag()` (ABLFL), `derive_var_base()`, `derive_var_chg()`, `derive_var_pchg()`	Lab result tables, shift tables, change analysis
ADVS	VS, ADSL	BDS	`derive_var_extreme_flag()` (ABLFL), `derive_var_base()`, `derive_var_chg()`, `derive_vars_joined()` (visit mapping)	Vital signs tables, plots
ADTTE	DM, DS, AE, ADSL	BDS-like (1 row/subject/endpoint)	`event_source()`, `censor_source()`, `derive_param_tte()`	Kaplan-Meier, Cox regression, time-to-event
ADCM	CM, ADSL	OCCDS	`derive_vars_merged()` (ADSL), `derive_vars_dt()`, period flags (CMPREFL, CMCONFL)	Concomitant meds tables
ADEX	EX, ADSL	BDS (1 row/subject/param/dose)	`derive_vars_merged()` (ADSL), `derive_param_exposure()` (cumulative dose, intensity)	Dose intensity, compliance
ADRS	RS, TR, TU, ADSL	BDS (oncology-specific)	`admiral.onco` package: `derive_param_response()`, `derive_param_confirmed_resp()`	Tumor response, RECIST criteria

8 7 Hands-On: Exploring Admiral Function Taxonomy

Let’s load Admiral and explore its structure.

# Load packages
library(admiral)
library(dplyr)
library(pharmaversesdtm)  # For example SDTM data
library(tibble)

# Check Admiral version
packageVersion("admiral")

[1] '1.4.0'

8.1 7.1 Listing All Admiral Functions

# Get all Admiral functions
admiral_functions <- ls("package:admiral")

# Filter by category
cat("derive_vars_* functions (add multiple variables):\n")

derive_vars_* functions (add multiple variables):

admiral_functions[grepl("^derive_vars_", admiral_functions)] %>% head(10)

 [1] "derive_vars_aage"      "derive_vars_atc"       "derive_vars_cat"      
 [4] "derive_vars_computed"  "derive_vars_crit_flag" "derive_vars_dt"       
 [7] "derive_vars_dtm"       "derive_vars_dtm_to_dt" "derive_vars_dtm_to_tm"
[10] "derive_vars_duration"

cat("\nderive_var_* functions (add one variable):\n")


derive_var_* functions (add one variable):

admiral_functions[grepl("^derive_var_[^s]", admiral_functions)] %>% head(10)

 [1] "derive_var_age_years"      "derive_var_analysis_ratio"
 [3] "derive_var_anrind"         "derive_var_atoxgr"        
 [5] "derive_var_atoxgr_dir"     "derive_var_base"          
 [7] "derive_var_chg"            "derive_var_dthcaus"       
 [9] "derive_var_extreme_dt"     "derive_var_extreme_dtm"

cat("\nderive_param_* functions (add computed parameters):\n")


derive_param_* functions (add computed parameters):

admiral_functions[grepl("^derive_param_", admiral_functions)] %>% head(10)

 [1] "derive_param_bmi"            "derive_param_bsa"           
 [3] "derive_param_computed"       "derive_param_doseint"       
 [5] "derive_param_exist_flag"     "derive_param_exposure"      
 [7] "derive_param_extreme_record" "derive_param_framingham"    
 [9] "derive_param_map"            "derive_param_qtc"

cat("\nderive_extreme_* functions:\n")


derive_extreme_* functions:

admiral_functions[grepl("^derive_extreme_", admiral_functions)]

[1] "derive_extreme_event"   "derive_extreme_records"

cat("\nHigher-order functions (HOFs):\n")


Higher-order functions (HOFs):

admiral_functions[admiral_functions %in% c("call_derivation", "restrict_derivation", "slice_derivation")]

[1] "call_derivation"     "restrict_derivation" "slice_derivation"

8.2 7.2 Examining Function Arguments

# Example: What arguments does derive_vars_merged() take?
cat("derive_vars_merged() arguments:\n")

derive_vars_merged() arguments:

cat(paste(names(formals(derive_vars_merged)), collapse = ", "))

dataset, dataset_add, by_vars, order, new_vars, filter_add, mode, exist_flag, true_value, false_value, missing_values, check_type, duplicate_msg, relationship

cat("\n\nderive_var_extreme_flag() arguments:\n")



derive_var_extreme_flag() arguments:

cat(paste(names(formals(derive_var_extreme_flag)), collapse = ", "))

dataset, by_vars, order, new_var, mode, true_value, false_value, flag_all, check_type

8.3 7.3 Finding the Right Function

Scenario: “I need to flag the first adverse event per subject.”

Thought process: 1. First occurrence → need a flag → derive_var_* 2. First/extreme → derive_var_extreme_flag() 3. Check help: ?derive_var_extreme_flag

# Search Admiral help
?derive_var_extreme_flag

# Or browse all Admiral vignettes
browseVignettes("admiral")

9 8 Deliverable: ADaM Dataset Mapping Matrix

Your deliverable today is a mapping matrix documenting the architecture of your future ADaM package.

# Create mapping matrix
adam_mapping <- tribble(
  ~ADAM_Dataset, ~SDTM_Sources, ~Structure_Type, ~Admiral_Function_1, ~Admiral_Function_2, ~Admiral_Function_3, ~Clinical_Purpose,

  "ADSL", 
  "DM (primary), EX, DS, AE, VS, LB", 
  "Subject-level", 
  "derive_vars_merged()", 
  "derive_var_merged_exist_flag()", 
  "derive_var_trtdurd()",
  "Demographics tables, population flags, treatment assignments",

  "ADAE",
  "AE (primary), ADSL",
  "OCCDS",
  "derive_vars_merged() [ADSL]",
  "derive_var_extreme_flag() [TRTEMFL]",
  "derive_vars_dt() [dates]",
  "Adverse event frequency tables, AE listings, safety summaries",

  "ADLB",
  "LB (primary), ADSL",
  "BDS",
  "derive_var_extreme_flag() [ABLFL]",
  "derive_var_base()",
  "derive_var_chg()",
  "Lab result tables, shift tables, change from baseline analysis",

  "ADVS",
  "VS (primary), ADSL",
  "BDS",
  "derive_var_extreme_flag() [ABLFL]",
  "derive_var_base()",
  "derive_vars_joined() [visit mapping]",
  "Vital signs tables, BP plots, vital sign changes",

  "ADTTE",
  "DS, AE, ADSL",
  "BDS-like (time-to-event)",
  "event_source()",
  "censor_source()",
  "derive_param_tte()",
  "Kaplan-Meier curves, Cox regression, survival analysis",

  "ADCM",
  "CM (primary), ADSL",
  "OCCDS",
  "derive_vars_merged() [ADSL]",
  "derive_vars_dt() [dates]",
  "Period flags (CMPREFL, CMCONFL)",
  "Concomitant medications tables, prior/concomitant distinction",

  "ADEX",
  "EX (primary), ADSL",
  "BDS (exposure metrics)",
  "derive_vars_merged() [ADSL]",
  "derive_param_exposure() [cumulative dose]",
  "derive_var_duration()",
  "Dose intensity tables, treatment compliance, exposure summaries",

  "ADRS",
  "RS, TR, TU, ADSL",
  "BDS (oncology)",
  "admiral.onco::derive_param_response()",
  "admiral.onco::derive_param_confirmed_resp()",
  "admiral.onco::derive_param_clinbenefit()",
  "Tumor response tables, RECIST assessments, best overall response"
)

# Display mapping
adam_mapping %>%
  select(ADAM_Dataset, SDTM_Sources, Structure_Type, Clinical_Purpose) %>%
  knitr::kable(caption = "ADaM Dataset Architecture Overview")

ADaM Dataset Architecture Overview
ADAM_Dataset	SDTM_Sources	Structure_Type	Clinical_Purpose
ADSL	DM (primary), EX, DS, AE, VS, LB	Subject-level	Demographics tables, population flags, treatment assignments
ADAE	AE (primary), ADSL	OCCDS	Adverse event frequency tables, AE listings, safety summaries
ADLB	LB (primary), ADSL	BDS	Lab result tables, shift tables, change from baseline analysis
ADVS	VS (primary), ADSL	BDS	Vital signs tables, BP plots, vital sign changes
ADTTE	DS, AE, ADSL	BDS-like (time-to-event)	Kaplan-Meier curves, Cox regression, survival analysis
ADCM	CM (primary), ADSL	OCCDS	Concomitant medications tables, prior/concomitant distinction
ADEX	EX (primary), ADSL	BDS (exposure metrics)	Dose intensity tables, treatment compliance, exposure summaries
ADRS	RS, TR, TU, ADSL	BDS (oncology)	Tumor response tables, RECIST assessments, best overall response

# Display function mappings
adam_mapping %>%
  select(ADAM_Dataset, Admiral_Function_1, Admiral_Function_2, Admiral_Function_3) %>%
  knitr::kable(caption = "Key Admiral Functions by ADaM Dataset")

Key Admiral Functions by ADaM Dataset
ADAM_Dataset	Admiral_Function_1	Admiral_Function_2	Admiral_Function_3
ADSL	derive_vars_merged()	derive_var_merged_exist_flag()	derive_var_trtdurd()
ADAE	derive_vars_merged() [ADSL]	derive_var_extreme_flag() [TRTEMFL]	derive_vars_dt() [dates]
ADLB	derive_var_extreme_flag() [ABLFL]	derive_var_base()	derive_var_chg()
ADVS	derive_var_extreme_flag() [ABLFL]	derive_var_base()	derive_vars_joined() [visit mapping]
ADTTE	event_source()	censor_source()	derive_param_tte()
ADCM	derive_vars_merged() [ADSL]	derive_vars_dt() [dates]	Period flags (CMPREFL, CMCONFL)
ADEX	derive_vars_merged() [ADSL]	derive_param_exposure() [cumulative dose]	derive_var_duration()
ADRS	admiral.onco::derive_param_response()	admiral.onco::derive_param_confirmed_resp()	admiral.onco::derive_param_clinbenefit()

9.1 Save Mapping for Reference

# Save as CSV for future reference
write.csv(adam_mapping, "adam_dataset_mapping.csv", row.names = FALSE)

cat("✓ Mapping matrix saved to adam_dataset_mapping.csv\n")

✓ Mapping matrix saved to adam_dataset_mapping.csv

cat("\nThis matrix will guide your ADaM builds in Days 16-30!\n")


This matrix will guide your ADaM builds in Days 16-30!

10 9 Admiral Input-Output Contract

Understanding Admiral’s input-output contract prevents common errors.

10.1 Rule 1: Functions Return Modified Data Frames

# Input: data frame
# Output: modified data frame (never modifies in place)

adsl_new <- adsl_old %>%
  derive_var_trtdurd()

# adsl_old is unchanged
# adsl_new has TRTDURD column added

10.2 Rule 2: Functions Compose with Pipes

adsl <- dm %>%
  derive_vars_dt(...) %>%
  derive_vars_merged(...) %>%
  derive_var_trtdurd() %>%
  mutate(...)

Each function: - Takes a data frame as first argument - Returns a modified data frame - Can be inspected mid-pipeline

10.3 Rule 3: Required Variables Must Exist

# This will error if TRTSDT or TRTEDT don't exist
adsl <- adsl %>%
  derive_var_trtdurd()

# Check required variables
required_vars <- exprs(TRTSDT, TRTEDT)
assert_data_frame(adsl, required_vars = required_vars)

10.4 Rule 4: Functions Don’t Modify Unexpected Variables

Admiral functions only add new variables or modify explicitly named variables. They never silently change existing variables.

# derive_var_trtdurd() only adds TRTDURD
# It never modifies TRTSDT, TRTEDT, or any other existing variable

11 10 Navigating Admiral Documentation

11.1 10.1 Admiral Website

URL: https://pharmaverse.github.io/admiral/

Key sections: - Get Started: Quick introduction - Vignettes: Topic-based guides (ADSL, ADAE, ADLB, dates, etc.) - Reference: Alphabetical function list with examples - Articles: Advanced topics (Higher-Order Functions, metadata, programming strategy)

11.2 10.2 Function Help

# In R console
?derive_vars_merged
?derive_var_extreme_flag

# Or click "Help" tab in RStudio and search

11.3 10.3 Vignettes (Most Important!)

# List all vignettes
browseVignettes("admiral")

# Key vignettes to read:
# - "Creating ADSL"
# - "Creating a BDS Finding ADaM"
# - "Date and Time Imputation"
# - "Higher Order Functions"

11.4 10.4 GitHub Issues & Discussions

URL: https://github.com/pharmaverse/admiral

Issues: Bug reports and feature requests
Discussions: Q&A, best practices
Search before asking - your question may be answered

11.5 10.5 Pharmaverse Slack

#admiral channel: Active community support
Join at pharmaverse.org

12 11 Common Admiral Patterns (Cheat Sheet)

Memorize these patterns - they appear in 90% of ADaM derivations.

12.1 Pattern 1: Merge Variables from Another Dataset

adsl <- adsl %>%
  derive_vars_merged(
    dataset_add = ex,
    by_vars = exprs(USUBJID),
    new_vars = exprs(TRTSDTM = EXSTDTM),
    mode = "first",  # or "last"
    order = exprs(EXSTDTM),
    filter_add = EXDOSE > 0
  )

Use for: Treatment dates, disposition reasons, any variable from another SDTM domain

12.2 Pattern 2: Flag if Condition Exists

adsl <- adsl %>%
  derive_var_merged_exist_flag(
    dataset_add = ex,
    by_vars = exprs(USUBJID),
    new_var = SAFFL,
    condition = EXDOSE > 0
  )

Use for: Population flags (SAFFL, ITTFL), event occurrence flags

12.3 Pattern 3: Flag First/Last/Extreme Record

adlb <- adlb %>%
  derive_var_extreme_flag(
    by_vars = exprs(USUBJID, PARAMCD),
    order = exprs(ADT),
    new_var = ABLFL,
    mode = "last",
    filter = ADT < TRTSDT
  )

Use for: Baseline flags, first occurrence flags, min/max flags

12.4 Pattern 4: Add Baseline Value

adlb <- adlb %>%
  derive_var_base(
    by_vars = exprs(USUBJID, PARAMCD),
    source_var = AVAL,
    new_var = BASE,
    filter = ABLFL == "Y"
  )

Use for: Carrying baseline forward in BDS datasets

12.5 Pattern 5: Compute Change from Baseline

adlb <- adlb %>%
  derive_var_chg() %>%
  derive_var_pchg()

Use for: CHG, PCHG in all BDS datasets

12.6 Pattern 6: Date Conversion with Imputation

adsl <- adsl %>%
  derive_vars_dt(
    dtc = RFSTDTC,
    new_vars_prefix = "RFST",
    highest_imputation = "M"
  )

Use for: All date conversions in all ADaM datasets

13 12 ADaM Creation Workflow (Conceptual)

Standard sequence for any ADaM dataset:

1. Start with primary SDTM domain
   ↓
2. Merge ADSL (for demographics, treatment, flags)
   ↓
3. Derive dates (convert DTC → DT/DTM)
   ↓
4. Derive analysis values (AVAL, AVALC)
   ↓
5. Derive baseline flags (ABLFL for BDS)
   ↓
6. Derive baseline values (BASE for BDS)
   ↓
7. Derive change from baseline (CHG, PCHG for BDS)
   ↓
8. Derive analysis flags (TRTEMFL, ANL01FL, etc.)
   ↓
9. Derive computed parameters (for BDS)
   ↓
10. Add metadata (variable labels, formats)
   ↓
11. Export to XPT

Admiral makes each step explicit in your code.

14 13 Key Takeaways

14.1 Technical Takeaways

Three ADaM structures: ADSL (subject-level), BDS (parameters × visits), OCCDS (events)
Admiral is modular: Small, composable functions that chain with %>%
Four function categories: derive_vars_* (multi-var), derive_var_* (single-var), derive_param_* (computed parameters), derive_extreme_* (record computations)
Functions compose: Each takes and returns a data frame
Transparency: Every derivation is explicit in your script
Reusability: Same functions work across datasets with different arguments

14.2 Strategic Takeaways

ADSL is the foundation: Build ADSL first, then all other ADaMs merge from it
Structure determines function choice: BDS needs baseline derivations, OCCDS needs occurrence flags
Admiral documentation is extensive: Vignettes are your best friend
Community support: Pharmaverse Slack, GitHub discussions
Cross-company patterns: What works at one company works at another

14.3 Prepare for Tomorrow

Day 16: Hands-on ADSL creation (treatment dates, population flags)
Day 17: ADSL demographics and baseline values
Days 18-24: Build every ADaM type (ADAE, ADLB, ADVS, ADTTE, ADCM, ADEX)

15 14 Additional Resources

15.1 Official Admiral Resources

Admiral Website: https://pharmaverse.github.io/admiral/
Admiral GitHub: https://github.com/pharmaverse/admiral
Pharmaverse Website: https://pharmaverse.org
Admiral Get Started: https://pharmaverse.github.io/admiral/articles/admiral.html

15.2 Key Vignettes (Read These!)

Creating ADSL: Full ADSL workflow
Creating a BDS Finding ADaM: ADLB/ADVS patterns
Occurrence Datasets: ADAE/ADCM patterns
Higher Order Functions: Advanced patterns
Date and Time Imputation: Complete date handling guide

15.3 CDISC Standards

ADaM IG v1.3: https://www.cdisc.org/standards/foundational/adam
ADaM Examples: https://github.com/cdisc-org/sdtm-adam-pilot-project

15.4 Community

Pharmaverse Slack: #admiral channel
Pharmaverse Blog: https://pharmaverse.github.io/blog/
Pharmaverse Examples: https://pharmaverse.github.io/examples/

16 15 Homework (Optional)

Prepare for Day 16 (hands-on ADSL) by:

Read Admiral ADSL vignette: https://pharmaverse.github.io/admiral/articles/adsl.html
Explore pharmaversesdtm datasets: Load DM, EX, DS and examine their structure
Review your ADaM mapping matrix: Understand how ADSL sources from multiple SDTM domains
Browse Admiral function reference: Get familiar with function names and categories

Tomorrow we build ADSL from scratch!

End of Day 15

Today you mastered ADaM architecture and Admiral philosophy.
Tomorrow you’ll put it into practice building your first ADaM dataset: ADSL!

1 Welcome to Week 3: ADaM Deep Dive & Admiral Mastery

2 1 Learning Objectives

3 2 What is ADaM? (Deep Dive)

3.1 ADaM in the Regulatory Workflow

3.2 Why ADaM Exists

3.3 ADaM Fundamental Principles (CDISC ADaM IG)

4 3 The Three ADaM Data Structures

4.1 3.1 ADSL: Subject-Level Analysis Dataset

4.2 3.2 BDS: Basic Data Structure

4.3 3.3 OCCDS: Occurrence Data Structure

5 4 Admiral Philosophy & Design Principles

5.1 4.1 Why Admiral Was Created

5.2 4.2 Admiral Core Design Principles

5.2.1 Principle 1: Modularity

5.2.2 Principle 2: Tidyverse Integration

5.2.3 Principle 3: Transparency

5.2.4 Principle 4: Reusability

5.2.5 Principle 5: Traceability

5.3 4.3 Admiral vs Traditional SAS Macros

6 5 Admiral’s Four Core Function Categories

6.1 5.1 Category 1: derive_vars_* - Add Multiple Variables

6.1.1 derive_vars_merged()

6.1.2 derive_vars_dt() and derive_vars_dtm()

6.1.3 derive_vars_merged_lookup()

6.2 5.2 Category 2: derive_var_* - Add One Variable

6.2.1 derive_var_trtdurd()

6.2.2 derive_var_merged_exist_flag()

6.2.3 derive_var_extreme_flag()

6.3 5.3 Category 3: derive_param_* - Add Computed Parameters

6.3.1 derive_param_computed()

6.3.2 derive_param_exist_flag()

6.4 5.4 Category 4: derive_extreme_records() and Utilities

6.4.1 derive_extreme_records()

6.4.2 Higher-Order Functions (HOFs)

7 6 Mapping ADaM Datasets to Admiral Patterns

8 7 Hands-On: Exploring Admiral Function Taxonomy

8.1 7.1 Listing All Admiral Functions

8.2 7.2 Examining Function Arguments

8.3 7.3 Finding the Right Function

9 8 Deliverable: ADaM Dataset Mapping Matrix

9.1 Save Mapping for Reference

10 9 Admiral Input-Output Contract

10.1 Rule 1: Functions Return Modified Data Frames

10.2 Rule 2: Functions Compose with Pipes

10.3 Rule 3: Required Variables Must Exist

10.4 Rule 4: Functions Don’t Modify Unexpected Variables

11 10 Navigating Admiral Documentation

11.1 10.1 Admiral Website

11.2 10.2 Function Help

11.3 10.3 Vignettes (Most Important!)

11.4 10.4 GitHub Issues & Discussions

11.5 10.5 Pharmaverse Slack

12 11 Common Admiral Patterns (Cheat Sheet)

12.1 Pattern 1: Merge Variables from Another Dataset

12.2 Pattern 2: Flag if Condition Exists

12.3 Pattern 3: Flag First/Last/Extreme Record

12.4 Pattern 4: Add Baseline Value

12.5 Pattern 5: Compute Change from Baseline

12.6 Pattern 6: Date Conversion with Imputation

13 12 ADaM Creation Workflow (Conceptual)

14 13 Key Takeaways

14.1 Technical Takeaways

14.2 Strategic Takeaways

14.3 Prepare for Tomorrow

15 14 Additional Resources

15.1 Official Admiral Resources

15.2 Key Vignettes (Read These!)

15.3 CDISC Standards

15.4 Community

16 15 Homework (Optional)

6.1 5.1 Category 1: `derive_vars_*` - Add Multiple Variables

6.1.1 `derive_vars_merged()`

6.1.2 `derive_vars_dt()` and `derive_vars_dtm()`

6.1.3 `derive_vars_merged_lookup()`

6.2 5.2 Category 2: `derive_var_*` - Add One Variable

6.2.1 `derive_var_trtdurd()`

6.2.2 `derive_var_merged_exist_flag()`

6.2.3 `derive_var_extreme_flag()`

6.3 5.3 Category 3: `derive_param_*` - Add Computed Parameters

6.3.1 `derive_param_computed()`

6.3.2 `derive_param_exist_flag()`

6.4 5.4 Category 4: `derive_extreme_records()` and Utilities

6.4.1 `derive_extreme_records()`