Markov Models in Economic Evaluations IV

Quarto
R
Academia
Health Economics
Published

March 11, 2025

Hello dear readers and welcome to a new post. Today, I would like to resume my current series of posts related to decision analytic models in health economics and focus on some extensions of the simple Markov model structure which I introduced in a post a few months ago. I suggest to check out my previous post for a summary introduction and description of what Markov models are and for what they are used.

A simplifying assumption of traditional Markov models is that the transition probabilities regulating the movement between health states remain constant across the model cycles, i.e. over time. In some cases, this simplification of the reality may represent a too strong assumption that, if kept, may lead to misleading cost-effectiveness results. When this occurs, there are different approaches that can be used to overcome this problem and allow at least some transition probabilities to vary over time. Before going into the technical details on how to do this, I would like first to provide a quick description of the different types of time-dependency that can be allowed for transition probabilities and when a specific type might be preferable to use compared to another. For the topics discussed in this post, I took inspiration from Caldwell (2007), which I strongly suggest to consult if you would like to have a more in-depth look into the matter.

Type of time-dependency

The first type of time-dependency allows transition probabilities to vary according to the time in the model, so that the probability of a transition between states changes as the cohort ages. For example, when modelling the evolution of a deadly disease, it may be more reasonable to assume that the probability of moving to a death state increases as the cohort ages. To ensure this is possible, the starting age of a cohort should be stated, so that the age of the cohort at any cycle of the model is also known. We can then implement age-dependent transition probabilities by having a unique probability for each cycle. Commonly, these cycle-dependent transition probabilities are taken from the literature under the form of national life-tables, where death probabilities for different age groups for the target population are usually reported. In general, it makes sense for the transition probabilities to increase as the cohort ages. In addition, since all transition probabilities in a given cycle need to sum up to 1, the probability of remaining in a given state for the next cycle has also to allow for an age-dependent probability.

The second type of time-dependency allows transition probabilities to vary according to time in state, so that the probability changes as the time spent in a state increases. For example, when modelling the evolution of a deadly disease, it may be more reasonable to assume that the probability to move to a death state from a specific disease state increases as the time spent by patients in that disease state also increases. This can be modelled by assuming that the probability of moving between a given state A to another in the model varies according on how long the patient had been in the given state A, given that all patients start in the model in state A at a given time and that once they leave it they cannot return in it. This allows to keep track of how long a patient has been in state A and makes the transition probability dependent on time in a state equivalent to making it dependent on the age of the cohort.

When external sources providing information on age-dependent transition probabilities are not available, the relationship between a transition probability and time can be estimated using patient-level data, i.e. in the form of a longitudinal study recording the time to an event for each patient. Estimation can then be performed by means of survival analysis methods, which allow to handle (noninformative) censoring which frequently occurs in longitudinal studies, i.e. when observations stop being recorded before a patient has had the chance of experiencing the event of interest. Usually, parametric survival models are employed in order to obtain an estimate of the relation between the risk of experiencing an event and time, represented by the functional form of the hazard function. In particular, of key importance is the survivor function

\[ S(t)=P(T>t)=1-F(t), \]

denoting the probability of surviving (i.e. not experiencing the event) for a period of time greater than \(t\), with \(F(t)=P(T\leq t)\) being the cumulative density function. We can also express the survivor function in terms of the cumulative hazard function such that

\[ S(t)=\text{exp}\{-H(t)\}, \]

where, $H(t) $denotes the cumulative instantaneous chance of experiencing the event (i.e. hazard rate) at time \(t\), conditional on having survived to time \(t\). Using this relationship, it is possible to derive transition probabilities for Markov models. For example, let’s consider a single type of Markov model with only two states, i.e. alive and death, so that only one transition probability needs to be estimated. Define the length of the Markov cycle as \(u\) and let the instantaneous hazard of death at time \(t\) be \(h(t)\). We now need to estimate the discrete transition probability between the moments \((t-u)\) and \(t\), say \(p(t_u)\), where \(t_u\) indicates that \(t\) is now measured as integer multiples of the cycle length of the model \(u\).

The baseline transition probability of the event of interest can be defined as

\[ p(t_u)=1-\frac{S(t)}{S(t-u)}, \]

which can be re-written in terms of the cumulative hazard as

\[ p(t_u)=1-\frac{\text{exp}\{-H(t)\}}{\text{exp}\{-H(t-u)\}}=1-\text{exp}\{H(t-u)-H(t)\} \]

A simple example

As a demonstrative example, let’s consider the transition matrix for the assumed transition probabilities for three health states (Well, Sick and Dead) for an hypothetical experimental treatment. Let’s also assume that patients were randomised to one of two treatments, namely Experimental vs Control, and they were considered to be in either a Well or Sick state. Normally, if we kept the assumption of constant transition probabilities within the model, I would be able to compute what the transition matrix will look like after, say, \(1\) month by simply multiplying the initial probabilities by the assumed \(3\times3\) transition matrix. In general, one can compute the proportion of patients for the \(n+1\)-th step in any of these three health states by simply multiplying the updated transition matrix at step \(n\) by the given transition matrix. However, if we relax the assumption of constant transition probabilities over time, we need to create an entirely new transition matrix and specify how its entries changes over time.

For this example, I will focus on the second type of time-dependency under the assumption that time dependence is related to the time spent in a specific state, which requires expanding the state of interest with as many transient states as the number of cycles required to track the history of the cohort in that state. I will be mostly using references and take inspiration from a very nice example code in R provided by Alarid-Escudero et al. (2023). Make sure to check it out if you want to have a more in-depth look at the code and related explanations. For example, if a transition rate decreases over the first three cycles spent in a state and then is constant from the fourth cycle onward, then four transient or “tunnel” states are needed for that state. The cohort resides in each tunnel state for one cycle, after which they either exit the tunnel or transition to the next tunnel state, with the total number of states being \(n_{S\text{tunnel}}=n_{S}+n_{\text{tunnel}}-1\), where \(n_{\text{tunnel}\) is the total number of times a health state needs to be expanded to capture the relevant time dependence. The transition probability matrix also needs to be expanded to incorporate these additional transient states, resulting in a transition probability matrix of dimensions \(n_{S\text{tunnel}}\times n_{S\text{tunnel}}\). If transition probabilities are also dependent on simulation time, then we also need to expand it to a three-dimensional array with dimensions \(n_{S\text{tunnel}}\times n_{S\text{tunnel}}\times n_{T}\).

We will assume that healthy individuals face an age-specific background mortality. Typically, all-cause mortality rates can be obtained from life tables in the form of age-specific mortality hazard rates \(\mu_a\), where \(a\) refers to the specific age considered. Since this is just an example, let’s simulate age-specific mortality rates which are assumed to be specified on a yearly basis for people aged \(50-75\) years.

#define age intervals by years
age <- rep(50:75)
#set length of age vector
age_n <- length(age)
#generate age-specific mortality rates in the healthy population (Well)
mu_a <- pexp(seq(from=0.5, to=3,by=0.1),rate = 0.01) 

data_age <- data.frame(age,mu_a)

data_age
   age        mu_a
1   50 0.004987521
2   51 0.005982036
3   52 0.006975557
4   53 0.007968085
5   54 0.008959621
6   55 0.009950166
7   56 0.010939721
8   57 0.011928287
9   58 0.012915865
10  59 0.013902456
11  60 0.014888060
12  61 0.015872680
13  62 0.016856315
14  63 0.017838968
15  64 0.018820638
16  65 0.019801327
17  66 0.020781035
18  67 0.021759765
19  68 0.022737516
20  69 0.023714290
21  70 0.024690088
22  71 0.025664910
23  72 0.026638758
24  73 0.027611633
25  74 0.028583536
26  75 0.029554466

To compute the transition probability from state Well to state Sick, corresponding to the cohort’s age at each cycle, we transform the rate \(\mu_a\) to a transition probability assuming a constant exponential hazard rate within each year of age:

\[ p_{[\text{Well},\text{Sick},t]}=1-\text{exp}(-\mu_{a_0+t}), \] where \(a_{0}=50\) is the starting age of the cohort. We transform the vector of age-specific rates to a vector of probabilities by adjusting for the cycle length.

# Transform to age-specific background mortality risk for all cycles adjusting by cycle
p_a <- 1 - exp(-mu_a*age_n)

Because mortality in the state Sick is relative to background mortality which depends on age, mortality in this state will also be age-dependent. To generate the age-specific mortality in this state, we multiply the age-specific background mortality rate \(\mu_a\) by the constant hazard ratio for Sick vs Well. We then convert the resulting age-specific mortality rate to probability.

### Transition rates (annual), and hazard ratios (HRs) ----
r_ws  <- 0.15  # constant annual rate of becoming Sick when Well
p_ws <-  1 - exp(- r_ws * age_n)  # constant annual probability of becoming Sick when Healthy conditional on surviving 
r_sw  <- 0.01   # constant annual rate of becoming Well when Sick
p_sw <-  1 - exp(- r_sw * age_n)  # constant annual probability of becoming Well when Sickly conditional on surviving 
# hazard ratio of death in Sick vs Well 
hr_sick  <- 3     
## Age-specific mortality rate in the Sick state
mu_a_sick <- mu_a * hr_sick # when Sick
## Age-specific probability of dying in the Sick state
p_a_sick <- 1 - exp(-mu_a_sick)

To incorporate simulation-time dependence into the transition probability matrix, we expand the dimensions of the matrix and create a \(3\)-dimensional transition probability array, \(tm_a\) and use it in R, of dimensions \(n_S \times n_S \times n_T\). The first two dimensions of this array correspond to transitions between states, where the rows determine departing states and columns destination states. The third dimension refers to the time since the simulation started. The \(t\)-th element in the third dimension corresponds to the transition probability matrix at cycle \(t\).

First, we initialize the transition probability array with a default value of zero for all transition probabilities.

#set simulation parameters
n_t <- age_n
n_s <- 3
n_c <- 1000
state_names <- c("Well","Sick","Dead")

#initialise
tm_a <- array(0, dim=c(n_s,n_s,n_t), dimnames = list(state_names, state_names, 0:(n_t - 1)))

The code below illustrates how to assign age-dependent transition probabilities in the third dimension of the array. Most of this code is equivalent to populating the matrix tm_a for time-independent models. The only difference is the empty index added to the third dimension to represent the vector of transitions over time. For time-varying transitions, we provide a vector of transition probabilities. We only need to provide one value for the transition probability for constant transitions over time, and R replicates the value of such transitions as many times as the number of cycles (\(n_T+1\) times in our example). To ensure that the sum of each row of each cycle-specific matrix equals one, we define the probability of staying in each state by subtracting the sum of the exiting probability vectors from one after conditioning on surviving each cycle. Finally, we initialize the transition probability array for an hypothetical control group as a copy of the matrix for the treatment but assuming a different (worse) hazard ratio for moving from Sick to the Dead state.

### Fill in array
## From Well
tm_a["Well", "Well", ] <- (1 - p_a) * (1 - p_ws)
tm_a["Well", "Sick", ] <- (1 - p_a) * p_ws
tm_a["Well", "Dead", ] <- p_a
## From Sick
tm_a["Sick", "Well", ] <- (1 - p_a_sick) * p_sw
tm_a["Sick", "Sick", ] <- (1 - p_a_sick) * (1 - p_sw)
tm_a["Sick", "Dead", ] <- p_a_sick
## From D
tm_a["Dead", "Dead", ] <- 1

## Initialize transition probability matrix for control as a copy of the treatment TP matrix
tm_a0 <- tm_a

# hazard ratio of death in Sick vs Well under control
hr_sick_0  <- 3.5     
## Age-specific mortality rate in the Sick state under control
mu_a_sick_0 <- mu_a * hr_sick_0 # when Sick
## Age-specific probability of dying in the Sick state under control
p_a_sick_0 <- 1 - exp(-mu_a_sick_0)

## Update only transition probabilities from Sick to Dead involving p_S
## From Sick
tm_a0["Sick", "Well", ] <- (1 - p_a_sick_0) * p_sw
tm_a0["Sick", "Sick", ] <- (1 - p_a_sick_0) * (1 - p_sw)
tm_a0["Sick", "Dead", ] <- p_a_sick_0

Next, we initialise, set up and run the model under the assumption that everyone in the cohort starts in the Well state and progresses towards the Dead state according to age-specific and treatment-specific transition probability matrices.

# Construct state-transition models ----
## Initial state vector ----
#* All starting healthy
v_m_init <- c(Well = 1, Sick = 0, Dead = 0) # initial state vector

## Initialize cohort traces ----
### Initialize cohort trace under control ----
m_M_0 <- matrix(NA, 
              nrow = (n_t + 1), ncol = n_s, 
              dimnames = list(0:n_t, state_names))
#* Store the initial state vector in the first row of the cohort trace
m_M_0[1, ] <- v_m_init

### Initialize cohort trace for the new treatment ---
#* Structure and initial states are the same as for control
m_M_1  <- m_M_0 # trt 1

## Create transition dynamics arrays ----
#* These arrays will capture transitions from each state to another over time 
### Initialize transition dynamics array for control ----
arr_0 <- array(0,
             dim = c(n_s, n_s, n_t + 1),
             dimnames = list(state_names, state_names, 0:n_t))
#* Set first slice of arr_0 with the initial state vector in its diagonal
diag(arr_0[, , 1]) <- v_m_init
### Initialize transition-dynamics array for new treatment ----
#* Structure and initial states are the same as for control
arr_1  <- arr_0


#  Run Markov model ----
#* Iterative solution of age-dependent TPs
for(t in 1:n_t){
  ## Fill in cohort trace
  # For 0
  m_M_0[t + 1, ]  <- m_M_0[t, ]  %*% tm_a0[, , t]
  # For 1
  m_M_1[t + 1, ] <- m_M_1[t, ] %*% tm_a[, , t]
  ## Fill in transition-dynamics array
  # For 0
  arr_0[, , t + 1]  <- diag(m_M_0[t, ]) %*% tm_a0[, , t]
  # For 1
  arr_1[, , t + 1] <- diag(m_M_1[t, ]) %*% tm_a[, , t]
}


#trt names
trt_names <- c("Control","Treatment")

## Store the cohort traces in a list ----
l_m_0 <- list(Control =  m_M_0,
              Treatment   =  m_M_1)
names(l_m_0) <- trt_names

## Store the transition dynamics array for each strategy in a list ----
l_a_1 <- list(Control =  arr_0,
              Treatment   =  arr_1)
names(l_a_1) <- trt_names


## Store the transition dynamics array for each strategy in a list ----
l_a_A <- list(Control =  arr_0,
              Treatment   =  arr_1)
names(l_a_A) <- trt_names

The state vector of the cohort’s distribution at cycle \(t + 1\) is obtained by applying updating the transition probability matrix, obtained by the the matrix product between the row vector m_M_0[t, ] and the transition probability matrix tm_a0[, , t]. The vectors across all cycles are obtained by iteratively applying this update to each transition probability matrix of tm_a0 across the third dimension using a for loop. Figure 1 and Figure 2 shows the cohort’s trace for all cycles of the age-dependent model under the control and treatment group, respectively.

Figure 1: Cohort trace under control.
Figure 2: Cohort trace under treatment.

Once the transition probabilities over the desired number of cycles are estimated/filled in, further data manipulation can be carried out to estimate the expected costs and effects, which can be included in an additional vector and then multiplied with the elements of the matrices after each cycle to generate the expected total costs and effects. We need to associate to each health state and treatment in the model a corresponding value for each of the outcomes that we want to measure, i.e. costs and utilities. Thus, we may assume that (yearly) utilities for each health state are \(u^{well}=1\), \(u^{sick}=0.75\), and \(u^{dead}=0\) (across treatments). We may also assume that (yearly) costs vary between treatment groups and health states (Well, Sick, Dead), i.e. \(c_{trt}=(2000,16000,0)\) and \(c^{ctr}=(2000,4000,0)\). We also include utility and cost specific (yearly) values depending on whether the new treatment was received by the patient, such that: \(u^{\text{treated}}=0.95\), \(c^{\text{treated}}=12000\). Similarly, we introduce some penalties in terms of disutility or costs for those patients moving from Well to Sick or Dead state: \(u^{\text{WS}}=-0.01\), \(c^{\text{WS}}=1000\), \(c^{\text{D}}=2000\). Next, we need to determine the number of cycles of the model. In this case, we will use \(26\) cycles with each cycle being \(1\) year long. Hence, the expected costs and QALYs for each month will be generated and then added up. Since the assessment covers a period longer than one year, we also apply discounting to utilities and costs using some pre-defined (annual) discount rates: \(d^u=0.015\) and \(d^c=0.035\)

### State rewards ----
#### Costs ----
c_well    <- 2000  # annual cost of being Well
c_sick   <- 4000  # annual cost of being Sick
c_D    <- 0     # annual cost of being dead
c_1 <- 12000 # annual cost of receiving treatment 
#### Utilities ----
u_well    <- 1     # annual utility of being Healthy
u_sick   <- 0.75  # annual utility of being Sick
u_D    <- 0     # annual utility of being dead
u_1 <- 0.95  # annual utility when receiving treatment 

### Transition rewards ----
du_ws <- 0.01  # disutility when transitioning from Well to Sick
ic_ws <- 1000  # increase in cost when transitioning from Well to Sick
ic_D   <- 2000  # increase in cost when dying

### Discount weight for costs and effects ----
cycle_length <- 1 #1 year
d_c = 0.035 # annual discount rate for costs 
d_e = 0.015 # annual discount rate for QALYs,
v_dwc  <- 1 / ((1 + (d_c * cycle_length)) ^ (0:n_t))
v_dwe  <- 1 / ((1 + (d_e * cycle_length)) ^ (0:n_t))

# State Rewards ----
## Scale by the cycle length ----
#* Vector of state utilities under control
v_u_0    <- c(Well= u_well, 
                Sick = u_sick, 
                Dead  = u_D) * cycle_length
#* Vector of state costs under control
v_c_0    <- c(Well  = c_well, 
                Sick = c_sick,
                Dead  = c_D) * cycle_length
#* Vector of state utilities under trt
v_u_1   <- c(Well  = u_well, 
                Sick = u_1, 
                Dead  = u_D) * cycle_length
#* Vector of state costs under trt
v_c_1   <- c(Well  = c_well, 
                Sick = c_sick + c_1,
                Dead  = c_D) * cycle_length


## Store state rewards ----
#* Store the vectors of state utilities for each strategy in a list 
l_u <- list(Control = v_u_0,
            Treatment   = v_u_1)
#* Store the vectors of state cost for each strategy in a list 
l_c <- list(Control =  v_c_0,
            Treatment   =  v_c_1)

#* assign strategy names to matching items in the lists
names(l_u) <- names(l_c) <- trt_names

# Compute expected outcomes ----
#* Create empty vectors to store total utilities and costs 
v_tot_qaly <- v_tot_cost <- vector(mode = "numeric", length = 2)
names(v_tot_qaly) <- names(v_tot_cost) <- trt_names

We are then ready to compute the expected outcomes based on the model results.

#define number of strategies 
n_str <- 2

## Loop through each strategy and calculate total utilities and costs ----
for (i in 1:n_str) { # i <- 1
  v_u_str <- l_u[[i]]   # select the vector of state utilities for the i-th strategy
  v_c_str <- l_c[[i]]   # select the vector of state costs for the i-th strategy
  a_A_str <- l_a_A[[i]] # select the transition array for the i-th strategy
  
  ##* Array of state rewards 
  #* Create transition matrices of state utilities and state costs for the i-th strategy 
  m_u_str   <- matrix(v_u_str, nrow = n_s, ncol = n_s, byrow = T)
  m_c_str   <- matrix(v_c_str, nrow = n_s, ncol = n_s, byrow = T)
  #* Expand the transition matrix of state utilities across cycles to form a transition array of state utilities
  a_R_u_str <- array(m_u_str, 
                     dim      = c(n_s, n_s, n_t + 1),
                     dimnames = list(state_names, state_names, 0:n_t))
  # Expand the transition matrix of state costs across cycles to form a transition array of state costs
  a_R_c_str <- array(m_c_str, 
                     dim      = c(n_s, n_s, n_t + 1),
                     dimnames = list(state_names, state_names, 0:n_t))
  
  ##* Apply transition rewards
  #* Apply disutility due to transition from Well to Sick
  a_R_u_str["Well", "Sick", ]      <- a_R_u_str["Well", "Sick", ] - du_ws
  #* Add transition cost per cycle due to transition from Well to Sick
  a_R_c_str["Well", "Sick", ]      <- a_R_c_str["Well", "Sick", ] + ic_ws
  #* Add transition cost  per cycle of dying from all non-dead states
  a_R_c_str[-n_s, "Dead", ] <- a_R_c_str[-n_s, "Dead", ] + ic_D
  
  ###* Expected QALYs and costs for all transitions per cycle
  #* QALYs = life years x QoL
  a_Y_c_str <- a_A_str * a_R_c_str
  a_Y_u_str <- a_A_str * a_R_u_str 
  
  ###* Expected QALYs and costs per cycle
  ##* Vector of QALYs and costs
  v_qaly_str <- apply(a_Y_u_str, 3, sum) # sum the proportion of the cohort across transitions 
  v_cost_str <- apply(a_Y_c_str, 3, sum) # sum the proportion of the cohort across transitions
  
  ##* Discounted total expected QALYs and Costs per strategy
  #* QALYs
  v_tot_qaly[i] <- t(v_qaly_str) %*% (v_dwe)
  #* Costs
  v_tot_cost[i] <- t(v_cost_str) %*% (v_dwc)
}

We can then assess the model results, for example by looking at the total cost and QALY difference over the model time horizon.

v_tot_cost
  Control Treatment 
  32246.3  108303.2 
v_tot_qaly
  Control Treatment 
 7.794361  9.458081 

from which an estimate of the mean costs and QALYs difference between the two groups can be obtained as

#experimental
mu_c_exp <- v_tot_cost[2]
mu_e_exp <- v_tot_qaly[2]

#control
mu_c_ctr <- v_tot_cost[1]
mu_e_ctr <- v_tot_qaly[1]

mu_c_diff <- mu_c_exp-mu_c_ctr
mu_e_diff <- mu_e_exp-mu_e_ctr

and from this an estimate of the ICER can be obtained:

\[ \text{ICER} = \frac{1.0830317\times 10^{5} - 3.2246297\times 10^{4}}{9.4580812-7.7943606} =\frac{7.6056876\times 10^{4}}{1.6637206} = 4.571493\times 10^{4} \]

Perhaps next time I will focus on how to perform sensitivity analysis for these models. Thanks for following and till the next time!

References

Alarid-Escudero, Fernando, Eline Krijkamp, Eva A Enns, Alan Yang, MG Myriam Hunink, Petros Pechlivanoglou, and Hawre Jalal. 2023. “A Tutorial on Time-Dependent Cohort State-Transition Models in r Using a Cost-Effectiveness Analysis Example.” Medical Decision Making 43 (1): 21–41.
Caldwell, Deborah. 2007. “Decision Modelling for Health Economic Evaluation. A Briggs, m Sculpher, k Claxton.” Oxford University Press.