Recurrent event analysis

Recurrent event analysis is a branch of survival analysis that analyzes the time until recurrences occur, such as recurrences of traits or diseases. Recurrent events are often analysed in social sciences and medical studies, for example recurring infections, depressions or cancer recurrences. Recurrent event analysis attempts to answer certain questions, such as: how many recurrences occur on average within a certain time interval? Which factors are associated with a higher or lower risk of recurrence?

The processes which generate events repeatedly over time are referred to as recurrent event processes, which are different from processes analyzed in time-to-event analysis: whereas time-to-event analysis focuses on the time to a single terminal event, individuals may be at risk for subsequent events after the first in recurrent event analysis, until they are censored.

Introduction[edit]

Objectives of recurrent event analysis include:^[1]

Understanding and describing individual event processes
Identifying and characterizing variation across a population of processes
Comparing groups of processes
Determining the relationship of fixed covariates, treatments, and time-varying factors to event occurrence

Notation and frameworks[edit]

For a single recurrent event process starting at $t=0$ , let $0\leq T_{1}<T_{2}<\dots$ denote the event times, where $T_{k}$ is the time of the $k$ th event. The associated counting process $\{N(t),0\leq t\}$ records the cumulative number of events generated by the process; specifically, ${\textstyle N(t)=\sum _{k=1}^{\infty }I(T_{k}\leq t)}$ is the number of events occurring over the time interval $[0,t]$ .

Models for recurrent events can be specified by considering the probability distribution for the number of recurrences in short intervals $[t,t+\Delta t)$ , given the history of event occurrence before time $t$ . The intensity function describes the instantaneous probability of an event occurring at time $t$ , conditional on the process history, and describes the process mathematically. Define the process history as $H(t)=\{N(s):0\leq s<t\}$ , then the intensity is formally defined as

\lambda (t|H(t))=\lim _{\Delta t\downarrow 0}{\frac {P(N(t+\Delta t)-N(t)=1)}{\Delta t}}.

When a heterogenous group of individuals or processes is considered, the assumption of a common event intensity is no longer plausible. Greater generality can be achieved by incorporating fixed or time-varying covariates in the intensity function.

Description of recurrent event data[edit]

As a counterpart of the Kaplan–Meier curve, which is used to describe the time to a terminal event, recurrent event data can be described using the mean cumulative function, which is the average number of cumulative events experienced by an individual in the study at each point in time since the start of follow-up.

Statistical models for recurrent event data[edit]

Poisson model[edit]

The Poisson model is a popular model for recurrent event data, which models the number of recurrences that have occurred. Poisson regression assumes that the number of recurrences has a Poisson distribution with a fixed rate of recurrence over time. The logarithm of the expected number of recurrences is modeled by a linear combination of explanatory variables.

Marginal means/rates model[edit]

The marginal means/rates model considers all recurrent events of the same subject as a single counting process and does not require time-varying covariates to reflect the past history of the process, which makes it a more flexible model.^[2] Instead, the full history of the counting process may influence the mean function of recurrent events.

Multi-state model[edit]

In multi-state models, the recurrent event processes of individuals are described by different states. The different states may describe the recurrence number, or whether the subject is at risk of recurrence. A change of state is called a transition (or an event) and is central in this framework, which is fully characterized through estimation of transition probabilities between states and transition intensities that are defined as instantaneous hazards of progression to one state, conditional on occupying another state.^[2]

Extended Cox proportional hazards (PH) models[edit]

Extensions of the Cox proportional hazard models are popular models in social sciences and medical science to assess associations between variables and risk of recurrence, or to predict recurrent event outcomes. Many extensions of survival models based on the Cox proportional hazards approach have been proposed to handle recurrent event data. These models can be characterized by four model components:^[3]

Risk intervals
Baseline hazard
Risk set
Correction for within-subject correlation

Well-known examples of Cox-based recurrent event models are the Andersen and Gill model,^[4] the Prentice, Williams and Petersen model^[5] and the Wei-Lin–Weissfeld model^[6]

Correlated event times within subjects[edit]

Time to recurrence is often correlated within subjects, as some subjects can be more frail to experiencing recurrences. If the correlated nature of the data is ignored, the confidence intervals (CI) for the estimated rates could be artificially narrow, which may result in false positive results.

Robust variance[edit]

It is possible to use robust 'sandwich' estimators for the variance of regression coefficients. Robust variance estimators are based on a jacknife estimate, which anticipates correlation within subjects and provides robust standard errors.

Frailty models[edit]

In frailty models, a random effect is included in the recurrent event model which describes the individual excess risk that can not be explained by the included covariates. The frailty term induces dependence among the recurrence times within subjects.

References[edit]

^ The Statistical Analysis of Recurrent Events. Statistics for Biology and Health. 2007. doi:10.1007/978-0-387-69810-6. ISBN 978-0-387-69809-0.
^ ^a ^b Amorim, Leila DAF; Cai, Jianwen (2014-12-09). "Modelling recurrent events: a tutorial for analysis in epidemiology". International Journal of Epidemiology. 44 (1): 324–333. doi:10.1093/ije/dyu222. ISSN 1464-3685. PMC 4339761. PMID 25501468.
^ Kelly, Patrick J.; Lim, Lynette L-Y. (2000-01-15). <13::aid-sim279>3.0.co;2-5 "Survival analysis for recurrent event data: an application to childhood infectious diseases". Statistics in Medicine. 19 (1): 13–33. doi:10.1002/(sici)1097-0258(20000115)19:1<13::aid-sim279>3.0.co;2-5. ISSN 0277-6715. PMID 10623910.
^ Andersen, P. K.; Gill, R. D. (1982-12-01). "Cox's Regression Model for Counting Processes: A Large Sample Study". The Annals of Statistics. 10 (4). doi:10.1214/aos/1176345976. ISSN 0090-5364.
^ R. L. Prentice, B. J. Williams, A. V. Peterson, On the regression analysis of multivariate failure time data, Biometrika, Volume 68, Issue 2, August 1981, Pages 373–379, doi:10.1093/biomet/68.2.373
^ Wei, L. J.; Lin, D. Y.; Weissfeld, L. (1989). "Regression Analysis of Multivariate Incomplete Failure Time Data by Modeling Marginal Distributions". Journal of the American Statistical Association. 84 (408): 1065–1073. doi:10.1080/01621459.1989.10478873. ISSN 0162-1459.

[1] The Statistical Analysis of Recurrent Events. Statistics for Biology and Health. 2007. doi:10.1007/978-0-387-69810-6. ISBN 978-0-387-69809-0.

[:0-2] Amorim, Leila DAF; Cai, Jianwen (2014-12-09). "Modelling recurrent events: a tutorial for analysis in epidemiology". International Journal of Epidemiology. 44 (1): 324–333. doi:10.1093/ije/dyu222. ISSN 1464-3685. PMC 4339761. PMID 25501468.

[3] Kelly, Patrick J.; Lim, Lynette L-Y. (2000-01-15). <13::aid-sim279>3.0.co;2-5 "Survival analysis for recurrent event data: an application to childhood infectious diseases". Statistics in Medicine. 19 (1): 13–33. doi:10.1002/(sici)1097-0258(20000115)19:1<13::aid-sim279>3.0.co;2-5. ISSN 0277-6715. PMID 10623910.

[4] Andersen, P. K.; Gill, R. D. (1982-12-01). "Cox's Regression Model for Counting Processes: A Large Sample Study". The Annals of Statistics. 10 (4). doi:10.1214/aos/1176345976. ISSN 0090-5364.

[5] R. L. Prentice, B. J. Williams, A. V. Peterson, On the regression analysis of multivariate failure time data, Biometrika, Volume 68, Issue 2, August 1981, Pages 373–379, doi:10.1093/biomet/68.2.373

[6] Wei, L. J.; Lin, D. Y.; Weissfeld, L. (1989). "Regression Analysis of Multivariate Incomplete Failure Time Data by Modeling Marginal Distributions". Journal of the American Statistical Association. 84 (408): 1065–1073. doi:10.1080/01621459.1989.10478873. ISSN 0162-1459.

[1]

[2]

[3]

[4]

[5]

[6]