Package 'coxrt' reference manual

Title:	Cox Proportional Hazards Regression for Right Truncated Data
Description:	Fits Cox regression based on retrospectively ascertained times-to-event. The method uses Inverse-Probability-Weighting estimating equations.
Authors:	Bella Vakulenko-Lagun, Micha Mandel, Rebecca A. Betensky
Maintainer:	Bella Vakulenko-Lagun <[email protected]>
License:	GPL (>= 2)
Version:	1.0.2
Built:	2025-02-07 05:28:35 UTC
Source:	https://github.com/bella2001/coxrt

Fits Cox Regression Model Using Right Truncated Data

Description

Estimates covariate effects in a Cox proportional hazard regression from right-truncated survival data assuming positivity, that is P(lifetime>max(right) | Z=0)=0.

Usage

coxph.RT(formula, right, data, bs = FALSE, nbs.rep = 500,
  conf.int = 0.95)
coxph.RT(formula, right, data, bs = FALSE, nbs.rep = 500,
  conf.int = 0.95)

Arguments

`formula`	a formula object, with the response on the left of a ~ operator, and covariates on the right. The response is a target lifetime variable.
`right`	a right truncation variable.
`data`	a data frame that includes the variables used in both sides of `formula` and in `right`. The observations with missing values in one of the variables are dropped.
`bs`	logical value: if `TRUE`, the bootstrap esimator of standard error, confidence interval, and confidence upper and lower limits for one-sided confidence intervals based on the bootstrap distribution are calculated. The default value is `FALSE`.
`nbs.rep`	number of bootstrap replications. The default number is 200.
`conf.int`	The confidence level for confidence intervals and hypotheses tests. The default level is 0.95.

Details

When positivity does not hold, the estimator of regression coefficients will be biased. But if all the covariates are independent in the population, the Wald test performed by this function is still valid and can be used for testing partial hypotheses about regression coefficients even in the absence of positivity. If the covariates are not independent and positivity does not hold, the partial tests cannot guarantee the correct level of type I error.

Value

A list with components:

`coef`	an estimate of regression coefficients
`var`	covariance matrix of estimates of regression coefficients based on the analytic formula
`n`	the number of observations used to fit the model
`summary`	a data frame with a summary of fit:

coef a vector of coefficients
exp.coef exponent of regression coefficients (=hazard ratio)
SE asymptotic standard error estimate based on the analytic formula derived in Vakulenko-Lagun et al. (2018)
CI.L lower confidence limit for two-sided hypothesis H₀: $\beta$ _i = 0
CI.U upper confidence limit for two-sided hypothesis H₀: $\beta$ _i = 0
pvalue p-value from a Wald test for a two-sided hypothesis H₀: $\beta$ _i = 0
pvalue.H1.b.gr0 p-value from the Wald test for a one-sided partial hypothesis H₀: $\beta$ _i $\le 0$ based on the analytical asymptotic standard error estimate
pvalue.H1.b.le0 p-value from the Wald test a for one-sided partial hypothesis H₀: $\beta$ _i $\ge 0$ based on the analytical asymptotic standard error estimate

`bs`	if the input argument `bs` was TRUE, then an output list also includes an element `bs` with
	statistics from the bootstrap distribution of estimated coefficients:

num.bs.rep number of bootsrap replications used to obtain the sample distribution
var estimated variance
summary a data frame with a summary of bootstrap distribution that includes: SE, a bootstrap estimated standard error; CI.L, a quantile estimated lower confidence limit for two-sided hypothesis H₀: $\beta$ _i = 0; CI.U, a quantile estimated upper confidence limit for two-sided hypothesis H₀: $\beta$ _i = 0; CI.L.H1.b.gr0, a quantile estimated the limit for one-sided hypothesis H₀: $\beta$ _i $\le 0$ ; CI.U.H1.b.le0, a quantile estimated the limit for one-sided hypothesis H₀: $\beta$ _i $\ge 0$ .

Examples

# loading AIDS data set
library(gss)
data(aids)
all <- data.frame(age=aids$age, ageg=as.numeric(aids$age<=59), T=aids$incu, R=aids$infe, hiv.mon =102-aids$infe)
all$T[all$T==0] <- 0.5 # as in Kalbfeisch and Lawless (1989)
s <- all[all$hiv.mon>60,] # select those who were infected in 1983 or later
# analysis assuming positivity
# we request bootstrap SE estimate as well:
sol <- coxph.RT(T~ageg, right=R, data=s, bs=FALSE)
sol
sol$summary # print the summary of fit based on the analytic Asymptotic Standard Error estimate

# loading AIDS data set
library(gss)
data(aids)
all <- data.frame(age=aids$age, ageg=as.numeric(aids$age<=59), T=aids$incu, R=aids$infe, hiv.mon =102-aids$infe)
all$T[all$T==0] <- 0.5 # as in Kalbfeisch and Lawless (1989)
s <- all[all$hiv.mon>60,] # select those who were infected in 1983 or later
# analysis assuming positivity
# we request bootstrap SE estimate as well:
sol <- coxph.RT(T~ageg, right=R, data=s, bs=FALSE)
sol
sol$summary # print the summary of fit based on the analytic Asymptotic Standard Error estimate

Fits Cox Regression with Adjustment for the Lack of Positivity

Description

Estimates covariate effects in a Cox proportional hazard regression from right truncated survival data for a given value of a0=P(lifetime>max(right) | Z=0). This probability reflects the chance of falling to the right of the upper bound of the support of the right truncation variable in the reference stratum where all the covariates are zero. Right truncation might result in a completely unobserved right tail of the distribution of the target lifetime. That means that it can happen there will be no "representatives" in a sample from the right tail. Ignoring this and using coxph.RT in general will result in biased estimation of regression coefficients, whereas coxph.RT.a0 allows to account for this violation.

Usage

coxph.RT.a0(formula, right, data, a0 = 0, bs = FALSE, nbs.rep = 200,
  conf.int = 0.95)
coxph.RT.a0(formula, right, data, a0 = 0, bs = FALSE, nbs.rep = 200,
  conf.int = 0.95)

Arguments

`formula`	a formula object, with the response on the left of a ~ operator, and covariates on the right. The response is a target lifetime variable.
`right`	a right truncation variable.
`data`	a data frame that includes the variables used in `formula` and in `right`.
`a0`	probability of falling into the unobservable region in the stratum of `Z=0`, i.e. `P(lifetime>max(right) \| Z=0)`. By default `a0=0`, which is equivalent to assuming positivity.
`bs`	logical value: if TRUE, the bootstrap esimator of standard error, confidence interval, and confidence upper and lower limits for one-sided confidence intervals based on the bootstrap distribution are calculated. The default value is FALSE.
`nbs.rep`	number of bootstrap replications. The default number is 200.
`conf.int`	The confidence level for confidence intervals and hypotheses tests. The default level is 0.95.

Value

a list with components:

`convergence`	convergence code as returned by `BBsolve`.
	`convergence` > 0 means that the algorithm diverged and a solution was not found.
	`BBsolve` is used with a default parameters setting.
`coef`	a vector of estimated regression coefficients.
`var`	covariance matrix of regression coefficients, if the input argument `bs` was `TRUE`;
	`NULL`, otherwise.
`n`	the number of observations used to fit the model.
`a0`	plugged-in value of `a0`.
`bs`	if the input argument `bs` was TRUE, then an output list also includes an element `bs`
	with statistics from the bootstrap distribution of estimated coefficients:

num.bs.rep number of successful bootsrap replications used to obtain the sample distribution
var estimated variance of regression coefficients
summary a data frame with a summary of bootstrap distribution that includes: coef, a vector of estimated regression coefficients; exp.coef, an exponent of regression coefficients (=hazard ratio); SE, a bootstrap estimated standard error; CI.L, a quantile estimated lower confidence limit for two-sided hypothesis H₀: $\beta$ _i = 0; CI.U, a quantile estimated upper confidence limit for two-sided hypothesis H₀: $\beta$ _i = 0; CI.L.H1.b.gr0, a quantile estimated the limit for one-sided hypothesis H₀: $\beta$ _i $\le 0$ ; CI.U.H1.b.le0, a quantile estimated the limit for one-sided hypothesis H₀: $\beta$ _i $\ge 0$ .

Examples

# loading AIDS data set
library(gss)
data(aids)
all <- data.frame(age=aids$age, ageg=as.numeric(aids$age<=59), T=aids$incu, R=aids$infe, hiv.mon =102-aids$infe)
all$T[all$T==0] <- 0.5 # as in Kalbfeisch and Lawless (1989)
s <- all[all$hiv.mon>60,] # select those who were infected in 1983 or later

# analysis using adjusted estimating equations for a0=0.2
sol.02 <- try(coxph.RT.a0(T~ageg, right=R, data=s, a0=0.2, bs=FALSE))
sol.02
# for a0=0
sol <- try(coxph.RT(T~ageg, right=R, data=s, bs=FALSE) )
sol$summary # print the summary of fit based on the asymptotic SE estimate


# sensitivity analysis for different values of a0
a_ <- seq(0.05, 0.55, by=0.05)
est <- NULL

for(q in 1:length(a_))
{
  sol.a <- try(coxph.RT.a0(T~ageg, right=R, data=s, a0=a_[q], bs=FALSE))
  if (sol.a$convergence!=0)
  {
    cat("a0=", a_[q], ". Error occurred in BBsolve.\n")
  } else
  {
    cat("a=", a_[q]," ", " IPW.adj.est=", sol.a$coef, "\n")
    est <- c(est, sol.a$coef)
  }
}
require(ggplot2)
res.d <- data.frame(a0=c(0, a_), beta=c(sol$coef, est))

p <- ggplot(res.d, aes(x=a0, y=beta)) +
  geom_line() + geom_point() +
  geom_hline(yintercept=0)
p + xlab(expression( paste(a[0], "=P(T>", r['*']," | z=0)" , sep="")) )+
  ylab(expression( paste(hat(beta), "(", a[0], ")" , sep="")) ) +
  scale_x_continuous(breaks=res.d$a0, labels=res.d$a0) +
  theme(axis.text.x = element_text(face="bold", angle=45),
        axis.text.y = element_text(face="bold"))

# loading AIDS data set
library(gss)
data(aids)
all <- data.frame(age=aids$age, ageg=as.numeric(aids$age<=59), T=aids$incu, R=aids$infe, hiv.mon =102-aids$infe)
all$T[all$T==0] <- 0.5 # as in Kalbfeisch and Lawless (1989)
s <- all[all$hiv.mon>60,] # select those who were infected in 1983 or later

# analysis using adjusted estimating equations for a0=0.2
sol.02 <- try(coxph.RT.a0(T~ageg, right=R, data=s, a0=0.2, bs=FALSE))
sol.02
# for a0=0
sol <- try(coxph.RT(T~ageg, right=R, data=s, bs=FALSE) )
sol$summary # print the summary of fit based on the asymptotic SE estimate


# sensitivity analysis for different values of a0
a_ <- seq(0.05, 0.55, by=0.05)
est <- NULL

for(q in 1:length(a_))
{
  sol.a <- try(coxph.RT.a0(T~ageg, right=R, data=s, a0=a_[q], bs=FALSE))
  if (sol.a$convergence!=0)
  {
    cat("a0=", a_[q], ". Error occurred in BBsolve.\n")
  } else
  {
    cat("a=", a_[q]," ", " IPW.adj.est=", sol.a$coef, "\n")
    est <- c(est, sol.a$coef)
  }
}
require(ggplot2)
res.d <- data.frame(a0=c(0, a_), beta=c(sol$coef, est))

p <- ggplot(res.d, aes(x=a0, y=beta)) +
  geom_line() + geom_point() +
  geom_hline(yintercept=0)
p + xlab(expression( paste(a[0], "=P(T>", r['*']," | z=0)" , sep="")) )+
  ylab(expression( paste(hat(beta), "(", a[0], ")" , sep="")) ) +
  scale_x_continuous(breaks=res.d$a0, labels=res.d$a0) +
  theme(axis.text.x = element_text(face="bold", angle=45),
        axis.text.y = element_text(face="bold"))

A Package to Fit the Cox Regression from Right Truncated Data

Description

The method assumes that truncation is independent of covariates, and of lifetime, and that there is no censoring. The method uses Inverse-Probability-Weighting estimating equations with stabilized weights, IPW-S and IPW-SA, as described in Vakulenko-Lagun et al. (2018). Currently the code allows only time-independent covariates.

Details

The coxrt package provides two functions: coxph.RT (IPW-S) that assumes positivity and coxph.RT.a0 (IPW-SA) that allows for adjustment of estimation using plugged-in a0. The illustrative examples in these functions include analysis of AIDS latency data with age as a covariate, where the AIDS cases were retrospectively ascertained at June 30, 1986, and only those who developed AIDS by that time were included in the analysis (Kalbfeisch and Lawless, 1989).

References

Vakulenko-Lagun, B., Mandel, M., Betensky, R.A. Inverse probability weighting methods for Cox regression with right-truncated data. 2019, submitted to Biometrics

Kalbfeisch, J.D. and Lawless, J.F. Inference based on retrospective ascertainment: an analysis of the data on transfusion-related AIDS. Journal of the American Statistical Association, 84 (406):360-372, 1989.

Package 'coxrt'

Help Index

Fits Cox Regression Model Using Right Truncated Data

Description

Usage

Arguments

Details

Value

See Also

Examples

Fits Cox Regression with Adjustment for the Lack of Positivity

Description

Usage

Arguments

Value

See Also

Examples

A Package to Fit the Cox Regression from Right Truncated Data

Description

Details

References