JAGS の「カウントプロセス」形式でパラメトリック生存モデルを表現する質問する

Question

rstanarmSTAN のラッパーであるパッケージを使用できます。これにより、標準の R 式表記を使用して生存モデルを記述できます。stan_surv関数は、「カウントプロセス」形式で引数を受け入れます。ワイブルを含むさまざまな基本ハザード関数を使用してモデルを適合できます。

rstanarm-関数の存続部分はstan_survCRANではまだ利用できないため、パッケージを直接インストールする必要があります。mc-stan.org。

install.packages("rstanarm", repos = c("https://mc-stan.org/r-packages/", getOption("repos")))

以下のコードをご覧ください:

library(dplyr)
library(survival)
library(rstanarm)

## Make the Data: -----
set.seed(3)
n_sub <- 1000
current_date <- 365*2

true_shape <- 2
true_scale <- 365

dat <- data_frame(person = 1:n_sub,
                  true_duration = rweibull(n = n_sub, shape = true_shape, scale = true_scale),
                  person_start_time = runif(n_sub, min= 0, max= true_scale*2),
                  person_censored = (person_start_time + true_duration) > current_date,
                  person_duration = ifelse(person_censored, current_date - person_start_time, true_duration)
)

## Split into multiple observations per person: --------
cens_point <- 300 # <----- try changing to 0 for no split; if so, model correctly estimates
dat_split <- dat %>%
  group_by(person) %>%
  do(data_frame(
    split = ifelse(.$person_duration > cens_point, cens_point, .$person_duration),
    START = c(0, split[1]),
    END = c(split[1], .$person_duration),
    TINTERVAL = c(split[1], .$person_duration - split[1]), 
    CENS = c(ifelse(.$person_duration > cens_point, 1, .$person_censored), .$person_censored), # <— edited original post here due to bug; but problem still present when fixing bug
    TINTERVAL_CENS = ifelse(CENS, NA, TINTERVAL),
    END_CENS = ifelse(CENS, NA, END)
  )) %>%
  filter(TINTERVAL != 0)
dat_split$CENS <- as.integer(!(dat_split$CENS))


# Fit STAN survival model
mod_tvc <- stan_surv(
  formula = Surv(START, END, CENS) ~ 1,
  data = dat_split,
  iter = 1000,
  chains = 2,
  basehaz = "weibull-aft")

# Print fit coefficients
mod_tvc$coefficients[2]
unname(exp(mod_tvc$coefficients[1]))

出力は真の値（）と一致するtrue_shape <- 2; true_scale <- 365：

> mod_tvc$coefficients[2]
weibull-shape 
     1.943157 
> unname(exp(mod_tvc$coefficients[1]))
[1] 360.6058

また、を使用して STAN ソースを調べ、rstan::get_stanmodel(mod_tvc$stanfit)STAN コードを JAGS で行った試行と比較することもできます。

Answer 1

rstanarmSTAN のラッパーであるパッケージを使用できます。これにより、標準の R 式表記を使用して生存モデルを記述できます。stan_surv関数は、「カウントプロセス」形式で引数を受け入れます。ワイブルを含むさまざまな基本ハザード関数を使用してモデルを適合できます。

rstanarm-関数の存続部分はstan_survCRANではまだ利用できないため、パッケージを直接インストールする必要があります。mc-stan.org。

install.packages("rstanarm", repos = c("https://mc-stan.org/r-packages/", getOption("repos")))

以下のコードをご覧ください:

library(dplyr)
library(survival)
library(rstanarm)

## Make the Data: -----
set.seed(3)
n_sub <- 1000
current_date <- 365*2

true_shape <- 2
true_scale <- 365

dat <- data_frame(person = 1:n_sub,
                  true_duration = rweibull(n = n_sub, shape = true_shape, scale = true_scale),
                  person_start_time = runif(n_sub, min= 0, max= true_scale*2),
                  person_censored = (person_start_time + true_duration) > current_date,
                  person_duration = ifelse(person_censored, current_date - person_start_time, true_duration)
)

## Split into multiple observations per person: --------
cens_point <- 300 # <----- try changing to 0 for no split; if so, model correctly estimates
dat_split <- dat %>%
  group_by(person) %>%
  do(data_frame(
    split = ifelse(.$person_duration > cens_point, cens_point, .$person_duration),
    START = c(0, split[1]),
    END = c(split[1], .$person_duration),
    TINTERVAL = c(split[1], .$person_duration - split[1]), 
    CENS = c(ifelse(.$person_duration > cens_point, 1, .$person_censored), .$person_censored), # <— edited original post here due to bug; but problem still present when fixing bug
    TINTERVAL_CENS = ifelse(CENS, NA, TINTERVAL),
    END_CENS = ifelse(CENS, NA, END)
  )) %>%
  filter(TINTERVAL != 0)
dat_split$CENS <- as.integer(!(dat_split$CENS))


# Fit STAN survival model
mod_tvc <- stan_surv(
  formula = Surv(START, END, CENS) ~ 1,
  data = dat_split,
  iter = 1000,
  chains = 2,
  basehaz = "weibull-aft")

# Print fit coefficients
mod_tvc$coefficients[2]
unname(exp(mod_tvc$coefficients[1]))

出力は真の値（）と一致するtrue_shape <- 2; true_scale <- 365：

> mod_tvc$coefficients[2]
weibull-shape 
     1.943157 
> unname(exp(mod_tvc$coefficients[1]))
[1] 360.6058

また、を使用して STAN ソースを調べ、rstan::get_stanmodel(mod_tvc$stanfit)STAN コードを JAGS で行った試行と比較することもできます。

JAGS の「カウントプロセス」形式でパラメトリック生存モデルを表現する質問する

編集：

ベストアンサー1

おすすめ記事