§ 瀏覽學位論文書目資料
  
系統識別號 U0002-2807201013420700
DOI 10.6846/TKU.2010.01057
論文名稱(中文) 以隨機效應模型分析區間設限下的多變量存活資料
論文名稱(英文) A flexible model approach for regression analysis of multivariate interval-censored data
第三語言論文名稱
校院名稱 淡江大學
系所名稱(中文) 統計學系碩士班
系所名稱(英文) Department of Statistics
外國學位學校名稱
外國學位學院名稱
外國學位研究所名稱
學年度 98
學期 2
出版年 99
研究生(中文) 林坤宏
研究生(英文) Kuen-Hung Lin
學號 697650447
學位類別 碩士
語言別 英文
第二語言別
口試日期 2010-06-29
論文頁數 81頁
口試委員 指導教授 - 陳蔓樺
委員 - 李百靈
委員 - 陳瓊梅
關鍵字(中) EM演算法
脆弱模型
區間設限
最大概似法
多變量失效時間資料
關鍵字(英) EM algorithm
frailty model
interval censoring
maximum-likelihood estimate
multivariate failure time data
第三語言關鍵字
學科別分類
中文摘要
過去已經有許多學者,在考慮每一個併發症之間彼此獨立下,對於回歸分析應用在多變量區間設限型的資料上,提出相關的邊際模型進行討論。
然而這些模型有著無法探討併發症之間相關性的問題,因此近來學者
英文摘要
For regression analysis of multivariate interval-censored data, several authors proposed some marginal model approaches, which modeled each time of interest individually. For the problem, those models does not allow for inference about the relationship or association between correlated failure times. The frailty model approach has been commonly used in the analysis of multivariate failure time data and it provides a flexible approach for directly modeling the relationship between correlated failure times.

In the thesis, we present a full likelihood approach based on the proportional hazard frailty model and estimate of regression parameters by Expectation Maximization (EM) algorithm. The method is applied to a set of bivariate interval-censored data arising from an AIDS clinical trial.
第三語言摘要
論文目次
Contents
Chapter 1 Introduction .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 1
1.1 Survival Analysis .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 2
1.2 Censored Data.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 4
1.3 Terminology and Notation .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 6
1.4 Concepts and Some Regression Models .. .. .. .. .. .. .. .. 9
Chapter 2 Model and Assumptions .. .. .. .. .. .. .. .. .. .. .. .. 11
2.1 Types of Interval-censored Analysis .. .. .. .. .. .. .. .. .. .. 12
2.2 The Proportinal Hazards Model .. .. .. .. .. .. .. .. .. .. .. .. 16
2.3 The Proportional Hazards Frailty Model.. .. .. .. .. .. .. 20
Chapter 3 Parameter Estimation .... .. .. .. .. .. .. .. .. .. .. .. .. 23
3.1 Expectation-maximization Algorithm.. .. .. .. .. .. .. .. .. 24
3.2 E-step .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 28
3.3 M-step .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 32
3.4 Estimative Procedure .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 35
Chapter 4 Application .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .... .. .. .. .. 40
4.1 Source of Data .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 41
4.2 Fundamental Analysis .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 45
4.3 Turnbull’s Algorithm.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 47
4.4 Result of Analysis .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 58
Chpater 5 Conclusion.... .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 62
5.1 Baseline Hazard Function .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 63
5.2 Future Work .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. 68
Appendix I Fisher Information Matrix .. .. .. .. .. .. .. .. .. .. 71
Appendix II ACTG181.. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .... .. .. .. 74
Reference .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .. .... .. .. .. .. .. .. .. 75
List of Tables
2.1 Summary of Case I and Case II Interval-censoring . 15
4.1 Number of Censored Patients and Censored Status 43
4.2 Numerical Check Difference Between Turnbull and KM for Blood . . . . . . . . . . . . . . . . . . . . . . . 52
4.3 Numerical Check Difference Between Turnbull and KM for Blood . . . . . . . . . . . . . . . . . . . . . . . 52
4.4 Numerical Check Difference Between Turnbull and KM for Urine . . . . . . . . . . . . . . . . . . . . . . . . 53
4.5 Numerical Check Difference Between Turnbull and KM for Urine . . . . . . . . . . . . . . . . . . . . . . . . 53
4.6 Observe Time and Estimate Survival Function under Turnbull Method . . . . . . . . . . . . . . . . . . . 57
4.7 Proportional Hazards Model . . . . . . . . . . . . . . 59
4.8 Hazard Ratio . . . . . . . . . . . . . . . . . . . . . . . . 59
4.9 Wald test: H0 :   = 0 v.s H1 :   6= 0 . . . . . . . . . . . . 60
4.10Marginal Proportional Hazards Model . . . . . . . . 61
List of Figures
3.1 The idea of algorithm . . . . . . . . . . . . . . . . . . . 35
4.1 Number of Patients in Each Type of Censoring . . . 45
4.2 (a)Pie Chart of Blood Sample. (b)Pie Chart of
Urine Sample . . . . . . . . . . . . . . . . . . . . . . . . 46
4.3 Estimates of Baseline Survival Functions for Blood
under Turnbull Method . . . . . . . . . . . . . . . . . 49
4.4 Estimates of Baseline Survival Functions for Urine
under Turnbull Method . . . . . . . . . . . . . . . . . 50
4.5 Estimates of Baseline Survival Functions for Blood
Comparing with Turnbull and KM . . . . . . . . . . 51
4.6 Estimates of Baseline Survival Functions for Urine
Comparing with Turnbull and KM . . . . . . . . . . 51
4.7 Estimates of Baseline Survival Functions for Blood
and Urine under Turnbull Method . . . . . . . . . . . 55
5.1 The Relationship of Failure Time . . . . . . . . . . . 64
參考文獻
Aalen, O. O. (1980). A model for nonparametric regression analysis of counting processes. Spring Lect. Notes Statist. 2, 1-25. Mathematical Statistics
and Probability Theory. W. Klonecki, A. Kozek, and J. Rosi′nski, editors.

Agresti, A. (2002). Categorical Data Analysis. Second edition, Wiley: New York.

Breslow, N. E. and Day, N. E. (1987). Statistical methods in cancer research,
2, The design and analysis chort studies. Lyon: IARC.

Chen, H. (2001). Weighted Semiparametric Likelihood Method for Fitting
a Proportional Odds Regression Model to Data From the Case-Cohort Design.
Journal of the American Statistical Association, 96, 1446-1457.

Chen, M., Tong, X. and Sun J. (2007). The proportional odds model for
multivariate interval-censored failure time data. Statistics in Medicine, 26,
5147-5161.

Chen, M., Tong, X. and Sun, J. (2009). A frailty model approach for re-
gression analysis of multivariate current status data. Statistics in Medicine,
28, 3424-3436.

C. F. Jeff Wu. (1983). On the Convergence Properties of the EM Algorithm.
Annals of Statistics, 11, 95-103.

Cox, D. R. (1972). Regression models and life-tables. Journal of the Royal
Statistical Society, Series B, 34, 187-220.

Cox, D. R. (1975). Partial likelihood. Biometrika, 62, 269-276.

Cox, D. R. and Oakes, D. (1984). Analysis of survival data., Chapman &
Hall: London.

Crowley, J. and Hu, M. (1977). Covariance analysis of heart transplant data.
J. Amer. Stat. Assoc., 72, 27-36.

Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977). Maximum likelihood
from incomplete data via the EM algorithm. Journal of the Royal Statistical
Society, Series B, 39, 1-38.

Finkelstein, D. M. (1986). A proportional hazard model for interval-censored
failure time data. Biometrics, 42, 845-854.

Freireich, E. O. et al. (1963). The effect of 6-mercaptopmine on the duration of steroid induced remission in acute leukemia. Blood, 21, 699-716.

Giolo, S. R. (2004). Turnbull’s nonparametric estimator for interval-censored
data. Technical Report, 81531-990-Curitiba, Paran′a, Brazil.

Goggins, W. B. and Finkelstein, D. M. (2000). A proportional hazards model for multivariate interval-censored failure time data. Biometrics, 56, 940-943

Groeneboom, P. and Wellner, J. A. (1992). Information bounds and nonparametric
maximum likelihood estimation. DMV Seminar, Band 19, Birkhauser,
New York.

Gumbel, E. J. (1960). Bivariate exponential distribution. Journal of the American
Statistical Association, 55, 698-707.

Huang, J. (1996). Efficient estimation for the proportional odds regression
model with interval censoring. The Annals of Statistics, 24, 540-568.

Huang, J. and Rossini, A. J. (1997). Sieve estimation for the proportional
odds failure-time regression model with interval censoring. Journal of the
American Statistical Association, 92, 960-967.

Huang, J. and Wellner, J. A. (1997). Interval censored survival data: a review
of recent progress. Proceedings of the First Seattle Symposium in Biostatistics:
Survival Analysis, eds. Lin, D. and Fleming, T. and SpringerVerlag, New
York, 123-169.

Kalbfleisch, J. D. and Prentice, R. L. (2002). The statistical analysis of failure
time data. Second edition, John Wiley: New York.

Kim, J. and Lee, S. (1998). Two-sample goodness-of-fit tests for additive
risk models with censored observations. Biometrika, 85, 593-603.

Kim, M. Y. and Xue, X. (2002). The analysis of multivariate interval-censored
survival data. Statistics in Medicine, 21, 3715-3726.

Kleinbaum, D. G. and Klein, M. (2005). Survival Analysis: A Self-Learing
Text. Second Edition, Spring: New York

Klein, J. P. (1992). Semiparametric estimation of random effects using the
Cox model based on the EM algorithm. Biometrics, 48, 795-806.

Kulich, M. and Lin, D. Y. (2000). Additive hazards regression with covariate
measurement error. Journal of the American Statistical Association, 95, 238-
248.

Lin, D. Y. and Ying, Z. (1994). Semiparametric analysis of the additive risk
model, Biometrika, 81, 61-71.

Lindsey, J. C. and Ryan, L. M. (1998). Tutorial in biostatistics: methods
for interval-censored data. Statistics in Medicine, 17, 219-238.

Murphy, S. A., Rossini, A. J. and Van der Vaart, A. W. (1997). Maximum
likelihood estimation in the proportional odds model. Journal of the American
Statistical Association, 92, 968-976.

Rossini, A. J. and Tsiatis, A. A. (1996). A semiparametric proportional odds
regression model for the analysis of current status data. Journal of the American Statistical Association, 91:713-721.

Schick, A. and Yu, Q. (2000). Consistency of the GMLE with mixed case
interval-censored data. Scandinavian Journal of Statistics, 27, 45-55.

Schoenbach, V. J., Kaplan, B. H., Fredman, L. and Kleinbaum, D. G. (1986).
Social ties and mortality in Evans County, Georgia. Amer. J. Epid., 123, 4,
577-591.

Sun, J. (1998). Interval censoring. Encyclopedia of Biostatistics, John Wi-
ley, First Edition, 2090-2095.

Sun, J. (2005). Interval censoring. Encyclopedia of Biostatistics, John Wi-
ley, Second Edition, 2603-2609.

Sun, J. (2006). The Statistical Analysis of Interval-censoring Failure Time
Data. Spring, New York.

Sundberg, Rolf. (1974). Maximum likelihood theory for incomplete data from
an exponential family. Scandinavian Journal of Statistics, 1, 49-58.

Tong, X., Chen, M. and Sun, J. (2008). Regression analysis of multivariate
interval-censored failure time data with application to tumorigenicity experiments. Biometrical Journal, 50, 364-374.

Turnbull, B. W. (1976). The empirical distribution function from arbitrarily grouped, censored and truncated data. Journal of the Royal Statistical
Society, Series B, 38, 290-295.

Wang, L., Sun, J. and Tong, X. (2008). Efficient estimation for the propor-
tional hazards model with bivariate current status data. Lifetime Data Anal,
14, 134-153.

Wei, L. J., Lin, D. Y. and Weissfeld, L. (1989). Regression analysis of multi-
variate incomplete failure time data by modeling marginal distributions. Journal
of the American Statistical Association, 84, 1065-1073.

Wellner, J. A. (1995). Interval-censoring case 2: alternative hypotheses. Analysis
of Censored Data (Pune, 1994/1995), eds. H. L. Koul and J. V. Deshoande,
IMS Lecture Notes, Monograph Series 27, 271-219

Yang, S. and Prentice, R. L. (1999). Semiparametric inference in the propor-
tional odds regression model. Journal of the American Statistical Association,
94, 125-136.

Yu, Q., Li, L. and Wong, G. Y. C. (2000). On consistency of the self-
consistent estimator of survival functions with interval-censored data. Sacndinavian
Journal of Statistics, 27, 35-44.

Yu, Q., Wong, G. Y. C. and He, Q. (2000). Estimation of a joint distribu-
tion function with multivariate interval-censored dara when the nonparametric
MLE is not unique. Biometrical Journal, 6, 747-763.

Zeng, D. and Lin, D.Y. (2007). Maximum Likelihood Estimation in Semipara-
metric Models with Censored Data (with discussion). Journal of the Royal
Statistical Society, Series B, 69, 507-564.

林建甫. (2008). 存活分析. 雙葉書廊: 台北.
論文全文使用權限
校內
紙本論文於授權書繳交後5年公開
同意電子論文全文授權校園內公開
校內電子論文於授權書繳交後5年公開
校外
同意授權
校外電子論文於授權書繳交後5年公開

如有問題,歡迎洽詢!
圖書館數位資訊組 (02)2621-5656 轉 2487 或 來信