Title: Evaluation of the top gene number
Author: Li, Chia-Chin
Keywords: Gene expression profiles; Prediction; Gene ranking; Dimension reduction; Proportional hazards model
Publisher: Department of Applied Mathematics
An important application of microarray gene expression data in statistical analysis is predicting patients' clinical outcomes. Accurate selection of significant genes is a crucial step in building a prediction model that performs well. In this study, we rank the genes of lung cancer patients by two statistics separately, the p-value and the Cox score, and then choose the optimal number of top genes by examining how the number of top-ranked genes affects prediction under principal component analysis, supervised principal components, and partial least squares, each combined with the Cox proportional hazards model. Finally, we use the selected significant genes to rebuild a predictive model for each method and compare the results with methods from other references. We assess predictive performance using three different evaluation criteria. The results show that our predictive methods, applied after the gene selection procedure, achieve better predictive performance.
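
The ranking and dimension-reduction steps described above can be illustrated with a minimal Python sketch. It is not the thesis's own code. It assumes the Cox score is the univariate score test statistic of the Cox partial likelihood at a zero coefficient, with tied event times handled Breslow-style, and it uses plain principal component analysis on the top-ranked genes. The function names `cox_scores` and `top_gene_components` and the parameters `k` and `n_components` are hypothetical.

```python
import numpy as np


def cox_scores(X, time, event):
    """Univariate Cox score statistic (score test at beta = 0) per gene.

    X     : (n_samples, n_genes) expression matrix
    time  : (n_samples,) follow-up times
    event : (n_samples,) 1 if the event was observed, 0 if censored
    """
    X = np.asarray(X, dtype=float)
    time = np.asarray(time, dtype=float)
    event = np.asarray(event).astype(bool)

    score = np.zeros(X.shape[1])  # numerator: sum of (x_i - risk-set mean)
    info = np.zeros(X.shape[1])   # denominator: sum of risk-set variances
    for i in np.flatnonzero(event):
        at_risk = time >= time[i]        # subjects still under observation
        X_risk = X[at_risk]
        score += X[i] - X_risk.mean(axis=0)
        info += X_risk.var(axis=0)       # population variance over risk set

    # Genes with zero information (e.g. constant expression) get a score of 0.
    return score / np.sqrt(np.where(info > 0, info, np.inf))


def top_gene_components(X, scores, k, n_components=2):
    """Keep the k genes with the largest |score| and project them onto
    their leading principal components (PCA via SVD)."""
    X = np.asarray(X, dtype=float)
    top = np.argsort(-np.abs(scores))[:k]          # indices of top-k genes
    X_top = X[:, top] - X[:, top].mean(axis=0)     # center each gene
    U, s, _ = np.linalg.svd(X_top, full_matrices=False)
    return top, U[:, :n_components] * s[:n_components]
```

In a workflow like the one the abstract describes, the returned components would serve as covariates in a Cox proportional hazards model, and the fit would be repeated over a grid of `k` values to see how the number of top genes affects a chosen prediction criterion. The model-fitting step and the evaluation criteria are not shown here.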
