Please use this identifier to cite or link to this item: http://hdl.handle.net/11455/36779
標題: 植物病害發生率中解決過度離勢方法之研究
Research of Modeling Overdispersion for Analyzing Plant Disease Incidence Data
作者: 陳珮芬
Chen, Pei-Fen
關鍵字: Hierarchical sampling;層次取樣法;Heterogeneity;Overdispersion;Beta-binomial model;Williams model;Generalized linear mixed model;異質性;過度離勢;貝他-二項模式;威廉斯模式;廣義線性混合模式
出版社: 農藝學系所
引用: Agresti, A. 2002. Categorical data analysis. 2nd ed. New York:Wiley. Breslow, N.R. and D.G. Clayton. 1993. Approximate inference in generalized linear mixed models. Journal of the American Statistical Association 88:9-25. Brown, H. and R. Prescott. 1999. Applied mixed models in medicine. Chichester : John Wiley & Sons. Collett, D. 2003. Modelling Binary Data. London: Chapman & Hall. Dorfman, R. 1943. The detection of defective members of large populations. Ann. Math. Statist. 14:436-440. Engel, B. and A. Keen. 1994. A simple approach for the analysis of generalized mixed linear models. Statistica Neerlandica 48:1-22. Hinde, J. and C. G. B. Demétrio. 1998. Overdispersion : Models and estimation. Comput. Statist. Data Anal. 27: 151-170. Hughes, G. and T. R. Gottwald. 1998. Survey Methods for Assessment of Citrus Tristeza Virus Incidence. Phytopathology 88:715-723. Hughes, G. and L.V. Madden. 1995. Some methods allowing for aggregated patterns of disease incidence in the analysis of data from designed experiments. Plant pathol. 44:927-43. Hughes, G., L. V. Madden, and G. P. Munkvold. 1996. Cluster sampling for Disease incidence data. Phytopathology 86:132-137. Madden, L.V., M. Nita and W.W. Turechek. 2002. Evaluation of Generalized Linear Mixed Models for Analyzing Disease Incidence Data Obtained in Designed Experiments. Plant disease 86:316-325. McCullagh, P. and J.A. Nelder. 1989. Generalized Linear Models. 2nd ed. London:Chapman&Hall. McCulloch, C. E. and S. R. Searle. 2001. Generalized, linear, and mixed models. New York:Wiley. Moore, D. F. 1987. Modeling the extraneous variance in the presence of extra-binomial variation. Appl. Statist. 36:8-14. Oliver, S. and F. J. Pierce. 2002. Contemporary statistical models for the plant and soil sciences. Boca Raton :CRC Press. Piepho, J.C. 1999. Analysing disease incidence data from designed experiments by generalized linear mixed models. Plant pathol. 48:668-674. Verbeke, G. and G. Molenberghs. 2000. Linear mixed models for longitudinal data. New York: Springer. Wedderburn, R. W. M. 1974. Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method. Biometrika 61:439-447. Williams, D. A. 1975. The analysis of binary responses from toxicological experiments involving reproduction and teratogenicity. Biometrics 31:949-952. Williams, D. A. 1982. Extra-binomial variation in logistic linear models. Appl. Statist. 31:144-148. Wolfinger, R. and M. O'Connell. 1993. Generalized linear mixed models: A pseudo-likelihood approach. J. Stat. Comput. Simul. 48:223-243.
摘要: 
在植物流行病學上病害發生率(即健康或罹病植株之二元資料)常具有群聚(aggregated)現象,當資料具有群聚現象時常會造成資料異質性(heterogeneity)的產生,使得進行統計分析時出現過度離勢(overdispersion)現象,過度離勢是指所欲估計參數的實際變異數大於原先期望的變異數,此現象會使得進行統計推論時發生謬誤,是分析的過程中極待克服之問題。近幾年來以統計方法在植物流行病學上處理群聚病害發生率之應用研究,主要為運用貝他-二項模式(Beta-binomial model)和威廉斯模式(Williams model)在於葡萄病害之研究,以及使用廣義線性混合模式(Generalized Linear Mixed Model; GLMM)於葡萄和草莓病害之調查。
本研究旨在探討病害發生率的研究中,以層次取樣法(Hierarchical sampling)進行抽樣,並以實際資料葡萄露菌病和草莓葉枯病的病害發生率為例,藉由計算離勢參數、-2 Res Log Likelihood、AIC、BIC和繪製殘差圖,比較邏輯斯回歸、貝他-二項模式、威廉斯模式與廣義線性混合模式四種模式對資料進行配適情形,來探討這些方法克服過度離勢的能力並比較其優劣。最後並用模擬的方法,模擬四組不同離散程度下之數據,以上述四種方法進行配適,探討病害發生率在層次取樣法下,各模式解決過度離勢的能力,冀望所得之結果能對植物流行病學上之統計方法運用有所助益。

Incidence data tend to be clustered or aggregated in plant epidemiology. The data with cluster exhibit overdispersion, a phenomenon known as heterogeneity when the statistical analysis is going on. Overdispersion means that the actual variance of the interested parameter exceeds the expected variance. It leads to the incorrect estimate of standard error for the interested parameter. Thus, the final conclusion is often misled. Recently, there have existed a lot literatures discussing overdispersion phenomenon in the plant epidemiology, for example: Beta-binomial model and Williams model for grape research and Generalized Linear Mixed Model (GLMM) for the investigations of grape and strawberry.
The objective of this research is to evaluate the different methods for analyzing the aggregated plant disease incidence data at hierarchical sampling. By heterogeneity factor criterion, it will be investigated to use Logistic regression, Beta-binomial model, Williams method and GLMM to analyze the secondary data collected at the incidence of Downy mildew of grape and Phomopsis leaf blight of strawberry. Also, there will be a thorough discussion on the advantage and disadvantage among the different model approaches according to -2 Res Log Likelihood, AIC, BIC and residual plots. Finally, these models will be performed in order to analyze four simulated data sets. The result of the study can provide some objective suggestions for analyzing the binary data with overdispersion in the plant epidemiology.
URI: http://hdl.handle.net/11455/36779
其他識別: U0005-1108200614014400
Appears in Collections:農藝學系

Show full item record
 

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.