Please use this identifier to cite or link to this item: http://hdl.handle.net/11455/95957
標題: 使用變方分析與羅吉斯回歸分析比例數據資料之比較
Comparison of Using ANOVA and Logistic Regression to Analyze the Proportional Data
作者: Szu-Wei Yang
楊斯㵟
關鍵字: analysis of variance;logistic regression;proportional data;binary data;arcsine square root transformation;變方分析;羅吉斯回歸;比例數據;二元資料;反正弦平方根轉換
引用: Agresti, A. 2002. Categorical data analysis. New York, New York: John Wiley and Sons Volume 2. Archontoulis, S. V., and F. E. Miguez. 2015. Nonlinear regression models and applications in agricultural research. Agronomy Journal 107: 786–798. Battilani, P., A. Pietri, C. Barbano, A. Scandolara, T. Bertuzzi, and A. Marocco. 2008. Logistic regression modeling of cropping systems to predict fumonisin contamination in maize. Journal of Agricultural and Food Chemistry 56: 10433–10438. Bock, J. K. 1986. Syntactic Persistence in Language Production. Cognitive Psychology 18: 355–387. Cochran, W. G. 1940. The analysis of variances when experimental errors follow the Poisson or binomial laws. The Annals of Mathematical Statistics 11: 335–347. Cox, D. R. 1958. The regression analysis of binary sequences. Journal of the Royal Statistical Society Series B 20: 215–242. Cox, D. R. 1970. The analysis of binary data. London: Chapman & Hall. Dyke, G. V., and H. D. Patterson. 1952. Analysis of factorial arrangements when the data are proportions. Biometrics 8: 1–12. Hogg, R., and A. T. Craig. 1995. Introduction into mathematical statistics. Englewood Cliffs, NJ: Prentice Hall. Hosmer, T. A., D. W. Hosmer, L. Fisher. 1983. A comparison of three methods of estimating the logistic regression coefficient. Communication Statistics Computation 12: 727-751. Jaeger, T. F. 2008. Categorical data analysis: Away from ANOVAs (transformation or not) and towards logit mixed models. Journal of Memory and Language 59: 434-446. Nakagawa, S., and H. Schielzeth.2013. A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods in Ecology and Evolution 4: 133–142. Nelder, J.A., and R.W.M. Wedderburn. 1972. Generalized linear models. Journal of the Royal Statistical Society: Series A 135: 370–384. Pickering, M. J., and H. P. Branigan. 1998. The Representation of Verbs: Evidence from Syntactic Priming in Language Production. Journal of Memory and Language 39: 633–651. Press, S. J., and S. Wilson. 1978. Choosing between logistic regression and discriminate analysis. Journal of the American Statistical Association 73: 699-705. Rao, M. M. 1960. Some asymptotic results on transformations in the analysis of variance. ARL Technical Note. Aerospace Research Laboratory, Wright-Patterson Air Force Base. Shao, J. 1999. Mathematical statistics. Springer, New York, USA. Stroup, W. W. 2015. Rethinking the analysis of non-normal data in plant and soil science. Agronomy Journal 107: 811–827. Warton, D.I., and F. K. C. Hui. 2011. The arcsine is asinine: the analysis of proportions in ecology. Ecology 92: 3-10. Winer, B. J., D. R. Brown, K. M. Michels. 1971. Statistical principles in experimental design. New York: McGraw-Hill. Wu, J., J. N. Jenkins, J. C. McCarty, and C. E. Watson. 2005. Comparisons of Two Statistical Models for Evaluating Boll Retention in Cotton. Agronomy Journal 97: 1291–1294. Yates, D. S., D. S. Moore, D. S. Starnes. 2003. The Practice of Statistics Second edition. New York: Freeman.
摘要: 
類別資料常存在於農業科學中,例如:植物病害、種子發芽、不同處理對產量高低及植株性狀影響等。對於類別資料的分析,常會將原始服從二項分布的資料轉換成比例數據進行分析,其分析方法通常是使用變方分析,但也可以使用羅吉斯回歸來進行相關的分析。為了比較變方分析與羅吉斯回歸,此兩種分析方式在二元資料所轉換的比例數據上分析之適用性,此論文進行不同情形資料數據的模擬,包括在不同樣本數、不同比例差距以及不同比例位置的情況下,進行變方分析與羅吉斯回歸分析,並以假設檢定方法之p-value去探討其檢定力,使用盒鬚圖及折線圖來呈現其結果,由此來判別在何種情況下,使用何種方法進行分析較為適當。結果顯示樣本數不論大小,比例不論接近0.1、接近0.5,變方分析與羅吉斯回歸兩種分析方法在型一誤差發生的機率α定為0.05時,型一誤差皆沒有增加;不論比例差距大小,比例越接近0.1時,羅吉斯回歸之檢定力較變方分析高,但比例越接近0.5時,雖羅吉斯回歸之檢定力與變方分析差異不明顯,但其檢定力同樣較變方分析高,因此在二元資料轉換為比例數據上的分析,使用羅吉斯回歸分析較為適當。

Categorical data analysis is commonly applied to agricultural science, such as plant disease, seed germination, effect on yields between treatments, or impact on plant traits and so on. For the analysis of categorical data, if the original data can be looked upon as a sample from a binomial distribution, it is usually transformed into proportional data before doing analysis. The frequently used way of analyzing the kind of data is analysis of variance. Logistic regression is also used in analyzing proportional data transformed from binary data. In order to compare the applicability of analysis between analysis of variance and logistic regression on the proportional data transformed from binary data, this study uses simulated data which includes different number of samples, differences of proportion, and values of proportion to compare the results for two analytic methods. The two analytic methods was compared by the power of hypothesis testing. P-value of boxplot and line chart are displayed to demonstrate the result. Finally, the results show that whether the number of samples are large, or whether the proportions are closer to 0.1 or 0.5; type I error will not be increased when using both analysis of variance and logistic regression. It also shows that whether the differences of proportion are large, or the proportions are closer to 0.1; the power tends to be higher when using logistic regression than analysis of variance. However, when proportions are closer to 0.5, the difference of power between logistic regression and ANOVA is not obvious. For the case, power still tends to be higher when using logistic regression than analysis of variance. Hence, logistic regression is more appropriate than analysis of variance in analyzing the proportional data transformed from binary data.
URI: http://hdl.handle.net/11455/95957
Rights: 同意授權瀏覽/列印電子全文服務,2018-08-01起公開。
Appears in Collections:農藝學系

Files in This Item:
File SizeFormat Existing users please Login
nchu-107-7102031026-1.pdf4.11 MBAdobe PDFThis file is only available in the university internal network   
Show full item record
 

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.