Please use this identifier to cite or link to this item: http://hdl.handle.net/11455/60831
Title: The Classification Prediction of MADS-box Genes on Angiosperms Using SVM and the Construction of Related Websites
Author: Chan, Min-Cheng (詹憫正)
Keywords: MADS-box gene
SVM (support vector machine)
prediction
classification
Arabidopsis thaliana
Oncidium Gower Ramsey
Publisher: Institute of Genomics and Bioinformatics
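Abstract: MADS-box genes are important transcriptional regulators during floral organ development, and they are classified mainly by correspondence and comparison with the ABCDE model of the model plant Arabidopsis thaliana. The MIKC class, which belongs to the type II MADS-box genes, plays an important role in regulating floral organ formation. From the outside inward, the floral organs and their regulators are: sepals, controlled by class A and E genes; petals, by class A, B and E; stamens, by class B, C and E; and carpels, by class C and E. Class D genes are associated with ovule development. Within the ABCDE classes, specific combinations of gene classes are associated with the development of different floral tissues, so determining which class a given MADS-box gene belongs to may help in inferring its biochemical function. Unlike traditional classification methods that compare sequence similarity, this study takes MADS-box gene sequences of known class as input, uses the e-values obtained from BLAST as the learning set, and uses the SVM learning mechanism to build a classification model. The model was then validated with Oncidium sequences whose MADS-box class had been established experimentally, and the predictions were largely consistent with the experimental classes. iMADS is a website that integrates analysis tools. Besides the predicted class of a MADS-box gene, it presents other experimentally verified information for reference, such as conserved protein regions and likely sites of expression in the plant, so that researchers can quickly obtain relevant information and pursue more in-depth study.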
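As a rough illustration of how BLAST e-values might be turned into a learning set, the sketch below parses BLAST tabular output and builds one feature vector per query sequence. It is not the thesis's actual pipeline: the abstract says only that e-values from BLAST were used, so the -log10 transform, the best-hit-per-class feature layout, the cap of 200, and the names `evalue_to_score`, `feature_vectors` and `subject_class` are all assumptions made for this example.

```python
import math

CLASSES = ("A", "B", "C", "D", "E")
CLASS_INDEX = {name: i for i, name in enumerate(CLASSES)}
SCORE_CAP = 200.0  # assumed cap, used when BLAST reports an e-value of 0


def evalue_to_score(evalue):
    """Map an e-value to -log10(e-value), clamped to [0, SCORE_CAP]."""
    if evalue <= 0.0:
        return SCORE_CAP
    return min(SCORE_CAP, max(0.0, -math.log10(evalue)))


def parse_blast_tabular(text):
    """Yield (query id, subject id, e-value) from BLAST tabular output."""
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        # In the default 12-column layout, columns 1, 2 and 11 are
        # qseqid, sseqid and evalue.
        yield fields[0], fields[1], float(fields[10])


def feature_vectors(blast_text, subject_class):
    """Return {query id: [best score against class A, B, C, D, E]}.

    subject_class is a hypothetical mapping from reference sequence id
    to its known class letter.
    """
    features = {}
    for query, subject, evalue in parse_blast_tabular(blast_text):
        idx = CLASS_INDEX.get(subject_class.get(subject))
        if idx is None:
            continue  # hit to a sequence without a known class
        vec = features.setdefault(query, [0.0] * len(CLASSES))
        vec[idx] = max(vec[idx], evalue_to_score(evalue))
    return features
```

Each vector could then be paired with the known class of its query sequence and passed to an SVM implementation for training.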
MADS-box genes are important transcription factors in floral organ development. The classical classification model of MADS-box genes is based on Arabidopsis thaliana, in which sepals are controlled by class A and E genes; petals by class A, B and E; stamens by class B, C and E; and carpels by class C and E. Specific classes of MADS-box genes have been shown to carry out specific functions. Traditionally, the class of a MADS-box gene is estimated by phylogenetic analysis or multiple sequence alignment, which requires time-consuming collection of reference sequences, and the choice of collected data strongly affects the evaluation of the target gene. This study proposes a new method for predicting MADS-box gene classes based on similarity measures computed with the five general BLAST programs. The classification model was built with a Support Vector Machine trained on 210 MADS-box genes from different plant species and validated with 10 MADS-box genes of Oncidium Gower Ramsey. Furthermore, we constructed a web-based tool, iMADS, which integrates several web tools to save time and provides information on the putative class of a MADS-box gene, the plant tissues in which it is expressed, conserved domain search, coiled-coil prediction and evolutionary analysis. The latter three are obtained from the web tools NCBI Conserved Domain Search, COILS and Phylodendron, respectively. iMADS is an information-integrated analytic tool for MADS-box genes. It may reduce the time and cost to researchers by producing predictions quickly and presenting reliable, systematic results. The tool is publicly available at http://predictor.nchu.edu.tw/iMADS.
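The classification step can be pictured with a minimal linear SVM. The abstract does not state the kernel, software, parameters, or multi-class scheme that were used, so this is only a sketch under assumptions: a binary problem (one gene class against another), a linear kernel trained by sub-gradient descent on the hinge loss, and made-up two-dimensional similarity scores. All names and numbers here are hypothetical.

```python
def decision(w, b, x):
    """Signed score of x relative to the separating hyperplane."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def train_linear_svm(X, y, lr=0.01, lam=0.01, epochs=200):
    """Train a binary linear SVM (labels +1 / -1).

    Minimises the L2-regularised hinge loss with plain per-sample
    sub-gradient steps; lr, lam and epochs are illustrative defaults.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            violated = yi * decision(w, b, xi) < 1.0
            # The L2 penalty shrinks w slightly on every step.
            w = [wj * (1.0 - lr * lam) for wj in w]
            if violated:
                # The hinge loss is active: move the hyperplane towards xi.
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b


def predict(w, b, x):
    """Return +1 or -1 for a feature vector x."""
    return 1 if decision(w, b, x) >= 0.0 else -1


# Hypothetical training data: [similarity to class-B references,
# similarity to class-C references], labelled +1 for B and -1 for C.
X_train = [[2.0, 0.0], [3.0, 1.0], [0.0, 2.0], [1.0, 3.0]]
y_train = [1, 1, -1, -1]

w, b = train_linear_svm(X_train, y_train)
print(predict(w, b, [5.0, 0.0]), predict(w, b, [0.0, 5.0]))
```

Distinguishing all of the classes named in the abstract would need a multi-class extension, for example one such binary model per class, which this sketch leaves out.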
URI: http://hdl.handle.net/11455/60831
Other identifiers: U0005-2207201012163500
Article link: http://www.airitilibrary.com/Publication/alDetailedMesh1?DocID=U0005-2207201012163500
Appears in Collections: Institute of Genomics and Bioinformatics

Files in this item:

For the full text, please visit the Airiti Library.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.