Welcome to Acta Agronomica Sinica,

Acta Agron Sin ›› 2007, Vol. 33 ›› Issue (01): 70-76.

• ORIGINAL PAPERS • Previous Articles     Next Articles

A Maximum Likelihood-Based Dynamic Clustering Method and Its Application

XIAO Jing,HU Zhi-Qiu,WANG Xue-Feng,XU Chen-Wu*   

  1. Jiangsu Provincial Key Laboratory of Crop Genetics and Physiology, Yangzhou University, Yangzhou 225009, Jiangsu, China
  • Received:2005-11-23 Revised:1900-01-01 Online:2007-01-12 Published:2007-01-12
  • Contact: XU Chen-Wu

Abstract:

Clustering analysis is to determine the intrinsic grouping in a set of unlabeled data. A cluster is a collection of objects which are similar between them and are dissimilar to the objects belonging to other clusters. However, the current clustering techniques have not addressed all the requirements adequately. For instance, dealing with large number of dimensions and large number of data can be problematic because of time complexity. The effectiveness of the distance-based clustering methods depends on the definition of distance; if an obvious distance measure doesn’t exist we must define it, which is not always easy, especially in multi-dimensional spaces. In addition, the choice of the optimal number of clusters in practice is impossible. Thus, choosing the correct number of clusters and the best clustering method is still a question open to discussion. In order to solve these problems, in this paper, we introduced a maximum likelihood-based dynamic clustering method, which combined the conventional dynamic clustering and discrimination analysis. The parameters of different clusters were estimated by the maximum likelihood method implemented via expectation-maximization (EM) algorithm and the objects were classified by the Bayesian posterior probability. This classified idea could increase the posterior confidence of classified individuals. The results of simulation studies showed that the proposed method not only unbiasedly estimated the corresponding cluster parameters but also differentiated the optimum clustering numbers by Bayesian information criterion (BIC). Compared with the K-means method and the minimum square sum within groups (MinSSw) method, the proposed method was more robustness and had almost the same clustering accuracy as K-means and MinSSw methods. Moreover, the misclassified rate (MR) could be reduced by enhancing the discrimination criterion. However, the unclassified rate (UR) would be increased by enhancing the discrimination criterion. Thus, an eclectic discrimination criterion could be given by the user in order to decrease both MR and UR. The result indicated that the proposed method had a significant advantage on clustering accuracy compared to the K-means and MinSSw methods. An example of the plant height and the number of tiller of F2 population in rice cross Duonieai×Zhonghua 11 was used in the illustration. The results listed in Table 6 indicated that the genetic difference of these two traits in this cross involves only one pleiotropic major gene. The additive effect and dominance effect of the major gene were estimated as -24.57 cm and 57.12 cm on plant height, and 23.01 and -25.89 on number of tiller, respectively. The major gene shows overdominance for plant height and near complete dominance for number of tillers.

Key words: Cluster analysis, Posterior probability, Bayesian information criterion, Discrimination analysis

[1] ZHANG Yi-Zhong, ZENG Wen-Yi, DENG Lin-Qiong, ZHANG He-Cui, LIU Qian-Ying, ZUO Tong-Hong, XIE Qin-Qin, HU Deng-Ke, YUAN Chong-Mo, LIAN Xiao-Ping, ZHU Li-Quan. Codon usage bias analysis of S-locus genes SRK, SLG, and SP11/SCR in Brassica oleracea [J]. Acta Agronomica Sinica, 2022, 48(5): 1152-1168.
[2] WANG Rui-Yun,JI Xu,LU Ping,LIU Min-Xuan,XU Yue,WANG Lun,WANG Hai-Gang,QIAO Zhi-Jun. Analysis of Genetic Diversity in Common Millet (Panicum miliaceum) Using Fluorescent SSR in China [J]. Acta Agron Sin, 2017, 43(04): 530-548.
[3] HU Yi-Bo, YANG Xiu-Shi, LU Ping*,REN Gui-Xing*. Diversity and Correlation of Quality Traits in Quinoa Germplasms from North China [J]. Acta Agron Sin, 2017, 43(03): 464-470.
[4] XU Ning,CHEN Bing-Xu,WANG Ming-Hai,BAO Shu-Ying,WANG Gui-Fang,GUO Zhong-Xiao. Identification of Alkali Tolerance of Mungbean Germplasm Resources during Germination [J]. Acta Agron Sin, 2017, 43(01): 112-121.
[5] WU Qi,ZHOU Yu-Fei,GAO Yue,ZHANG Jiao,CHEN Bing-Ru,XU Wen-Juan,HUANG Rui-Dong. Screening and Identification for Drought Resistance during Germination in Sorghum Cultivars [J]. Acta Agron Sin, 2016, 42(08): 1233-1246.
[6] LIU Ying,ZHANG Qiao-Feng,FU Bi-Sheng,CAI Shi-Bin,JIANG Yan-Jie,ZHANG Zhi-Liang,DENG Yuan-Yu,WU Ji-Zhong,DAI Ting-Bo. Genetic Diversity of Wheat Germplasm Resistant to Sharp Eyespot and Genotyping of Resistance Loci Using SSR Markers [J]. Acta Agron Sin, 2015, 41(11): 1671-1681.
[7] LI Long,WANG Lan-Fen,WU Jing,JING Rui-Lian,WANG Shu-Min*. Identification of Drought Resistence at Seedlings Stage in Common Bean (Phaseolus vulgaris L.) Varieties [J]. Acta Agron Sin, 2015, 41(06): 963-971.
[8] WANG Yi-Tao,ZHOU Yu-Fei,LI Feng-Xian,YI Bing,BAI Wei,YAN Tong,XU Wen-Juan,GAO Ming-Chao,HUANG Rui-Dong. Identification and Classification of Sorghum Cultivars for Drought Resistance during Germination Stage Based on Principal Components Analysis and Self Organizing Map Cluster Analysis [J]. Acta Agron Sin, 2014, 40(01): 110-121.
[9] TANG Mei,CHEN Yu-Ning,REN Xiao-Ping,HUANG Li,ZHOU Xiao-Jing,YAN Hai-Yan,JIANG Hui-Hang. Genetic Diversity of Arachis Accessions Detected by EST-SSR from Cultivated Peanut (Arachis hypogaea L.) [J]. Acta Agron Sin, 2012, 38(07): 1221-1231.
[10] LIU Jin-Shuai, LAI Hui-Cheng, JIA Zhen-Hong. Image Segmentation of Cotton Based on YCbCcr Color Space and Fisher Discrimination Analysis [J]. Acta Agron Sin, 2011, 37(07): 1274-1279.
[11] GUO Chun-Fang;SUN Yun;CHEN Chang-Song;CHEN Rong-Bing;ZHANG Mu-Qing. Comparison and Cluster Analysis of Photosynthetic Characters and Water Use Efficiency in Tea (Camellia sinensis) Cultivars [J]. Acta Agron Sin, 2008, 34(10): 1797-1804.
[12] YANG Jin-Zhong;HAO Jian-Ping;DU Tian-Qing;CUI Fu-Zhu;SANG Su-Ping. Discrimination of Numerous Maize Cultivars Based on Seed Image Process [J]. Acta Agron Sin, 2008, 34(06): 1069-1073.
[13] MENG Fan-Lei;QIANG Xiao-Lin;SHE Kui-Jun;TANG Ya-Wei;HU Yin-Gang. Genetic Diversity Analysis among Hulless Barley Varieties from the Major Agricultural Areas of Tibet [J]. Acta Agron Sin, 2007, 33(11): 1910-1914.
[14] DU Yong;WANG Yan;WANG Xue-Hong;SUN Lai-Li;YANG Jian-Chang. Comparisons of Plant Type, Grain Yield, and Quality of Different Japonica Rice Cultivars in Huanghe-Huaihe River Area [J]. Acta Agron Sin, 2007, 33(07): 1079-1085.
[15] XU Yan-Hong;GUAN Jian-Ping;ZONG Xu-Xiao. Genetic Diversity Analysis of Cowpea Germplasm Resources by SSR [J]. Acta Agron Sin, 2007, 33(07): 1206-1209.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!