Single nucleotide polymorphism (SNP), the third generation molecular marker, is a research focus because of its characteristics of large amount, high stability, wide distribution, easy, high-throughput detection and wide application. Most SNP mining methods were based on the amplification of target regions, except RFLP. Few applications of high-throughput SNP mining methods were reported in common wheat (Triticum aestivum) SNP detection because of its hexaploid, huge genome and high homologous among A, B and D genomes. The dbEST of wheat in GenBank is an important resource to mining SNP with bioinformatic methods, and the full-length cDNA libraries, which constructed with wheat’s three genome donor species seedlings, provide the chance to mine wheat genome specific cSNP candidates.
More than 5 000 clones, from A, S and D genomes’ full-length cDNA libraries, were randomly sequenced, and 2 467 singlets were obtained, thereunto, 990 from A genome, 737 from S genome and 740 from D genome. Using these singlets as origin templates, we “fished out” 1 296 SNP with AutoSNP program from the wheat dbEST in GenBank and developed an effective method to mine wheat cSNP. Among these SNP, 397 were from A genome, 322 from S genome and 420 from D genome. The common SNP of A and D genomes were 154. The SNP shared by A and S, S and D, A, S and D was only 1 respectively, which seemed to indicate that the relationship of A and D genomes was much closer among wheat three genomes according to the principle of SNP mining.
Among the 1296 SNP, eighty-two were InDel , which accounted for 6.25% of all the sequence variations, and most of them were single-base InDel (64.20%), a small part of them were 2–5 base InDel (30.86%), and only very few InDels (4.94%), whose size were more than 5 bases. The analysis results indicated that the frequencies of SNP in the three genomes were different, that was 0.98‰ in A genome, 0.68‰ in S genome and 1.54‰ in D genome, and the average frequency of the three genome was 0.914‰, which was closer with that of soybean (0.97‰), yet much lower than that of maize (6.3‰). Additionally, the ratio of transition/transversion (Ts/Tv) of the three genome were different either, the highest (1.404) was in D genome, the second in A genome (1.196) and the lowest in S genome (1.033), and the average ratio of A, S and D genomes was 1.306, which was much closer with that of maize (1.53) and soybean (1.08), yet far from that of human, mouse and fruit fly.