Assessment of diversity in tropical soybean (Glycine max (L.) Merr.) varieties and elite breeding lines using single nucleotide polymorphism markers

Abush Tesfaye Abebe; Adesike Oladoyin Kolawole; Nnanna Unachukwu; Godfree Chigeza; Hailu Tefera; Melaku Gedil

doi:10.1017/S1479262121000034

Assessment of diversity in tropical soybean (Glycine max (L.) Merr.) varieties and elite breeding lines using single nucleotide polymorphism markers

Published online by Cambridge University Press: 17 February 2021

Abush Tesfaye Abebe

Adesike Oladoyin Kolawole ,

Hailu Tefera and

Abush Tesfaye Abebe: Affiliation:
International Institute of Tropical Agriculture, Ibadan, Nigeria
Adesike Oladoyin Kolawole: Affiliation:
Ladoke Akintola University of Technology, Ogbomoso, Nigeria
Nnanna Unachukwu: Affiliation:
International Institute of Tropical Agriculture, Ibadan, Nigeria
Godfree Chigeza: Affiliation:
International Institute of Tropical Agriculture, Lusaka, Zambia
Hailu Tefera: Affiliation:
Private Consultant, 2384 Rolling Fork Circle, #403, Herndon, VA20171, USA
Melaku Gedil*: Affiliation:
International Institute of Tropical Agriculture, Ibadan, Nigeria
*: *Corresponding author. E-mail: m.gedil@cgiar.org

Article contents

Abstract
Introduction
Materials and methods
Results
Discussion
References

Rights & Permissions

Abstract

Soybean (Glycine max (L.) Merr.) is an important legume crop with high commercial value widely cultivated globally. Thus, the genetic characterization of the existing soybean germplasm will provide useful information for enhanced conservation, improvement and future utilization. This study aimed to assess the extent of genetic diversity of soybean elite breeding lines and varieties developed by the soybean breeding programme of the International Institute of Tropical Agriculture (IITA), Ibadan, Nigeria. The genetic diversity of 65 soybean genotypes was studied using single-nucleotide polymorphism (SNP) markers. The result revealed that 2446 alleles were detected, and the indicators for allelic richness and diversity had good differentiating power in assessing the diversity of the genotypes. The three complementary approaches used in the study grouped the germplasm into three major clusters based on genetic relatedness. The analysis of molecular variance revealed that 71% (P < 0.001) variation was due to among individual genotypes, while 11% (P < 0.001) was ascribed to differences among the three clusters, and the fixation index (FST) was 0.11 for the SNP loci, signifying moderate genetic differentiation among the genotypes. The identified private alleles indicate that the soybean germplasm contains diverse variability that is yet to be exploited. The SNP markers revealed high diversity in the studied germplasm and found to be efficient for assessing genetic diversity in the crop. These results provide valuable information that might be utilized for assessing the genetic variability of soybean and other legume crops germplasm by breeding programmes.

Keywords

allelic diversity breeding programmes genetic relatedness SNP soybean germplasm

Type: Research Article
Information: Plant Genetic Resources , Volume 19 , Issue 1 , February 2021 , pp. 20 - 28

DOI: https://doi.org/10.1017/S1479262121000034 [Opens in a new window]
Copyright: Copyright © The Author(s), 2021. Published by Cambridge University Press on behalf of NIAB

Introduction

Soybean (Glycine max (L.) Merr.) is one of the major legumes and oil crops of the world, in terms of total production and trade (Chen and Nelson, Reference Chen and Nelson2005; Hymowitz and Shurtleff, Reference Hymowitz and Shurtleff2005; FAO, 2020). However, the genetic improvement of such an essential crop has been challenged by its extremely narrow genetic base (Gizlice et al., Reference Gizlice, Carter and Burton1993, Reference Gizlice, Carter and Burton1994, Reference Gizlice, Carter, Gerig and Burton1996; Salado-Navarro et al., Reference Salado-Navarro, Sinclair and Hinson1993; Sneller, Reference Sneller1994; Singh and Hymowitz, Reference Singh and Hymowitz1999; Cornelious and Sneller, Reference Cornelious and Sneller2002). To sustain its genetic diversity, over 170,000 soybean germplasms have been conserved across 17 countries globally; while the Chinese National Crop Genebank maintains 31,575 accessions (Qiu et al., Reference Qiu, Chen, Liu, Li, Guan, Wang and Chang2011) and the soybean genetic resource centre of the United States Department of Agriculture (USDA) maintains over 18,000 germplasm collections (Soybase, 2020). These germplasm collections have made significant contributions to production and breeding programmes, since they possess several unique genes that can be utilized for the genetic improvement of the crop (Qiu et al., Reference Qiu, Chen, Liu, Li, Guan, Wang and Chang2011; Soybase, 2020). Populations derived from the genetic recombination of biparental crosses of diverse parents might be vital sources of higher genetic variability (Helms et al., Reference Helms, Orf, Vallad and McClean1997; Kisha et al., Reference Kisha, Sneller and Diers1997). Some reports show the contributions of improved varieties (e.g. crosses of elite X elite varieties) and elite breeding lines developed by breeding programmes are on the rise, compared to germplasm collections and landraces. For instance, in China, elite breeding lines from crosses and cultivars contributed to 36 and 86% of the soybean varieties released in the 1950s, and between 1993 and 2004, respectively, relative to landraces, traditional varieties and wild relatives (Qiu et al., Reference Qiu, Chen, Liu, Li, Guan, Wang and Chang2011).

Progress in genetic improvement of any crop depends on the presence of genetic diversity within the populations under selection. Knowledge of the genetic diversity crops is very important for designing strategies to establish core collections and enhance utilization of the germplasm by breeding programmes. Soybean production is gaining increasing importance in several sub-Saharan African (SSA) countries, such as Nigeria, Ghana, Uganda, Ethiopia, Zambia and Malawi (FAO, 2020), the crop being cultivated in the wider agro-ecological conditions of these countries. The soybean improvement programme at the International Institute of Tropical Agriculture (IITA), Ibadan, Nigeria, has a soybean collection of more than 1800 accessions, including high-yielding cultivars without information on the extent of their genetic diversity. These accessions have been phenotypically screened for priority traits that include disease resistance, pod shattering tolerance, early and medium maturity, efficient natural nodulation, lodging tolerance and yield improvement (Tefera et al., Reference Tefera, Kamara and Asafo-Adjei2009; Chigeza et al., Reference Chigeza, Boahen, Gedil, Agoyi, Mushoriwa, Denwar, Gondwe, Tesfaye, Kamara, Alamu and Chikoye2019). These authors also reported the release of more than 100 IITA bred soybean varieties by the National Agricultural Research Systems across several countries in SSA, following the screening efforts.

Numerous studies have been carried out to determine the degree of genetic variation of varieties and breeding lines of soybean and their relatedness (Keim et al., Reference Keim, Beavis, Schupp and Freestone1992; Gizlice et al., Reference Gizlice, Carter and Burton1994; Sneller, Reference Sneller1994; Sneller et al., Reference Sneller, Miles and Hoyt1997; Kisha et al., Reference Kisha, Diers, Hoyt and Sneller1998; Nelson et al., Reference Nelson, Elmore, Klein and Shapiro1998). Due to its commercial value, soybean has been the subject of advanced genomic studies by the private and public sectors. A number of molecular genomic resources, including single-nucleotide polymorphisms (SNPs), such as: Phytozome (https://phytozome.jgi.doe.gov/pz/portal.html), SoyKB (http://soykb.org/) and Soybase (https://soybase.org/) are available in the public databases. Despite the abundance of genomic resources, no molecular characterization has been performed on the soybean breeding lines at IITA. Therefore, the objective of this study was to assess the extent of genetic diversity of breeding lines and varieties developed by the IITA soybean breeding programme.

Materials and methods

Plant material and DNA isolation

A total of 65 soybean genotypes (17 released varieties and 48 elite breeding lines) from IITA's soybean breeding programme were used in this study (Table 1). Although most of these varieties were released in Nigeria, some lines have been released in Ghana, Benin, Togo, Democratic Republic of the Congo, Uganda, and Ethiopia from 1989 to 2011. All the genotypes with TGx 1987 series are rust-resistant, as they are developed from a cross of rust-resistant donor parent UG-5, in addition to other desirable traits. TGx 1835-10E is a variety released in Nigeria for early maturity and rust resistance. Soy104 and UG-5 were unimproved, but sources of rust resistance genes. TGx 1440-1E and TGx 1448-2E are suitable as a trap varieties in depleting the seed bank of Striga hermonthica.

Table 1. Rust reaction and pedigree of the soybean genotypes used in the genetic diversity study

^a Res, resistant; Susc, susceptible.

DNA extraction and genotyping

For SNP genotyping, the 65 soybean genotypes were planted and grown into seedlings for 3 weeks, after which fresh bulk young leaves were harvested from all the seedlings (10) per genotype and ground into a fine powder using liquid nitrogen. The genomic DNA of each plant sample was extracted using a miniprep Dellaporta extraction protocol (Dellaporta et al., Reference Dellaporta, Wood and Hicks1983). The quality of the extracted DNA samples was checked using a 1% agarose gel prepared in 150 ml of 1× TBE agarose gel and quantified using Nanodrop Technologies ND-1000 model range of a nanodrop spectrophotometer.

GoldenGate assay-based SNP genotyping

The genomic DNA extracted from each of the 65 soybean genotypes was sent to the Soybean Genomics and Improvement Laboratory, Beltsville Agricultural Research Center (SGIL-BARC), Beltsville, Maryland. DNA quality was also checked at SGIL-BARC and immediately used for the GoldenGate assay. The GoldenGate assay was performed as per the procedure described by Hyten et al. (Reference Hyten, Song, Choi, Yoon, Specht, Matukumalli, Nelson, Shoemaker, Young and Cregan2008).

Statistical analyses

Genetic diversity indices

SNP markers with a minor allele frequency (MAF) less than 0.05 were filtered out, resulting in 1223 informative loci used in the analyses. The genetic properties of SNPs, such as MAF, polymorphic information content (PIC) and percentage of polymorphic loci (% P) were calculated to quantify the genetic diversity within and among the 65 soybean genotypes. In addition, genetic diversity indices, such as total number of different alleles (N _a), number of effective alleles (N _e), Shannon's information index (I), number of private alleles, gene diversity (H _e), observed heterozygosity (H _o) and number of loci with private alleles were computed using Power Marker (Liu and Muse, Reference Liu and Muse2005) and GenAlEx version 6.41 (Peakall and Smouse, Reference Peakall and Smouse2012) software.

Population structure

The inherent population structure within the genotypes was characterized based on the common attributes of the genotypes using the three complementary clustering approaches. In the first distance-based hierarchical clustering analysis, a pairwise genetic distance (identity-by-state, IBS) matrix was calculated among all individuals using PLINK 5 (Purcell et al., Reference Purcell, Neale, Todd-brown, Thomas, Bender, Maller, Sklar, de Bakker, Daly and Sham2007) and Ward's minimum variance. A hierarchical cluster dendrogram was then built from the IBS matrix using the Analyses of Phylogenetics and Evolution (ape) package (Paradis et al., Reference Paradis, Claude and Strimmer2004) implemented in R (R core team, 2015). The second approach was a model-based maximum likelihood estimation of ancestral subpopulations using ADMIXTURE (Alexander et al., Reference Alexander, Novembre and Lange2009). ADMIXTURE assumes that the loci are in linkage equilibrium, and the ancestral populations are in Hardy–Weinberg equilibrium (Frichot et al., Reference Frichot, Mathieu, Trouillon, Bouchard and François2014). In the ADMIXTURE analysis, the number of subpopulations (K) varied from 2 to 12, and the value of K exhibiting a low cross-validation error was selected (Alexander and Lange, Reference Alexander and Lange2011). The third approach was an assumption-free discriminant analysis of principal components (DAPC) analysis which was implemented in R using the ‘adegenet’ package (Jombart, Reference Jombart2008). DAPC that involved optimal clusters of transformed principal component analysis-based SNP data was used to identify and describe clusters of genetically related individuals and subgrouping based on k-means. The best-supported model by Bayesian information criterion (BIC) was selected. Based on results of hierarchical clustering information and ADMIXTURE analysis, the most appropriate K was selected. The membership probabilities of each genotype for the different groups were obtained, and results of the three complementary approaches (the hierarchical tree/dendrogram, ADMIXTURE and DAPC) were compared.

Based on the numbers of inferred clusters determined from the three complementary approaches, analysis of molecular variance (AMOVA) was computed to estimate molecular variation within and among the genotypes using GenAlEx 6.41 and Power Marker V3.25 software. The extent of genetic variance explained by population structure was derived from the AMOVA, fixation index (F _ST) and standardized F _ST (F′_ST) based on Wright's F-statistic (Wright, Reference Wright1978).

Results

Genetic diversity

The MAF ranged from 0.02 to 0.50, with an average of 0.23. The highest PIC value of 0.38 was recorded in the markers BARC-064873-18956, BARC-051149-11016, BARC-029669-06297 and BARC-019787-04375; while the lowest value was 0.02 with an average value of 0.25. The percentage of polymorphic loci recorded in this study (85%) was high, and an indicator of the efficiency of SNP markers used in this study in detecting polymorphism (Fig. 1). The entire soybean samples had an average effective number of alleles, gene diversity, observed heterozygosity and Shannon's information index of 1.53, 0.31, 0.19 and 0.25, respectively (Table 2). These frequencies were considered to be desirable in differentiating the studied soybean genotypes.

Fig. 1. Frequency distribution of the mean values of genetic properties of SNPs across the 65 soybean genotypes.

Table 2. Average genetic diversity measures and their corresponding standard errors at 1223 SNP loci of the 65 soybean genotypes

The presence of private alleles was considered as an additional factor to differentiate the population. A total of 108 private alleles were identified among the 65 soybean genotypes. However, based on the number of clusters identified, the number of private alleles detected in cluster 2 was about 3–9-fold higher than the other two clusters.

Genetic relatedness

All the three complementary methods used in determining the number of clusters among the 65 soybean genotypes showed the presence of three major clusters with few sub-clustering (Fig. 2). Cluster 1 consists of 19 genotypes, most of which were crosses with Uganda's UG-5, including the genotype from USA (SOY104), all of which are rust-resistant, except for TGX 536-02D, which was released in Nigeria, Benin and Ghana since 1985, and susceptible to the rust disease. Cluster 2 was the largest, with 26 genotypes, which are all susceptible to leaf rust. Some genotypes (i.e. TGx 1440-1E, TGx 1448-2E, TGx 1937-1F, TGx 1908-8F and TGx 1910-14F) in this cluster are early flowering, resistant to lodging, resistant to pod shattering and have good nodulation ability. These genotypes were released in Nigeria, Cameroon, Togo, Mozambique, Ghana, Benin, Cote D'Ivoire, Kenya, Malawi and Mozambique. Cluster 3 had 20 genotypes, of which only two (TGX 1961-1F and TGX 1835-10E) were resistant to leaf rust with the latter released in Nigeria, Uganda, Kenya and Cameroon since 2008. Other genotypes in this cluster are susceptible to soybean leaf rust disease, but possess other desirable agronomic traits. Although all clusters are discrete and well separated from the other clusters, each cluster is reasonably heterogeneous in terms of the genotypes' attributes. It was observed that genotypes with the same pedigrees clustered together that validates clustering with the SNP markers was efficient in grouping genetically related genotypes. Very few intermixing of the rust-resistant with susceptible soybean genotypes were observed across the clusters (Fig. 2). The DAPC method, using discriminant functions (Fig. 3), maximized the diversity between the three clusters while minimizing the diversity with-in cluster. The first three principal components explained 31.95% of the cumulative variation. The three genetically distinct groups identified using DAPC were consistent with the groups identified by the hierarchical cluster/dendrogram and ADMIXTURE. The error rate from the cross-validation method of both the ADMIXTURE and the BIC from the DAPC showed a rapid decline from K = 1 to K = 3 and from 1 to 3, respectively, indicating that the samples can be grouped into three major clusters. The results obtained above were consistent and showed good correspondence; thus, indicating the samples' population structure had been correctly identified.

Fig. 2. Hierarchical cluster/dendrogram of the 65 soybean genotypes (Ward's minimum variance method) and the individual ancestry estimated from ADMIXTURE analysis and the clustering by DAPC. Individuals are partitioned into segments corresponding to the inferred membership in k = 3 genetic clusters as indicated by the colours.

Fig. 3. Scatter plot of individuals on the first two principal components obtained from DAPC based on the analysis of 65 soybean genotypes using 1223 SNP markers. The graph represents the individuals as dots.

The extent of genetic variation among the genotypes was further revealed by AMOVA, which showed high variations (71%) (P < 0.001) among the individual genotypes, and 11% (P < 0.001) of the total variation was ascribed to differences among the three clusters detected by the three complementary approaches used to determine the population structure (Table 3). The 18% (P < 0.001) was accounted for by the variation within the 65 genotypes. The estimated fixation index (F _ST) was 0.11 (P < 0.001), indicating moderate genetic differentiation among the clusters.

Table 3. Hierarchical AMOVA and Wright's fixation index (F _ST) for 65 soybean genotypes based on 1223 SNP markers

***Significant at the 0.001 probability level.

Discussion

Genetic characterization information is vital in designing future hybridization plans of the IITA soybean breeding programme and partners receiving IITA's elite soybean lines to be utilized by the national soybean improvement programmes across Africa, and beyond. In this study, the discriminatory power and information obtained from the estimates of genetic diversity and population structure of the studied soybean genotypes were enlightening.

The number of alleles per locus measures genetic variation at the gene level. In a population of self-fertilizing species, such as soybean, lower allelic diversity and heterozygosity are commonly expected (Wright, Reference Wright1921). Hence, PIC, MAF, Shannon's information index, gene diversity and observed heterozygosity were >0.5 in this study. The average PIC value of 0.25 was moderately informative and implies that the SNP markers have differentiating power, since PIC cannot exceed 0.50 in bi-allelic markers (Singh et al., Reference Singh, Choudhury, Singh, Kumar, Srinivasan, Tyagi, Singh and Singh2013). Comparable average PIC values were reported on ryegrass (Roldàn-Ruiz et al., Reference Roldàn-Ruiz, Dendauw, Van Bockstaele, Depicker and De Loose2000), soybean (Chen et al., Reference Chen, Hou, Zhang, Pang and Li2017) and wheat (Eltaher et al., Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018). The high percent polymorphic loci and other genetic diversity indices measured also depict the existence of variability among the soybean genotypes. The occurrence of private alleles among the genotypes indicates that the germplasm consisted of diverse, unique, and favourable alleles that may contribute positively to soybean breeding that is yet to be exploited. Thus, these observations imply the presence of diversity within the genotypes and demonstrate that the selected markers were informative and useful for further soybean genetic diversity studies.

For germplasm characterization, earlier reports concluded that large numbers of SNPs would be required to replace the highly polymorphic SSRs in diversity and relatedness studies (Hamblin et al., Reference Hamblin, Warburton and Buckler2007; Semagn et al., Reference Semagn, Babu, Hearne and Olsen2014). The average genetic distance (similarity) among a set of genotypes measures genetic diversity at the population level (Lu and Bernardo, Reference Lu and Bernardo2001). The three diverse but complementary clustering analyses employed in this study differentiated the 65 genotypes from each other, assigning them into three different groups, which indicate substantial genetic diversity. The genotypes in each cluster share common features, which largely corresponds to their pedigree and agronomic traits. TGx 1835-10E, released for leaf rust resistance (Table 1), found in the cluster comprising genotypes with efficient natural nodulation, pod shattering resistance, medium maturity and high yield. The two striga trap varieties: TGx 1440-1E and TGx 1448-2E that are known in depleting the seed bank of S. hermonthica through suicidal germination were found in the same cluster with other genotypes that are early maturing, resistant to lodging and high yielding. All the genotypes derived from crosses made to UG-5, a rust-resistant parent, were found in the same cluster because all of them have rust resistance features (Table 1), and hence, according to Tantasawat et al. (Reference Tantasawat, Trongchuen, Prajongjai, Jenweerawat and Chaowiset2011), share genetic homology. Some correspondence between the clustering pattern and the pedigree of the soybean genotypes was observed. These results indicate that the different pedigrees for each soybean genotypes played an essential role in maintaining genetic variation, as genotypes with similar pedigree clustered together by the SNP markers. Lee et al. (Reference Lee, Yu, Hwang, Blake, So, Lee, Nguyen and Shannon2008) suggested that soybean genotypes originating from different genetic backgrounds could have important genetic differences. The markers successfully differentiated the studied germplasm, and the genetic distance observed within the genotypes indicates that good recombination to produce superior progenies can be achieved from crosses between genetically dissimilar genotypes (Narvel et al., Reference Narvel, Fehr, Chu and Grant2000).

Assessment of genetic relatedness among parental lines will help breeders identify the most diverse cross combinations useful to enhance genetic gain from the population. Therefore, the relationship of soybean genotypes of IITA breeding lines can facilitate the selection of diverse parental lines carrying priority traits for recombination.

The 71% variation among individual genotypes confirms that populations of self-fertilizing species are expected to have high differentiation. The F _ST value in this study was 0.11, which may be regarded as moderate as per Wright's qualitative guidelines (Wright, Reference Wright1978). These observations indicate the presence of diversity between the genotypes, and demonstrate the highly informativeness and usefulness of the selected markers for future soybean genetic diversity studies.

SNPs have emerged as powerful tools for many genetic applications. Its unique features include abundance in the genome and the ability to generate polymorphism at a single base level. We aim to have the SNP markers optimized by providing relevant information on the markers' discriminatory power in characterizing the IITA soybean germplasm. The markers used were powerful for detecting genetic diversity among and within the soybean populations. We suggest using these validated sets of SNP markers as they spanned the whole genome and provided a biologically sound classification of the genotypes. The results of this study will help to conserve, utilize and manage IITA's soybean germplasm effectively. Determination of the extent of genetic variability present in this germplasm will furnish soybean breeders' with the required information and decision-making tools for effective parental selection in their breeding programmes.

Acknowledgements

This research was supported by the core fund provided by the International Institute of Tropical Agriculture (IITA), and support from Soybean Genomics and Improvement Laboratory, Beltsville Agricultural Research Center (SGIL-BARC), Beltsville, Maryland.

Conflict of interest

None.

References

Alexander, DH and Lange, K (2011) Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinformatics 12: 246.CrossRef Google Scholar PubMed

Alexander, DH, Novembre, J and Lange, K (2009) Fast model based estimation of ancestry in unrelated individuals. Genome Research 19: 1655–1664.CrossRef Google Scholar PubMed

Chen, Y and Nelson, RL (2005) Relationship between origin and genetic diversity in Chinese soybean germplasm. Crop Science 45: 1645–1652.CrossRef Google Scholar

Chen, W, Hou, L, Zhang, Z, Pang, X and Li, Y (2017) Genetic diversity, population structure, and linkage disequilibrium of a core collection of Ziziphus jujuba assessed with genome-wide SNPs developed by genotyping by-sequencing and SSR markers. Frontiers in Plant Science 8: 575.Google Scholar PubMed

Chigeza, G, Boahen, S, Gedil, M, Agoyi, E, Mushoriwa, H, Denwar, N, Gondwe, T, Tesfaye, A, Kamara, A, Alamu, OE and Chikoye, D (2019) Public sector soybean (Glycine max) breeding: advances in cultivar development in the African tropics. Plant Breeding 2019: 1–10.Google Scholar

Cornelious, BK and Sneller, CH (2002) Yield and molecular diversity of soybean lines derived from crosses of northern and southern elite parents. Crop Science 42: 642–647.CrossRef Google Scholar

Dellaporta, SL, Wood, J and Hicks, JB (1983) A plant DNA mini preparation: version II. Plant Molecular Biology Reporter 1: 19–21.CrossRef Google Scholar

Eltaher, S, Sallam, A, Belamkar, V, Emara, HA, Nower, AA, Salem, KF, Poland, J and Baenziger, PS (2018) Genetic diversity and population structure of F3:6 Nebraska winter wheat genotypes using genotyping-by-sequencing. Frontiers in genetics 9: 76.CrossRef Google Scholar PubMed

FAO (2020) FAOSTAT. URL: http://www.fao.org/faostat/en/#data. Accessed date and time: 12/10/202, 9:40 am.Google Scholar

Frichot, E, Mathieu, F, Trouillon, T, Bouchard, G and François, O (2014) Fast and efficient estimation of individual ancestry coefficients. Genetics 196: 973–983.CrossRef Google Scholar PubMed

Gizlice, Z, Carter, TE Jr and Burton, JW (1993) Genetic diversity in North American soybean: I. Multivariate analysis of founding stock and relation to coefficient of parentage. Crop Science 33: 614–620.CrossRef Google Scholar

Gizlice, Z, Carter, TE Jr and Burton, JW (1994) Genetic base for North American public soybean cultivars released between 1947 and 1988. Crop Science 34: 1143–1151.CrossRef Google Scholar

Gizlice, Z, Carter, TE Jr, Gerig, TM and Burton, JW (1996) Genetic diversity patterns in North American public soybean cultivars based on coefficient of parentage. Crop Science 36: 753– 765.CrossRef Google Scholar

Hamblin, MT, Warburton, ML and Buckler, ES (2007) Empirical comparison of simple sequence repeats and single nucleotide polymorphisms in assessment of maize diversity and relatedness. PLoS One 2: 1367.CrossRef Google Scholar PubMed

Helms, T, Orf, J, Vallad, G and McClean, P (1997) Genetic variance, coefficient of parentage, and genetic distance of six soybean populations. Theoretical and Applied Genetics 94: 20–26.CrossRef Google Scholar PubMed

Hymowitz, T and Shurtleff, WR (2005) Debunking soybean myths and legends in the historical and popular literature. Crop Science 45: 473–476.CrossRef Google Scholar

Hyten, DL, Song, Q, Choi, IY, Yoon, MS, Specht, JE, Matukumalli, LK, Nelson, RL, Shoemaker, RC, Young, ND and Cregan, PB (2008) High-throughput genotyping with the GoldenGate assay in the complex genome of soybean. Theoretical and Applied Genetics 116: 945–952.CrossRef Google Scholar PubMed

Jombart, T (2008) Adegenet an R package for the multivariate analysis of genetic markers. Bioinformatics (Oxford, England) 24: 1403–1405.CrossRef Google Scholar

Keim, P, Beavis, GW, Schupp, J and Freestone, R (1992) Evaluation of soybean RFLP marker diversity in adapted germplasm. Theoretical and Applied Genetics 85: 205–212.CrossRef Google Scholar

Kisha, T, Sneller, CH and Diers, BW (1997) Relationship of genetic distance and genetic variance in populations of soybean. Crop Science 37: 1317–1325.CrossRef Google Scholar

Kisha, T, Diers, BW, Hoyt, JM and Sneller, CH (1998) Genetic diversity among soybean plant introductions and North American germplasm. Crop Science 38: 1669–1680.CrossRef Google Scholar

Lee, JD, Yu, JK, Hwang, YH, Blake, S, So, YS, Lee, GJ, Nguyen, HT and Shannon, JG (2008) Genetic diversity of wild soybean (Glycine soja Sieb. and Zucc.) accessions from South Korea and other countries. Crop Science 48: 606–616.CrossRef Google Scholar

Liu, K and Muse, SV (2005) Power marker: an integrated analysis environment for genetic marker analysis. Bioinformatics (Oxford, England) 21: 2128–2129.CrossRef Google Scholar

Lu, H and Bernardo, R (2001) Molecular marker diversity among current and historical maize inbreds. Theoretical and Applied Genetics 103: 613–617.CrossRef Google Scholar

Narvel, JM, Fehr, WR, Chu, WC and Grant, D (2000) Simple sequence repeat diversity among soybean plant introductions and elite genotypes. Crop Science 40: 1452–1458.CrossRef Google Scholar

Nelson, LA, Elmore, RW, Klein, RN and Shapiro, C (1998) Nebraska Soybean Variety Tests. Nebraska Coop. Ext. E. C. 98-104-A. Lincoln: University of Nebraska.Google Scholar

Paradis, E, Claude, J and Strimmer, K (2004) APE: analyses of phylogenetics and evolution in R language. Bioinformatics (Oxford, England) 20: 289–290.CrossRef Google Scholar PubMed

Peakall, R and Smouse, P (2012) Genalex 6.5: genetic analysis in Excel. Population genetic software for teaching and research – an update. Bioinformatics (Oxford, England) 28: 2537–2539.CrossRef Google Scholar PubMed

Purcell, S, Neale, B, Todd-brown, K, Thomas, L, Bender, D, Maller, J, Sklar, P, de Bakker, PIW, Daly, MJ and Sham, PC (2007) PLINK: a toolset for whole-genome association and population-based linkage analyses. American Journal of Human Genetics 81: 559–575.CrossRef Google Scholar

Qiu, L, Chen, P, Liu, Z, Li, Y, Guan, R, Wang, L and Chang, R (2011) The worldwide utilization of the Chinese soybean germplasm collection. Plant Genetic Resources: Characterization and Utilization 9: 109–122.CrossRef Google Scholar

R Core Team (2015) R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing.Google Scholar

Roldàn-Ruiz, I, Dendauw, J, Van Bockstaele, E, Depicker, A and De Loose, MA (2000) AFLP markers reveal high polymorphic rates in ryegrasses (Lolium spp.). Molecular breeding 6: 125–134.CrossRef Google Scholar

Salado-Navarro, LR, Sinclair, TR and Hinson, K (1993) Changes in yield and seed growth traits in soybean cultivars released in the southern U.S.A. from 1945 to 1983. Crop Science 33: 1204–1209.CrossRef Google Scholar

Semagn, K, Babu, R, Hearne, S and Olsen, M (2014) Single nucleotide polymorphism genotyping using Kompetitive Allele Specific PCR (KASP): overview of the technology and its application in crop improvement. Molecular Breeding 33: 1–14.CrossRef Google Scholar

Singh, RJ and Hymowitz, T (1999) Soybean genetic resources and crop improvement. Genome 42: 605–616.CrossRef Google Scholar

Singh, N, Choudhury, DR, Singh, AK, Kumar, S, Srinivasan, K, Tyagi, RK, Singh, NK and Singh, R (2013) Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties. PLoS One 8: e84136.CrossRef Google Scholar PubMed

Sneller, CH (1994) Pedigree analysis of elite soybean lines. Crop Science 34: 1515–1522.CrossRef Google Scholar

Sneller, CH, Miles, J and Hoyt, JM (1997) Agronomic performance of soybean plant introduction and their genetic similarity to elite lines. Crop Science 37: 1595–1600.CrossRef Google Scholar

Soybase (2020) Soybean genetic resources and genetic enhancements white paper. URL: https://soybase.org/Genetic_Resources/Soybean_Genetic_Resources.html accessed date: 16/08/2020.Google Scholar

Tantasawat, P, Trongchuen, J, Prajongjai, T, Jenweerawat, S and Chaowiset, W (2011) SSR analysis of soybean (Glycine max (L.) Merr.) genetic relationship and variety identification in Thailand. Australian Journal of Crop Science 5: 283–290.Google Scholar

Tefera, H, Kamara, AY and Asafo-Adjei, B (2009) Improvement in grain and fodder yields of early maturing promiscuous soybean varieties in the Guinea savanna of Nigeria. Crop Science 49: 2037–2042.CrossRef Google Scholar

Wright, S (1921) Systems of mating. II. The effects of inbreeding on the genetic composition of a population. Genetics 6: 124–143.CrossRef Google Scholar PubMed

Wright, S (1978) Evolution and the Genetics of Populations Vol. 4. Variability Within and among Natural Populations, Chicago: University of Chicago Press, p. 58.Google Scholar

Table 1. Rust reaction and pedigree of the soybean genotypes used in the genetic diversity study

Fig. 1. Frequency distribution of the mean values of genetic properties of SNPs across the 65 soybean genotypes.

Table 2. Average genetic diversity measures and their corresponding standard errors at 1223 SNP loci of the 65 soybean genotypes

Table 3. Hierarchical AMOVA and Wright's fixation index (FST) for 65 soybean genotypes based on 1223 SNP markers

Article contents

Assessment of diversity in tropical soybean (Glycine max (L.) Merr.) varieties and elite breeding lines using single nucleotide polymorphism markers

Abstract

Keywords

Introduction

Materials and methods

Plant material and DNA isolation

DNA extraction and genotyping

GoldenGate assay-based SNP genotyping

Statistical analyses

Genetic diversity indices

Population structure

Results

Genetic diversity

Genetic relatedness

Discussion

Acknowledgements

Conflict of interest

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests