Hostname: page-component-745bb68f8f-b6zl4 Total loading time: 0 Render date: 2025-02-06T06:42:10.241Z Has data issue: false hasContentIssue false

Genetic structure and diversity of upland rice germplasm using diversity array technology (DArT)-based single nucleotide polymorphism (SNP) markers

Published online by Cambridge University Press:  04 November 2020

Kehinde A. Adeboye*
Affiliation:
Center of Excellence in Agricultural Development and Sustainable Environment (CEADESE), Federal University of Agriculture, Abeokuta, Nigeria Genebank Department, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland/OT Gatersleben, Germany
Olayinka E. Oyedeji
Affiliation:
Center of Excellence in Agricultural Development and Sustainable Environment (CEADESE), Federal University of Agriculture, Abeokuta, Nigeria Department of Agricultural, Food and Nutritional Sciences, Agriculture/Forestry Centre, University of Alberta, Edmonton, Canada
Ahmad M. Alqudah
Affiliation:
Genebank Department, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland/OT Gatersleben, Germany
Andreas Börner
Affiliation:
Genebank Department, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland/OT Gatersleben, Germany
Olusegun Oduwaye
Affiliation:
Department of Plant Breeding and Seed Technology, Federal University of Agriculture, Abeokuta, Nigeria
Olutumininu Adebambo
Affiliation:
Department of Animal Breeding and Genetics, Federal University of Agriculture, Abeokuta, Nigeria
Isaac O. Daniel
Affiliation:
Department of Plant Breeding and Seed Technology, Federal University of Agriculture, Abeokuta, Nigeria
*
*Corresponding author. E-mail: kaadeboye@yahoo.com
Rights & Permissions [Opens in a new window]

Abstract

Investigating genetic structure and diversity is crucial for rice improvement strategies, including mapping quantitative trait loci with the potential for improved productivity and adaptation to the upland ecology. The present study elucidated the population structure and genetic diversity of 176 rice germplasm adapted to the upland ecology using 7063 genome-wide single nucleotide polymorphism (SNP) markers from the Diversity Array Technology (DArT)-based sequencing platform (DArTseq). The SNPs were reasonably distributed across the rice genome, ranging from 432 SNPs on chromosome 9 to 989 SNPs on chromosome 1. The minimum minor allele frequency was 0.05, while the average polymorphism information content and heterozygosity were 0.25 and 0.03, respectively. The model-based (Bayesian) population structure analysis identified two major groups for the studied rice germplasm. Analysis of molecular variance revealed that all (100%) of the genetic variance was attributable to differences within the population, and none was attributable to the population structure. The estimates of genetic differentiation (PhiPT = 0.001; P = 0.197) further showed a negligible difference among the population structures. The results indicated a high genetic exchange or gene flow (number of migrants, Nm = 622.65) and a substantial level of diversity (number of private alleles, Pa = 1.52 number of different alleles, Na = 2.67; Shannon's information index, I = 0.084; and percentage of polymorphic loci, %PPL = 55.9%) within the population, representing a valuable resource for rice improvement. The findings in this study provide a critical analysis of the genetic diversity of upland rice germplasm that would be useful for rice yield improvement. We suggested further breeding and genetic analyses.

Type
Research Article
Copyright
Copyright © The Author(s), 2020. Published by Cambridge University Press on behalf of NIAB

Introduction

Rice production in the upland ecology holds enormous potentials to meet the dual objectives of providing food and poverty escape in Africa (Gassner et al., Reference Gassner, Harris, Mausch, Terheggen, Lopes, Finlayson and Dobie2019). Agriculture is the traditional mainstay of the African economy with agricultural activities predominantly in the upland ecology, which covers a meaningful proportion of the arable land areas in the region (Allurri et al., Reference Alluri, Mahsatra and Lawson1979; Diagne et al., Reference Diagne, Amovin-Assagba, Futakuchi and Wopereis2013). Rice grain yield in the ecology is characteristically low (Oikeh et al., Reference Oikeh, Nwilene, Agunbiade, Oladimeji, Ajayi, Mande, Tsunematsu and Samejima2008; Diagne et al., Reference Diagne, Amovin-Assagba, Futakuchi and Wopereis2013) because the poor resource farmers cannot access sophisticated infrastructures, including irrigation technology, and chemicals to boost their productivity asides the myriad of biotic and abiotic constraints. Despite these limitations, rice production in Africa is significantly dependent on the upland ecology, which accounts for about 32% of the rice-growing areas in the region (Diagne et al., Reference Diagne, Amovin-Assagba, Futakuchi and Wopereis2013).

Consequently, rice improvement programmes are exploring various procedures to boost rice productivity in the upland ecology (Saito et al., Reference Saito, Asai, Zhao, Laborte and Grenier2018). Breeding works aimed at finding promising cultivars with superior characteristics over existing varieties are also intensified (Saito et al., Reference Saito, Sokei and Wopereis2012; Tollens et al., Reference Tollens, Demont, Sié, Diagne, Saito, Wopereis, Wopereis, Johnson, Ahmadi, Tollens and Jalloh2013; Saito et al., Reference Saito, Asai, Zhao, Laborte and Grenier2018). However, the reality of changing climate may further complicate plant breeding efforts (Wassmann et al., Reference Wassmann, Jagadish, Heuer, Ismail, Redona, Serraj, Singh, Howell, Pathak and Sumfleth2009; Ceccarelli et al., Reference Ceccarelli, Grando, Maatougui, Michael, Slash, Haghparast, Rahmanian, Taheri, Al-Yassin, Benbelkacem, Labdi, Mimoun and Nachit2010; Dreccer et al., Reference Dreccer, Bonnett, Lafarge, Christou, Savin, Costa-Pierce, Misztal and Whitelaw2013) such that potential cultivars need to demonstrate the ability to adapt to the changing environment.

Plant breeding potentials in improving crops for yield adaptation to environmental stresses rely on genetic variability within the germplasm (Lenaerts et al., Reference Lenaerts, Collard and Demont2019). Some works are available on genetic variation and diversity studies of rice germplasm, being a model crop (Garris et al., Reference Garris, Tai, Coburn, Kresovich and McCouch2005; Yamasaki and Ideta, Reference Yamasaki and Ideta2013; Ramadan et al., Reference Ramadan, Elmoghazy and Mowafi2015; Islam et al., Reference Islam, Khalequzzaman, Prince, Siddique, Rashid, Ahmed, Pittendrigh and Ali2018; Ndjiondjop et al., Reference Ndjiondjop, Semagn, Sow, Manneh, Gouda, Kpeki, Pegalepo, Wambugu, Sié and Warburton2018). However, rice genetic diversity keeps expanding with the development of new breeding lines and continuous germplasm collections. Thus, a need to continually elucidate genetic diversity among germplasms to aid rice breeding programmes, and population development, particularly for the upland ecology.

Crop diversity may be assayed at phenotypic and genotypic levels, exploring statistical procedures to partition phenotypic or genetic descriptors into genetic or environmental components (Cobb et al., Reference Cobb, Declerck, Greenberg, Clark and McCouch2013). The use of genetic markers, most especially single nucleotide polymorphism (SNP) markers have proven more effective in detecting patterns of crop diversities (Fischer et al., Reference Fischer, Rellstab, Leuzinger, Roumet, Gugerli, Shimizu, Holderegger and Widmer2017). In recent times, advanced Genotyping-by-Sequencing (GBS) platforms, such as Diversity Array Technology (DArT) (Sansaloni et al., Reference Sansaloni, Petroli, Jaccoud, Carling, Detering, Grattapaglia and Kilian2011), have enabled timely mining of high-density SNP information at a significantly reduced cost (Poland and Rife, Reference Poland and Rife2012). Consequently, diversity studies of several crop species have shown relatively better coverage and reduced missing data with DArT compared to other GBS platforms (Chen et al., Reference Chen, Zavala, Ortega, Petroli, Franco, Burgueño, Costich and Hearne2016; Nazzicari et al., Reference Nazzicari, Biscarini, Cozzi, Brummer and Annicchiarico2016; Yang et al., Reference Yang, Wei, Liu, Wu, Li, Zhang, Jian, Li, Tai, Zhang, Zhang, Jiang, Xia and Wan2016a).

The objectives of this study were to examine the genetic diversity and population structure of 176 upland rice germplasm using the DArT-SNP markers to gain a better understanding of their potential utilization in rice improvement programmes for the upland ecology.

Materials and methods

Plant materials

A population of 176 upland rice germplasm consisting of breeding lines, improved varieties and landraces was studied (online Supplementary Table S1). The breeding lines were developed, and preserved with other germplasm at Africa Rice Center (AfricaRice), Ibadan, Nigeria. Fresh leaves were collected from each of the germplasm, lyophilized and shipped in 96-plex to DArT Pty Ltd, Canberra, Australia, for genotyping by sequencing using diversity array technology (Sansaloni et al., Reference Sansaloni, Petroli, Jaccoud, Carling, Detering, Grattapaglia and Kilian2011; Ren et al., Reference Ren, Ray, Li, Xu, Zhang, Liu, Yao, Kilian and Yang2015).

DArT-based Genotyping-By-Sequencing

DArT-based GBS platform was used for the genotypic analysis and SNP calling. Procedures for DNA extraction, complexity reduction, cloning, library construction, cleaning and DNA quality assessment were previously described by Egea et al. (Reference Egea, Mérida-García, Kilian, Hernandez and Dorado2017). Amplification fragments of DNA sequence on the IlluminaHiseq 2500 (www.illumina.com), and the final SNPs called using the DArTsoft analytical pipeline (http://www.diversityarrays.com/darttechnology-package-dartSoft).

Data analysis

SNP filtering and genome characterization

The DArT GBS platform generated SNP characteristics, including missing rate, polymorphism information content (PIC) and frequency of heterozygosity for all SNPs. The filtering of the SNPs was done in two stages to exclude all SNPs unmapped to any of the 12 rice chromosomes and remove redundant markers. Markers with minor allele frequency below 5%, polymorphism information index (PIC) <10% and missing rate of more than 10% were considered redundant. The genome summary plugin in TASSELv.5.2.37 software (Bradbury et al., Reference Bradbury, Zhang, Kroon, Casstevens, Ramdoss and Buckler2007) generated the allele frequency and MAF of markers.

Clustering and structure analysis

Clustering and principal component (PC) analyses of the 176 upland rice germplasm were implemented in TASSELv.5.2.37 software (Bradbury et al., Reference Bradbury, Zhang, Kroon, Casstevens, Ramdoss and Buckler2007). Grouping of genotypes into clusters was obtained using the Unweighted Pair Group Method with Arithmetic Means (UPGMA) and displayed with the Archaeopteryx Tree plugin.

The structure of the population was further estimated using a model-based (Bayesian) analysis in STRUCTUREv2.3.4 (Pritchard et al., Reference Pritchard, Stephens and Donnelly2000). The k-values representing the fixed number of population subgroups (1–10) obtained in four independent analyses. The burn-in iterations and Markov Chain Monte Carlo (MCMC) replication were each programmed at 100,000. The k-value that is the best fit for the population was by Structure Harvester (Earl and Vonholdt, Reference Earl and vonHoldt2012). The CLUMPP files for each k-value with several curves, including that of the log probability of the data [LnP(D)] and derived statistics (ΔK) constructed based on the rate of change in [LnP(D)] between successive K-values were generated from Structure Harvester. The structure of the population was determined by k-value, corresponding to the maximum ΔK. Also, individual Q-values were obtained from the corresponding CLUMPP file generated, and genotypes with Q-value >0.5 belonged to the same group.

Analysis of molecular variance

The results of population structures were subjected to the analysis of molecular variance, which estimates and partitions the pattern of variation within and among the population using GenAlEx v6.5 (Peakall and Smouse, Reference Peakall and Smouse2012). Genetic differentiation between populations was determined using Phi-statistics (PhiPT) value. The number of migrants (Nm value) represents the gene flow index. The probability value for the significance of variance components was estimated based on 1000 standard permutations.

Analysis of allelic pattern

Analysis of allelic patterns across the estimated population structures was implemented in GenAlEx v6.5 (Peakall and Smouse, Reference Peakall and Smouse2012). Genetic diversity within and among the population was determined using the following indices: number of private alleles (Pa), number of different alleles (Na), number of effective alleles (Ne), Shannon's information index (I), diversity (h), unbiased diversity (uh) and percentage of polymorphic loci (%PPL) (Peakall and Smouse, Reference Peakall and Smouse2012).

Results

Genome characterization using DArT-based SNP markers

DArT-based GBS generated 23,000 SNPs for the 176 upland rice germplasm. Out of the 23,000 SNPs, the first stage of filtering excluded 2614 unmapped SNPs while retaining 20,386 mapped to the 12 rice chromosomes (Table 1). SNPs per chromosome ranged from 1090 SNPs on chromosome 9 to 2391 SNPs on chromosome 1, with an average of 1699 per chromosome.

Table 1. Distribution of DArT-SNP markers on the 12 rice chromosomes (filtering at MAF>0.05, missing<0.1 and PIC>0.1)

The second stage of filtering returned 7063 SNPs (Table 1). The SNPs were reasonably distributed across the rice genome, ranging from 432 SNPs (6%) on chromosome 9 to 989 SNPs (14%) on chromosome 1. The PIC ranged from 0.11 to 0.5 and averaged at 0.25 (Fig. 1 and online Supplementary Table S2), while gene diversity (or heterozygosity) of the 7063 SNPs ranged from 0 to 0.57 and averaged at 0.03 (Fig. 2 and online Supplementary Table S2). Almost 98% of the markers had heterozygosity values below 0.1. The minimum MAF was 0.05 (online Supplementary Table S2). The frequencies of transition and transversion SNP types were 0.63 and 0.27, respectively (online Supplementary Table S3).

Fig. 1. Polymorphic information content (PIC) of 7063 DArT-SNP markers for 176 upland rice germplasm.

Fig. 2. Gene diversity (heterozygosity) of 7063 DArT-SNP markers for 176 upland rice germplasm.

Clustering and structure analysis

The clustering analysis using the UPGMA and the PC analysis identified three clusters for the 176 upland germplasm designated as P1, P2 and P3 (online Supplementary Figs. S1 and S2). The model-based (Bayesian) analysis, however, estimated two groups (Q1 and Q2) (Fig. 3) corresponding to the best-fit K-value of 2 (online Supplementary Fig. S3). There were 15 individuals in group Q1 and 161 individuals in Q2 at cluster value 0.11 and 0.89, respectively. The mean fixation index was 0.431 and 0.1618, while the expected heterozygosity was 0.3529 and 0.7203 for group Q1 and Q2, respectively (online Supplementary Table S4). The allele frequency divergence (net nucleotide distance) between Q1 and Q2 equals 0.4752. The individuals in Q2 sub-divides into two subgroups corresponding to P2 and P3 clusters obtained from the clustering and PC analyses (online Supplementary Table S1).

Fig. 3. Population structure of 176 upland rice germplasm base on 7063 SNP markers (K = 2). The colours represent the estimated proportions of the two populations.sgg

Molecular variance and gene flow across the populations

Results from the analysis of molecular variance revealed that all the genetic variance was within the population (100%), and none was attributable to the population structure (0%) estimated from the model-based analysis (Table 2).

Table 2. Summary analysis of molecular variance (AMOVA) for 176 upland rice germplasm based on 7063 DArT-SNP markers

Probability, P (rand  ⩾ data), for PhiPT is based on standard permutation across the full data set.

The PhiPT value for testing the statistical significance of the estimated population structures was 0.001 (P = 0.197), and the number of migrants (Nm) was 622.65.

Allelic patterns across the populations

The average value for the numbers of different (Na), effective (Ne) and private alleles (Pa) across the populations were 1.94, 1.03 and 0.82, respectively (Table 3). Higher values of 2.67 and 1.54 were obtained in Q2 for Na and Pa, while Q1 had 1.22 and 0.10, respectively. The number of effective alleles was 1.03 for both Q1 and Q2.

Table 3. Genetic diversity indices for the two estimated population structures of 176 upland rice germplasm based on 7063 DArT-SNP markers

Na, No. of different alleles; Ne, No. of effective alleles; I, Shannon's information index; Pa, No. of private alleles; h, diversity; uh, unbiased diversity; %PPL, percentage of polymorphic loci.

The population had mean values for Shannon's information index (I) diversity index (h), unbiased diversity index (uh) of 0.7, 0.02 and 0.03, respectively. The average percentage of polymorphic loci was 55.9%, with 20.9% in Q1 and 90.9% in Q2, respectively.

Discussion

The goal of analysing genetic structure and diversity within the population of individuals is to estimate the pattern of genetic variations, determining their further use in plant breeding programmes, including marker-traits analysis and selection of parental lines for linkage analysis. In recent years, molecular markers – particularly the GBS-SNP markers – have proven more efficient in estimating genome-wide genetic diversity (Fischer et al., Reference Fischer, Rellstab, Leuzinger, Roumet, Gugerli, Shimizu, Holderegger and Widmer2017). In the present study, 23,000 SNPs from the DArT-Seq GBS platform were analysed and filtered to retain 7063 polymorphic SNPs distributed across the 12 rice chromosomes. According to Anderson et al. (Reference Anderson, Pettersson, Clarke, Cardon, Morris and Zondervan2010), well-described reasons for marker exclusion include large amounts of missing data, lower variability and genotyping errors.

GBS platforms such as DArT generate large numbers of SNPs for use in genetic analyses and genotyping (Beissinger et al., Reference Beissinger, Hirsch, Sekhon, Foerster, Johnson, Muttoni, Vaillancourt, Buell, Kaeppler and de Leon2013). However, some authors have reported possibilities of redundant markers that may not be functional or useful for specific genetic analyses (Pootakham et al., Reference Pootakham, Shearman, Ruang-areerate, Sonthirod, Sangsrakru, Jomchai, Yoocha, Triwitayakorn, Tragoonrung and Tangphatsornruang2014; Yang et al., Reference Yang, Chen, Chen, Sun, Huang, Zhou, Huang, Wang, Liu, Wang, Chen and Guo2019). The usefulness of a marker for a particular purpose depends on the number of alleles it has and corresponding relative frequency (Shete et al., Reference Shete, Tiwari and Elston2000). All the 7063 SNP passed the quality control analyses used to remove redundant markers, and are considered suitable for estimating genetic diversity and structure of the studied 176 upland rice germplasm.

The results from the assessment of SNP marker distribution across the chromosome indicated that chromosome 9 had the lowest genetic diversity with the fewest number of polymorphic markers. Previous studies suggested that fewer polymorphic markers are an indication of lower genetic diversity (Chao et al., Reference Chao, Zhang, Akhunov, Sherman, Ma, Luo and Dubcovsky2009; Eltaher et al., Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018). The transition SNPs occurred twice as frequently as transversion SNPs in our study, which indicates that mutation does not occur randomly within the markers. The transition SNPs were more frequent than transversions in previous studies on maize (Morton et al., Reference Morton, Bi, McMullen and Gaut2006), Camellia sinensis (Yang et al., Reference Yang, Ren, Ray, Xu, Li, Zhang, Liu, Yao and Kilian2016b), Camelina sativa (Luo et al., Reference Luo, Brock, Dyer, Kutchan, Schachtman, Augustin, Ge, Fahlgren and Abdel-Haleem2019), Brassica napus (Huang et al., Reference Huang, Deng, Guan, Li, Lu, Wang, Fu, Mason, Liu and Hua2013), Brassica rapa (Park et al., Reference Park, Yu, Mun and Lee2010) and may be due to synonymous mutations in protein-coding (Guo et al., Reference Guo, McDowell, Nodzenski, Scholtens, Allen, Lowe and Reddy2017).

The values of PIC and heterozygosity index of markers are indications of the diversity level of the population and molecular markers. The range of PIC values for the studied rice germplasm revealed a moderate level of genetic diversity, which is in line with the use of PIC values for genetic diversity by Botstein et al. (Reference Botstein, White, Skolnick and Davis1980). Results from the present research are also similar to the findings of Luo et al. (Reference Luo, Brock, Dyer, Kutchan, Schachtman, Augustin, Ge, Fahlgren and Abdel-Haleem2019) and Eltaher et al. (Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018), who studied genetic diversity in C. sativa and Nebraska wheat, respectively. Eltaher et al. (Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018) argued that moderate PIC values might be expected due to low mutation rates (Coates et al., Reference Coates, Sumerford, Miller, Kim, Sappington, Siegfried, Siegfried and Lewis2009), as indicated by the ratio of transition to transversion SNPs in this study. It may also be attributed to the bi-allelic nature of SNPs, restricting PIC values to 0.5 when the two alleles have identical frequencies (Eltaher et al., Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018). More so, PIC values of SNPs are usually low compared to that of SSR (Eltaher et al., Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018). The heterozygosity values of below 0.1 found in almost 98% of the markers further reiterate their suitability in estimating the genetic structure and diversity of the studied rice germplasm. Zhu et al. (Reference Zhu, Niu, Shi and Mou2019) estimated genetic diversity in olive using SNP markers with heterozygosity values of <0.1. According to Ndjiondjop et al. (Reference Ndjiondjop, Semagn, Gouda, Kpeki, Dro Tia, Sow, Goungoulou, Sie, Perrier, Ghesquiere and Warburton2017), rice plants are self-pollinators, which may account for the low heterozygosity in this study.

Some previous studies have reported clear genetic structures for many rice germplasm (Garris et al., Reference Garris, Tai, Coburn, Kresovich and McCouch2005; Zhao et al., Reference Zhao, Tung, Eizenga, Wright, Ali, Price, Norton, Islam, Reynolds, Mezey, McClung, Bustamante and McCouch2011; Ndjiondjop et al., Reference Ndjiondjop, Semagn, Sow, Manneh, Gouda, Kpeki, Pegalepo, Wambugu, Sié and Warburton2018). From our results, the cluster and PC analyses revealed three stratifications (P1, P2 and P3), while the model-based analysis structured the population into two distinct groups of 15 (Q1) and 161 (Q2) individuals. The model-based (Bayesian) analysis structured clusters P2 and P3 of the PC analysis into a group, suggesting the two clusters are probably not differentiated.

Furthermore, the stratifications estimated from this study, including the model-based analysis, appear random and do not reflect any specific pedigree, which may be an indication that the genotypes are closely related. The analysis of molecular variance revealed that all the genetic variance was within the population, and none attributed to population structures estimated from either the clustering analysis or the model-based analysis. Previous studies on crops, such as C. sativa (Luo et al., Reference Luo, Brock, Dyer, Kutchan, Schachtman, Augustin, Ge, Fahlgren and Abdel-Haleem2019), wheat (Eltaher et al., Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018), Bangladesh rice (Islam et al., Reference Islam, Khalequzzaman, Prince, Siddique, Rashid, Ahmed, Pittendrigh and Ali2018) and Japanese rice (Yamasaki and Ideta, Reference Yamasaki and Ideta2013) attributed larger proportion of genetic variation to the individual differences than the population structure.

The estimates of PhiPT from AMOVA indicated that there were no significant differences between the groups or clusters obtained from the population structure analysis (PhiPT = 0.001, P = 0.197). Wright (Reference Wright1978), who studied two allelic systems and used FST for population differentiation, suggested that FST (analogue of PhiPT) value of 0.25 indicates good differentiation between population groups, the range 0.15–0.25 indicates moderate differentiation while differentiation is negligible if the FST value is 0.05 or less.

Lack of significant differentiation among the estimated population structure may be due to high genetic exchange and gene flow, as indicated by the high value of the number of migrants (Nm) (Eltaher et al., Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018; Luo et al., Reference Luo, Brock, Dyer, Kutchan, Schachtman, Augustin, Ge, Fahlgren and Abdel-Haleem2019). Nuijten and Richards (Reference Nuijten, Richards, Wopereis, Johnson, Ahmadi, Tollens and Jalloh2013) discussed the mechanisms of gene flow in rice farmer's field, implying that various cultural, as well as agro-ecological factors influence gene flow in rice. The studied germplasm is a mix of Oryza sativa (indica and japonica subspecies) and Oryza glaberrima, which are either released varieties or breeding lines selected for better yielding and adaptation to a specific ecology, and this might have influenced the low genetic variation observed.

A specific pattern of population stratifications (structures) may be a result of selection, genetic drift (Buckler and Thornsberry, Reference Buckler and Thornsberry2002; Breseghello and Sorrells, Reference Breseghello and Sorrells2006) and gene exchange (Eltaher et al., Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018; Luo et al., Reference Luo, Brock, Dyer, Kutchan, Schachtman, Augustin, Ge, Fahlgren and Abdel-Haleem2019), which influences their use in plant breeding programmes (Eltaher et al., Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018). Estimating the patterns of the genetic structure is crucial for rice improvement strategies involving marker-trait association studies, such as genome-wide association scanning for quantitative trait loci of interest and genomic selections. The information obtained from such estimation will build confidence in the outcome of the potential association that may be detected.

Several reports estimated the level of diversity among population groups using indices such as number of private alleles (Pa), number of different alleles (Na), number of effective alleles (Ne), Shannon's information index (I), diversity (h), unbiased diversity (uh) and percentage of polymorphic loci (%PPL) (Peakall and Smouse, Reference Peakall and Smouse2012). Eltaher et al. (Reference Eltaher, Sallam, Belamkar, Emara, Nower, Salem, Poland and Baenziger2018) and Luo et al. (Reference Luo, Brock, Dyer, Kutchan, Schachtman, Augustin, Ge, Fahlgren and Abdel-Haleem2019) both suggested that higher value for these indices is an indication of a higher level of genetic diversity. In our study, comparing genetic diversity indices such as Ne, h and uh also revealed the population structures are not exactly different. However, group Q2 appeared more diverse with higher values of Pa, Na and I. The level of diversity may be attributed to the high gene flow index (Arora et al., Reference Arora, Kundu, Dilbaghi, Sharma and Tiwari2014) but also represents valuable resources for future rice improvement programmes.

In summary, the DArT GBS platform generated high-density markers with reasonable distribution across the whole rice genome. The DArT markers identified two population structures for the 176 upland rice germplasm, which are not significantly differentiated. Nonetheless, substantial diversity existed within the population. These materials may therefore be useful in breeding programmes targeted at recombining favourable alleles within adapted gene pools (McCouch et al., Reference McCouch, Wing, Semon, Venuprasad, Atlin, Sorrells, Jannink, Wopereis, Johnson, Ahmadi, Tollens and Jalloh2013), such as improvement of defensive traits, grain quality characteristics or the improvement of yield per se (Lamkey and Lee, Reference Lamkey and Lee2006; McCouch et al., Reference McCouch, Wing, Semon, Venuprasad, Atlin, Sorrells, Jannink, Wopereis, Johnson, Ahmadi, Tollens and Jalloh2013). More so, introgression of novel alleles from divergent sources (Tanskley and McCouch, Reference Tanksley and McCouch1997; McCouch et al., Reference McCouch, Sweeney, Li, Jiang, Thomson, Septiningsih, Edwards, Moncada, Xiao and Garris2007; Venuprasad et al., Reference Venuprasad, Dalid, Del Valle, Zhao, Espiritu, Sta Cruz, Amante, Kumar and Atlin2009, Reference Venuprasad, Impa, VowdaGowda, Atlin and Serraj2011) and advanced genomic techniques, such as CRISPR/Cas9 technology (Cong et al., Reference Cong, Ran, Cox, Lin, Barretto, Habib, Hsu, Wu, Jiang, Marraffini and Zhang2013; Hsu et al., Reference Hsu, Lander and Zhang2014), may be explored to develop a more diverse population. Our study provided useful insight into the genetic structure and diversity of upland rice germplasm that would guide in planning rice improvement programme in the upland ecology.

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/S1479262120000362

Acknowledgement

We thank the World Bank's Africa Center of Excellence in Agriculture and Sustainable Environment (CEADESE), Federal University of Agriculture, Abeokuta, Nigeria, for providing funds (Grant ACE023) for the genotyping through the Ph.D. Scholarships awarded to Kehinde A. Adeboye and Olayinka E. Oyedeji. We are also grateful to the management of Africa Rice Center, Ibadan, Nigeria, for supplying the plant materials.

References

Alluri, K, Mahsatra, IC and Lawson, TL (1979) Production constraints for upland rice in West Africa. Paper presented at the First Annual Research Conference in Soil and Climatic Resources and Constraints in Relation to Food Crop Production in West Africa, IITA, Ibadan, Nigeria.Google Scholar
Anderson, CA, Pettersson, FH, Clarke, GM, Cardon, LR, Morris, AP and Zondervan, KT (2010) Data quality control in genetic case-control association studies. Nature Protocols 5: 15641573.CrossRefGoogle ScholarPubMed
Arora, A, Kundu, S, Dilbaghi, N, Sharma, I and Tiwari, R (2014) Population structure and genetic diversity among Indian wheat varieties using microsatellite (SSR) markers. Australian Journal of Crop Science 8: 12811289.Google Scholar
Beissinger, TM, Hirsch, CN, Sekhon, RS, Foerster, JM, Johnson, JM, Muttoni, G, Vaillancourt, B, Buell, CR, Kaeppler, SM and de Leon, N (2013) Marker density and read depth for genotyping populations using genotyping-by-sequencing. Genetics 193: 10731081.CrossRefGoogle ScholarPubMed
Botstein, D, White, RL, Skolnick, M and Davis, RW (1980) Construction of a genetic linkage map in man using restriction fragment length polymorphisms. American Journal of Human Genetics 32: 314331.Google ScholarPubMed
Bradbury, PJ, Zhang, Z, Kroon, DE, Casstevens, TM, Ramdoss, Y and Buckler, ES (2007) TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics (Oxford, England) 23: 26332635.CrossRefGoogle ScholarPubMed
Breseghello, F and Sorrells, ME (2006) Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics 172: 11651177.CrossRefGoogle ScholarPubMed
Buckler, ES and Thornsberry, JM (2002) Plant molecular diversity and applications to genomics. Current Opinion in Plant Biology 5: 107111.CrossRefGoogle ScholarPubMed
Ceccarelli, S, Grando, S, Maatougui, M, Michael, M, Slash, M, Haghparast, R, Rahmanian, M, Taheri, A, Al-Yassin, A, Benbelkacem, A, Labdi, M, Mimoun, H and Nachit, M (2010) Plant breeding and climate changes. The Journal of Agricultural Science 148: 627637.CrossRefGoogle Scholar
Chao, S, Zhang, W, Akhunov, E, Sherman, J, Ma, Y, Luo, MC and Dubcovsky, J (2009) Analysis of gene-derived SNP marker polymorphism in US wheat (Triticum aestivum L.) cultivars. Molecular Breeding 23: 2333.CrossRefGoogle Scholar
Chen, J, Zavala, C, Ortega, N, Petroli, C, Franco, J, Burgueño, J, Costich, DE and Hearne, SJ (2016) The development of quality control genotyping approaches: a case study using elite maize lines. PLoS ONE 11: e0157236.CrossRefGoogle ScholarPubMed
Coates, BS, Sumerford, DV, Miller, NJ, Kim, KS, Sappington, TW, Siegfried, BD, Siegfried, BD and Lewis, LC (2009) Comparative performance of single nucleotide polymorphism and microsatellite markers for population genetic analysis. Journal of Heredity 100: 556564.CrossRefGoogle ScholarPubMed
Cobb, JN, Declerck, G, Greenberg, A, Clark, R and McCouch, S (2013) Next-generation phenotyping: requirements and strategies for enhancing our understanding of genotype-phenotype relationships and its relevance to crop improvement. Theoretical and Applied Genetics 126: 867887.CrossRefGoogle ScholarPubMed
Cong, L, Ran, FA, Cox, D, Lin, S, Barretto, R, Habib, N, Hsu, PD, Wu, X, Jiang, W, Marraffini, LA and Zhang, F (2013) Multiplex genome engineering using CRISPR/Cas systems. Science (New York, N.Y.) 339: 819823.CrossRefGoogle ScholarPubMed
Diagne, A, Amovin-Assagba, E, Futakuchi, K, and Wopereis, MCS (2013) Estimation of cultivated area, number of farming households and yield for major rice-growing environments in Africa. In: Wopereis MCS, Johnson DE, Ahimadi N, Tollens E, and Jalloh A (Eds.), Realizing Africa's Rice Promise. Wallingford: CAB Publishing, pp. 3545.CrossRefGoogle Scholar
Dreccer, MF, Bonnett, D and Lafarge, T (2013) Plant breeding under a changing climate. In: Christou, P, Savin, R, Costa-Pierce, BA, Misztal, I and Whitelaw, CBA (eds) Sustainable Food Production. New York: Springer, pp. 12961307.CrossRefGoogle Scholar
Earl, DA and vonHoldt, BM (2012) Structure harvester: a website and program for visualizing structure output and implementing the Evanno method. Conservation Genetic Resources 4: 359361.CrossRefGoogle Scholar
Egea, LA, Mérida-García, R, Kilian, A, Hernandez, P and Dorado, G (2017) Assessment of genetic diversity and structure of large garlic (Allium sativum) germplasm bank, by diversity arrays technology ‘genotyping-by-sequencing’ platform (DArTseq). Frontiers in Genetics 8: 98.10.3389/fgene.2017.00098CrossRefGoogle Scholar
Eltaher, S, Sallam, A, Belamkar, V, Emara, HA, Nower, AA, Salem, KFM, Poland, J and Baenziger, PS (2018) Genetic diversity and population structure of F3:6 Nebraska winter wheat genotypes using genotyping-by-sequencing. Frontiers in Genetics 9: 76.CrossRefGoogle ScholarPubMed
Fischer, MC, Rellstab, C, Leuzinger, M, Roumet, M, Gugerli, F, Shimizu, KK, Holderegger, R and Widmer, A (2017) Estimating genomic diversity and population differentiation – an empirical comparison of microsatellite and SNP variation in Arabidopsis halleri. BMC Genomics 18: 69.CrossRefGoogle ScholarPubMed
Garris, AJ, Tai, TH, Coburn, J, Kresovich, S and McCouch, S (2005) Genetic structure and diversity in Oryza sativa L. Genetics 169: 16311638.10.1534/genetics.104.035642CrossRefGoogle ScholarPubMed
Gassner, A, Harris, D, Mausch, K, Terheggen, A, Lopes, C, Finlayson, R and Dobie, P (2019) Poverty eradication and food security through agriculture in Africa: rethinking objectives and entry points. Outlook on Agriculture 48: 309315.CrossRefGoogle ScholarPubMed
Guo, C, McDowell, IC, Nodzenski, M, Scholtens, DM, Allen, AS, Lowe, WL and Reddy, TE (2017) Transversions have larger regulatory effects than transitions. BMC Genomics 18: 394.CrossRefGoogle ScholarPubMed
Huang, SM, Deng, LB, Guan, M, Li, J, Lu, K, Wang, HZ, Fu, D, Mason, AS, Liu, S and Hua, W (2013) Identification of genome-wide single nucleotide polymorphisms in allopolyploid crop Brassica napus. BMC Genomics 14: 717.CrossRefGoogle ScholarPubMed
Hsu, PD, Lander, ES and Zhang, F (2014) Development and applications of CRISPR-Cas9 for genome engineering. Cell 157: 12621278.10.1016/j.cell.2014.05.010CrossRefGoogle ScholarPubMed
Islam, MZ, Khalequzzaman, M, Prince, MFRK, Siddique, MA, Rashid, ESMH, Ahmed, MSU, Pittendrigh, BR and Ali, MP (2018) Diversity and population structure of red rice germplasm in Bangladesh. PLoS ONE 13: e0196096.CrossRefGoogle ScholarPubMed
Lamkey, KR and Lee, M (Eds.) (2006) Plant Breeding: The Arnel. R. Hallauer International Symposium. Ames: Blackwell Publishing, pp. 293309.CrossRefGoogle Scholar
Lenaerts, B, Collard, BCY and Demont, M (2019) Improving global food security through accelerated plant breeding. Plant Science 287: 110207.CrossRefGoogle ScholarPubMed
Luo, Z, Brock, J, Dyer, JM, Kutchan, T, Schachtman, D, Augustin, M, Ge, Y, Fahlgren, N and Abdel-Haleem, H (2019) Genetic diversity and population structure of a Camelina sativa spring panel. Frontiers in Plant Science 10: 184.CrossRefGoogle ScholarPubMed
McCouch, S, Sweeney, M, Li, J, Jiang, H, Thomson, M, Septiningsih, E, Edwards, J, Moncada, P, Xiao, J and Garris, A (2007) Through the genetic bottleneck: O. rufipogon as a source of trait-enhancing alleles for O. sativa. Euphytica 154: 317339.CrossRefGoogle Scholar
McCouch, S, Wing, RA, Semon, M, Venuprasad, R, Atlin, G, Sorrells, ME and Jannink, JL (2013) Making rice genomics work for Africa. In: Wopereis, MCS, Johnson, DE, Ahmadi, N, Tollens, E and Jalloh, A (eds.) Realizing Africa's rice promise. Wallingford: CABI Publishing, pp 108129.CrossRefGoogle Scholar
Morton, BR, Bi, IV, McMullen, MD and Gaut, BS (2006) Variation in mutation dynamics across the maize genome as a function of regional and flanking base composition. Genetics 172: 569577.CrossRefGoogle ScholarPubMed
Nazzicari, N, Biscarini, F, Cozzi, P, Brummer, EC and Annicchiarico, P (2016) Marker imputation efficiency for genotyping-by-sequencing data in rice (Oryza sativa) and alfalfa (Medicago sativa). Molecular Breeding 36: 69.CrossRefGoogle Scholar
Ndjiondjop, MN, Semagn, K, Gouda, AC, Kpeki, SB, Dro Tia, D, Sow, M, Goungoulou, A, Sie, M, Perrier, X, Ghesquiere, A and Warburton, ML (2017) Genetic variation and population structure of Oryza glaberrima and development of a mini-core collection using DArTseq. Frontiers in Plant Science 8: 1748.10.3389/fpls.2017.01748CrossRefGoogle ScholarPubMed
Ndjiondjop, MN, Semagn, K, Sow, M, Manneh, B, Gouda, AC, Kpeki, SB, Pegalepo, E, Wambugu, P, Sié, M and Warburton, ML (2018) Assessment of genetic variation and population structure of diverse rice genotypes adapted to lowland and upland ecologies in Africa using SNPs. Frontiers in Plant Science 9: 446.CrossRefGoogle ScholarPubMed
Nuijten, E and Richards, P (2013) Gene flow in farmers’ field. In: Wopereis, MCS, Johnson, DE, Ahmadi, N, Tollens, E and Jalloh, A (eds) Realizing Africa's Rice Promise. Wallingford: CABI Publishing, pp 95-107.10.1079/9781845938123.0095CrossRefGoogle Scholar
Oikeh, SO, Nwilene, FE, Agunbiade, FA, Oladimeji, O, Ajayi, O, Mande, S, Tsunematsu, H and Samejima, H (2008) Growing Upland Rice: A Production Handbook. Africa Rice Center, Benin, p. 5.Google Scholar
Park, S, Yu, HJ, Mun, JH and Lee, SC. (2010). Genome-wide discovery of DNA polymorphism in Brassica rapa. Molecular Genetics and Genomics 283: 135145.CrossRefGoogle ScholarPubMed
Peakall, R and Smouse, PE (2012) GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research – an update. Bioinformatics (Oxford, England) 28: 25372539.CrossRefGoogle ScholarPubMed
Poland, JA and Rife, TW (2012) Genotyping-by-sequencing for plant breeding and genetics. Plant Genetics 5: 92102.Google Scholar
Pootakham, W, Shearman, JR, Ruang-areerate, P, Sonthirod, C, Sangsrakru, D, Jomchai, N, Yoocha, T, Triwitayakorn, K, Tragoonrung, S and Tangphatsornruang, S (2014) Large-scale SNP discovery through DNA sequencing and SNP genotyping by target enrichment sequencing in cassava (Manihot esculenta Crantz). PLoS ONE 9: e116028.CrossRefGoogle Scholar
Pritchard, JK, Stephens, M and Donnelly, P (2000) Inference of population structure using multilocus genotype data. Genetics 155: 945959.Google ScholarPubMed
Ramadan, E, Elmoghazy, A and Mowafi, H (2015) Molecular markers based genetic diversity analysis for drought tolerance in rice (Oryza sativa, L) using SSR markers. International Journal of Scientific Research in Agricultural Sciences 2: 137146.Google Scholar
Ren, R, Ray, R, Li, P, Xu, J, Zhang, M, Liu, G, Yao, X, Kilian, A and Yang, K (2015) Construction of a high-density DArTseq SNP-based genetic map and identification of genomic regions with segregation distortion in a genetic population derived from a cross between feral and cultivated-type watermelon. Molecular Genetics and Genomics 290: 14571470.CrossRefGoogle Scholar
Saito, K, Asai, H, Zhao, D, Laborte, AG and Grenier, C (2018) Progress in varietal improvement for increasing upland rice productivity in the tropics. Plant Production Science 21: 145158.CrossRefGoogle Scholar
Saito, K, Sokei, Y and Wopereis, MCS (2012) Enhancing rice productivity in West Africa through genetic improvement. Crop Science 52: 484494.CrossRefGoogle Scholar
Sansaloni, C, Petroli, C, Jaccoud, D, Carling, J, Detering, F, Grattapaglia, D and Kilian, A (2011) Diversity Arrays Technology (DArT) and next-generation sequencing combined: genome-wide, high throughput, highly informative genotyping for molecular breeding of Eucalyptus. BMC Proceedings 5: P54.CrossRefGoogle Scholar
Shete, S, Tiwari, H and Elston, RC (2000) On estimating the heterozygosity and polymorphism information content value. Theoretical Population Biology 57: 265271.CrossRefGoogle ScholarPubMed
Tanksley, SD and McCouch, S (1997) Seed banks and molecular maps: unlocking genetic potentials from the wild. Science (New York, N.Y.) 277: 10631066.CrossRefGoogle Scholar
Tollens, E, Demont, M, Sié, M, Diagne, A, Saito, K and Wopereis, MCS (2013) From WARDA to AfricaRice: an overview of rice research for development activities conducted in partnership in Africa. In: Wopereis, MCS, Johnson, DE, Ahmadi, N, Tollens, E and Jalloh, A (eds) Realizing Africa's Rice Promise. Wallingford: CABI Publishing, pp. 188203.Google Scholar
Venuprasad, R, Dalid, CO, Del Valle, M, Zhao, D, Espiritu, M, Sta Cruz, MT, Amante, M, Kumar, A and Atlin, GN (2009) Identification and characterization of large-effect quantitative trait loci for grain yield under lowland drought stress in rice using bulk-segregant analysis. Theoretical and Applied Genetics 120: 177190.CrossRefGoogle ScholarPubMed
Venuprasad, R, Impa, SM, VowdaGowda, RP, Atlin, GN and Serraj, R (2011) Rice near-isogenic-lines (NILs) contrasting for grain yield under lowland drought stress. Field Crops Research 123: 3846.10.1016/j.fcr.2011.04.009CrossRefGoogle Scholar
Wassmann, R, Jagadish, S, Heuer, S, Ismail, A, Redona, E, Serraj, R, Singh, RK, Howell, G, Pathak, H and Sumfleth, K (2009) Climate change affecting rice production: the physiological and agronomic basis for possible adaptation strategies. Advances in Agronomy 101: 59122.CrossRefGoogle Scholar
Wright, S (1978) Evolution and the Genetics of Populations, Volume 4: Variability Within and Among Natural Populations. Chicago: University of Chicago Press.Google Scholar
Yamasaki, M and Ideta, O (2013) Population structure in Japanese rice population. Breeding Science 63: 4957.CrossRefGoogle ScholarPubMed
Yang, X, Ren, R, Ray, R, Xu, J, Li, P, Zhang, M, Liu, G, Yao, X and Kilian, A (2016a) Genetic diversity and population structure of core watermelon (Citrullus lanatus) genotypes using DArTseq-based SNPs. Plant Genetic Resources 14: 226233.CrossRefGoogle Scholar
Yang, H, Wei, CL, Liu, HW, Wu, JL, Li, ZG, Zhang, L, Jian, JB, Li, YY, Tai, YL, Zhang, J, Zhang, ZZ, Jiang, CJ, Xia, T and Wan, XC (2016b). Genetic divergence between Camellia sinensis and its wild relatives revealed via genome-wide SNPs from RAD sequencing. PLoS ONE 11: e0151424.CrossRefGoogle Scholar
Yang, G, Chen, S, Chen, L, Sun, K, Huang, C, Zhou, D, Huang, Y, Wang, J, Liu, Y, Wang, H, Chen, Z and Guo, T (2019) Development of a core SNP arrays based on the KASP method for molecular breeding of rice. Rice 12: 21.CrossRefGoogle ScholarPubMed
Zhao, K, Tung, CW, Eizenga, GC, Wright, MH, Ali, ML, Price, AH, Norton, GJ, Islam, MR, Reynolds, A, Mezey, J, McClung, AM, Bustamante, CD and McCouch, SR. (2011) Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nature Communications 2: 467.CrossRefGoogle ScholarPubMed
Zhu, S, Niu, E, Shi, A and Mou, B (2019) Genetic diversity analysis of olive germplasm (Olea europaea L.) with genotyping-by-sequencing technology. Frontiers in Genetics 10: 755.10.3389/fgene.2019.00755CrossRefGoogle ScholarPubMed
Figure 0

Table 1. Distribution of DArT-SNP markers on the 12 rice chromosomes (filtering at MAF>0.05, missing<0.1 and PIC>0.1)

Figure 1

Fig. 1. Polymorphic information content (PIC) of 7063 DArT-SNP markers for 176 upland rice germplasm.

Figure 2

Fig. 2. Gene diversity (heterozygosity) of 7063 DArT-SNP markers for 176 upland rice germplasm.

Figure 3

Fig. 3. Population structure of 176 upland rice germplasm base on 7063 SNP markers (K = 2). The colours represent the estimated proportions of the two populations.sgg

Figure 4

Table 2. Summary analysis of molecular variance (AMOVA) for 176 upland rice germplasm based on 7063 DArT-SNP markers

Figure 5

Table 3. Genetic diversity indices for the two estimated population structures of 176 upland rice germplasm based on 7063 DArT-SNP markers

Supplementary material: File

Adeboye et al. supplementary material

Adeboye et al. supplementary material

Download Adeboye et al. supplementary material(File)
File 420.4 KB