Introduction
In Brazil, forests have been indiscriminately exploited, which threatens the continuation of biomes (Francoso et al., Reference Francoso, Haidar and Machado2016; Silva et al., Reference Silva, Millington, Moran, Batistella and Liu2020). Plans for conserving Brazil's natural ecosystems for future generations are urgently needed along with strategies for seed collection for environmental reforestation. One example is the Brazilian Atlantic Forest, where the conservation of the ecosystem is a high priority due to its vast species diversity, significant levels of endemism and the extensive fragmentation of the biome (Dinerstein et al., Reference Dinerstein, Olson, Graham, Webster, Primm, Bookbinder and Ledec1995; Colombo and Joly, Reference Colombo and Joly2010; Silva et al., Reference Silva, Millington, Moran, Batistella and Liu2020).
Ex situ conservation is a complementary strategy to in situ conservation, both of which help to avoid the extinction of species, populations or individuals (Degen and Sebbenn, Reference Degen, Sebbenn, Pancel and Kölh2014; Hausman et al., Reference Hausman, Bertke, Jaeger and Rocha2014). This can be done using experimental field designs such as provenance and progeny tests (PTs), which also enable the estimation of genetic variability and heritability of phenotypic traits (Sebbenn, Reference Sebbenn, Higa and Silva2006). Such provenance and PTs can be transformed into seed orchards by selecting provenances, progenies, individuals within progenies and seeds that can be used as sources for environmental reforestation (Hodgkin et al., Reference Hodgkin, Rao, Cibrian-Jaramillo and Gaiji2003; Sebbenn et al., Reference Sebbenn, Boshier, Freitas, Zanatto, Sato, Ettori and Moraes2007; Bakkali et al., Reference Bakkali, Haouane, Hadiddou, Oukabli, Santoni, Udupa, Van Damme and Khadari2013). Forest reforestation based on seedlings that originate from a large number of seed trees (>25) can also be considered an ex situ conservation strategy (Sebbenn, Reference Sebbenn, Higa and Silva2006).
The levels of genetic diversity retained in ex situ conservation trials and reforestation areas depend on the reference effective population size (N e) of the population used to establish these areas (Sebbenn, Reference Sebbenn, Higa and Silva2006). The effective population size values for species with overlapping generations, such as trees, range from 70 to 150 (Nunney and Campbell, Reference Nunney and Campbell1993; Caballero et al., Reference Caballero, Bravo and Wang2016), and values ranging from 1000 to 5000 (Nunney and Campbell, Reference Nunney and Campbell1993; Lande, Reference Lande1995) have been suggested for short- and long-term conservation purposes, respectively. The sampled N e may change between seed germination (effective sample) and adult stages (realized sample) due to the effects of deterministic process, such as inbreeding and outbreeding depression, and stochastic process, including random mortality, disease and pest attacks, which can decrease the levels of inbreeding and change the levels of relatedness among progenies (Sebbenn, Reference Sebbenn, Higa and Silva2006; Feres et al., Reference Feres, Sebbenn, Guidugli, Mestriner, Moraes and Alzate-Marin2012). Thus, understanding the mating pattern of the genetic material of source populations is critical to determine the number of seed trees needed to guarantee the reference effective population size indicated for ex situ conservation in trials and reforested areas (Sebbenn, Reference Sebbenn, Higa and Silva2006; Degen and Sebbenn, Reference Degen, Sebbenn, Pancel and Kölh2014).
Among the tree species that occur in the Atlantic Forest, the dioecious tree Genipa americana L. (Rubiaceae) is both ecologically and economically valuable: its wood is used in construction, and for furniture, tool handles and general carpentry; its bark is used for treating hides in tanneries as it is rich in tannins and the edible fruits also provide a bluish-green dye that is used for a variety of purposes. The species occurs in several forest formations throughout tropical America, from Mexico to Brazil (Carvalho, Reference Carvalho1994), suggesting high-genetic diversity within and among populations. Population density is variable among sites (<1–25 trees/ha) and the trees have been used for reforestation in swampy and degraded areas (Durigan and Nogueira, Reference Durigan and Nogueira1990; Manoel et al., Reference Manoel, Freitas, Furlani Junior, Alves, Moraes and Sebbenn2017). It also commonly occurs in riparian forests. As adults, the tree can reach 25 m in height and 90 cm in diameter at breast height (DBH) (Carvalho, Reference Carvalho1994). The dispersal of its fruits occurs by gravity, zoochory and hydrochory (Carvalho, Reference Carvalho1994). The species is insect pollinated, particularly by bumblebees such as Bombus morio and Epicharis rustica var. flava (Crestana et al., Reference Crestana, Toledo Filho and Campos2004). However, despite the ecological and economic importance of G. americana, there are few studies on its mating system and gene flow, and such studies are crucial for the effective conservation of the species (Manoel et al., Reference Manoel, Freitas, Furlani Junior, Alves, Moraes and Sebbenn2015a, Reference Manoel, Freitas, Furlani Junior, Alves, Moraes and Sebbenn2017).
To investigate the differences between the effective and realized samples from natural populations (NPs) for ex situ conservation, we compared the mating system, genetic diversity, biparental inbreeding and effective size between two G. americana PTs, one with open-pollinated offspring at 6 months of age established in a nursery PT (effective mating system), and the other with open-pollinated trees at 14 years of age established in field PT (realized mating system). We specifically sought to answer the following: (i) Are there biparental inbred and full-sib individuals in the trials due to mating among relatives and correlated mating, respectively? (ii) Is the paternity correlation higher within than among fruits? (iii) What is the mean pollen dispersal distance? (iv) Is the effective population size different between effective and realized mating samples? (v) Is the effective population size of the trials sufficient for ex situ conservation of the populations?
Materials and methods
Study site and sampling
The study was carried out in two PTs originated from open-pollinated seeds sampled from two populations (distance >1900 km) in Brazil (Fig. 1). The first population is a forest fragment (21 ha) that is a remnant of a NP in the municipality of Arauá, Sergipe State, Brazil (11°15′30″S, 37°36′55″W, elevation of 73 m). The population occurs in a transition zone between the Atlantic Forest and Caatinga biomes. The climate is megathermal and sub-humid with an average annual temperature of 24.6°C and mean annual precipitation of 863.3 mm (Sergipe, Reference Sergipe2000). In 2014, we sampled open-pollinated seeds from 30 seed trees, with 28 seeds per tree (four fruits, eight seeds per fruit) for a total of 960 seeds. Mother trees were georeferenced using GPS-III (Garmin, USA), measured for DBH, and leaves were sampled for DNA analysis. Identification of seeds, fruits and seed trees were maintained for hierarchical analysis of the mating system within and among fruits. Seeds were germinated in a nursery PT, established in a randomized complete block experimental design, with 30 open-pollinated progenies, four blocks and seven plants per plot, for a total of 840 individuals. We sampled leaves from all 488 surviving individuals (58.1%) for DNA analysis, for a total of 30 progenies, ranging from 4 to 20 individuals per progeny, from 2 to 4 fruits per tree and 1 to 8 seeds per fruit.
The second population is a PT established in 2000 as an Active Germplasm Bank of the Restoration and Ecosystem Conservation Division of CESP (Companhia Energética de São Paulo), in the municipality of Rosana, São Paulo State, Brazil (22°30′38.84″S, 52°57′23.20″W, elevation of 306 m). The PT was established to offset the impacts of the construction of the Sérgio Motta Hydroelectric Power Facility in the municipality of Porto Primavera, located in the Pontal do Paranapanema region. Seeds to establish the PT were collected from 30 seed trees located in the currently flooded reservoir of the dam, as such the population no longer exists. The experimental design to establish the PT was a randomized complete block, with 30 open-pollinated progenies, three blocks and eight plants per plot, using a 3 × 2 m spacing, for a total of 720 individuals. The climate is megathermal and sub-humid with an average annual temperature of 21.9°C and mean annual precipitation of 1200 mm (Francisco, Reference Francisco1989). According to the Brazilian System of Soil Conservation, the soil of the site is an oxisol with a moderate, medium texture (Oliveira Reference Oliveira1999), and the relief is relatively flat to undulating. For DNA analysis, we sampled leaves in 2014 from all 584 surviving trees in PT (81.1%), for a total sample of 30 progenies, ranging from 15 to 24 individuals per progeny.
Microsatellite genotyping
DNA extraction was based on the method described by Doyle and Doyle (Reference Doyle and Doyle1990). We used seven species-specific primers developed for G. americana that present Mendelian segregation and are not genetically linked (Manoel et al., Reference Manoel, Freitas, Tambarussi, Cambuim, Moraes and Sebbenn2015b): Gam01, Gam06, Gam08, Gam11, Gam24, Gam28 and Gam41 (Manoel et al., Reference Manoel, Freitas, Barreto, Moraes, Souza and Sebbenn2014). Genotyping was conducted on an ABI3130xl (Applied Biosystems, USA) using the protocol described by Schuelke (Reference Schuelke2000) with the addition of a M13 tail in the forward primer of each locus. Alleles were read in GeneMapper 5.0 software (Applied Biosystems, USA).
Analysis of genetic diversity and structure
Genetic diversity for NP seed trees and offspring of both populations was estimated as a mean across loci for the following indices: total number of alleles per locus (K); allelic richness (R) and observed (H o) and expected (H e) heterozygosity. To investigate biparental inbreeding in NP seed trees, the fixation index (F) was used and its statistical significance was tested using a Monte Carlo permutation of alleles among individuals, in FSTAT 2.9.3.2 software (Goudet, Reference Goudet1995). For offspring, F values estimated for open-pollinated progenies may be biased due to overestimates in the gene frequencies of maternal alleles, as each plant within the progeny received at least one maternal allele. Thus, for progenies we calculated the F values as described in Manoel et al. (Reference Manoel, Freitas, Tambarussi, Cambuim, Moraes and Sebbenn2015b), which uses the pollen pool frequency as the parental gene frequencies. The frequency of null alleles (Null) was estimated for adults and offspring of each population based on the inbreeding population model (IIM) in INEST 2.0 software (Chybicki and Burczyk, Reference Chybicki and Burczyk2009). To test if these indices were significantly different between cohorts (seed trees and offspring), a Jackknife among loci test was used. As the trials were structured in progenies, with gene frequency likely affected by maternal alleles within progenies, we used the offspring's pollen allele pool frequencies to determine the genetic differentiation ($G_{{\rm st}}^{\rm ^{\prime}} $) between populations. This method may better represent the genetic diversity of the source populations by capturing a larger number of alleles. To do so, the method developed for microsatellite loci by Hedrick (Reference Hedrick2005) was used, and the pollen allele pool frequency was calculated for offspring of each population using MLTR 3.4 software (Ritland, Reference Ritland2002). From this, we estimated the expected heterozygosity for each locus ($H_{\rm e} = 1-\sum\nolimits_{i = 1}^k {p_i^2 } $, where p i is the frequency of the i-th allele and k is the number of alleles at the locus under consideration) and mean heterozygosity over loci within populations ($\overline H _{\rm e}$). We also estimated the mean genetic diversity within populations ($H_{\rm s} = \lpar n_1\overline H _{{\rm e}1} + n_2\overline H _{{\rm e}2}\rpar /\lpar n_1 + n_2\rpar $, where n 1 and n 2 are the sample size in NP and PT populations, respectively, and $\overline H _{{\rm e}1}$ and $\overline H _{{\rm e}2}$ are the expected heterozygosity in NP and PT populations, respectively) and for all populations ($H_{\rm T} = 1-\sum\nolimits_{i = 1}^k {\overline p _i^2 } $, where $\overline p _i$ is the mean frequency of the i-th allele between populations and k is the number of alleles in the locus in consideration). The indices H s and H T were used to estimate the genetic differentiation between populations, (G st = 1 − (H s/H T)) and to calculate the standardized genetic differentiation ($G_{{\rm st}}^{\rm ^{\prime}} = G_{{\rm st}}\lpar 1 + H_{\rm s}\rpar /\lpar 1-H_{\rm s}\rpar $) (Hedrick, Reference Hedrick2005).
Pollen dispersal
The distance of pollen dispersal was estimated only for NP, since the geographical coordinates of seed trees are known. As only seed trees and open-pollinated offspring were genotyped, mean pollen dispersal distance was estimated using the TWOGENER model, as implemented in POLDIST software (Robledo-Arnuncio et al., Reference Robledo-Arnuncio, Austerlitz and Smouse2007). This model requires information about the effective reproductive population density and because we did not have this value, we used the Kindist module as it does not depend on prior knowledge of the effective density of pollen donors. The Kindist model is based on the expected decrease in pairwise normalized correlated paternity (ψ (z)) in open-pollinated progenies on the standard distance (Z) between pairwise progenies of the sampled seed trees (Robledo-Arnuncio et al., Reference Robledo-Arnuncio, Austerlitz and Smouse2007). We also estimated the global pollen pool differentiation among seed trees (ϕ ft). To estimate the dispersion kernel curve, the paternity correlation must decrease with an increase in spatial distance between seed trees. POLDIST gives the mean pollen dispersal distance and parameters of the dispersion curve as a function of an assumed dispersion. Here, we used the exponential dispersion model, which is the most suitable for species that may have long-distance dispersal (Smouse and Sork, Reference Smouse and Sork2004), such as G. americana which is pollinated by bees. The estimated parameters were the scale of pollen dispersion (a), the shape of the dispersion (b) and the mean and standard deviation of pollen dispersion.
Mating system analysis
Analysis of the mating system was based on the mixed mating model and correlated mating model, implemented in MLTR 3.4 software (Ritland, Reference Ritland2002). Analyses were estimated at the population and individual progeny levels, using the expectation maximization (EM) method for numerical resolution. The estimated indices were: ovule and pollen gene frequencies; fixation index of seed trees (F m); multilocus outcrossing rate (t m); single-locus outcrossing rate (t s); mating among related individuals (1 − t s) and multilocus paternity correlation (r p). As in NP, our sample was hierarchical within and among fruits. Thus, the multilocus paternity correlation was also estimated within (r pw) and among (r pa) fruits, excluding fruits with only one offspring, for a total sample of 470. The 95% confidence interval of the indices was obtained by 1000 bootstrap resampling, using progenies as the resampling unit for population level analysis and individuals within progenies for the progeny level analysis. We also estimated the effective number of pollen donors as N ep = 1/r p, the average within-progeny coancestry coefficient (Θ) as Θ = 0.125(1 + F m)(1 + r p) (Sousa et al., Reference Sousa, Sebbenn, Hattemer and Ziehe2005), and the variance effective size within progenies (N e) as:
where n is the sample size within progenies and F is the mean inbreeding coefficient within progenies (Sebbenn, Reference Sebbenn, Higa and Silva2006). The number of seed trees for seed collection was calculated as m = N e(r)/N e, assuming that the objective was to retain a reference effective size (N e(r)) of 150 (Sebbenn, Reference Sebbenn, Higa and Silva2006) in the total sampled progeny arrays. The total variance effective size (N eT) in the PTs was estimated assuming absence of relatedness between individuals of different progenies by $N_{{\rm eT}} = \sum\nolimits_{i = 1}^m {N_{\rm e}} $. To determine if the sample size within progeny (n), H o, F o, N ep, Θ and N e were associated, we used the Spearman's rank correlation coefficient (ρ).
Results
Genetic diversity
For the total sample from both populations, the number of alleles per locus ranged from 8 to 23, for a total of 96 alleles (K) and an average of 13.7. We detected 19 private alleles in NP and 15 in PT. The allelic richness (R), expected heterozygosity (H e) and fixation index (F) were significantly lower in NP seed trees (R = 6.5, H e = 0.70, F = −0.19) than in offspring of both populations (R = 7.4–7.5, H e = 0.77–0.79, F = −0.04 to 0.05) (Table 1). In contrast, the observed heterozygosity (H o) was significantly higher in NP seed trees (H o = 0.84) and PT offspring (H o = 0.82) than in NP offspring (H o = 0.73). The fixation index (F) was significant and lower than zero for the NP seed tree population, and not significantly different from zero for offspring of both populations (−0.04 to 0.05). These results suggest selection for heterozygous individuals among seed trees and an absence of biparental inbreeding in offspring. Null alleles (Null) were detected at low frequencies (maximum 0.11) in some loci among offspring, suggesting that null alleles did not significantly bias our estimates of mean F values. The genetic differentiation between PT and NP populations ($G_{{\rm st}}^{\rm ^{\prime}} $) was significantly greater than zero for the offspring pollen gene pool (0.52 ± 0.08, mean ± standard error), indicating that similar levels of genetic diversity (50%) was distributed within and between populations.
SD is the standard deviation; for offspring the F was estimated using the H e calculated from gene frequencies in the pollen pool; *P < 0.05. Different letters indicate significant differences (P < 0.05) between samples, based on a Jackknife test.
Pollen dispersion
Spearman's rank correlation coefficient between pairwise correlated paternity and distance between seed trees in NP (ρ = −0.126) was significantly negative (P = 0.047) and showed a downward trend with increasing distance between seed trees (Fig. 2 and online Supplementary Table S1). It was therefore possible to estimate the mean pollen dispersal distance in the population. Using the exponential power dispersion model, the scale of the pollen dispersion parameter (a) was 89.5, the shape of the dispersion parameter (b) was 0.415, and the mean pollen dispersal distance was 179 ± 18.9 m (± standard deviation).
Mating system
The outcrossing rate (t m) was not significantly different from unity (1) for the studied populations, indicating that all offspring originated from outcrossing, as expected for a dioecious species (Table 2). In NP, the rate of mating among relatives (1 − t s = 0.15) was significantly higher than zero, indicating that some offspring present biparental inbreeding. The multilocus paternity correlation within and among fruits (r p) was significantly higher than zero in both populations, indicating that some offspring are full-sibs, particularly in the NP population. The effective number of pollen donors (N ep) was significantly higher in PT (13) than NP (2.9). Due to higher correlated mating in NP, the within progeny coancestry coefficient (Θ) was also higher in NP (0.168) than PT (0.135) and the effective size within progenies (N e) was lower in NP (2.69) than PT (3.27). Consequently, the number of seed trees (m) required for seed collection was higher in NP (56) than PT (46). The total variance effective size (N eT) was higher in PT (98.1) than in NP (88.7). In NP, the paternity correlation within fruits (r pw = 0.40) was significantly higher than among fruits (r pa = 0.26), indicating that about 2.5 effective pollen donors fertilized the fruits (N epw) and 3.8 pollen donors fertilized each tree (N epa).
95% CI is the 95% confidence interval.
In general, the outcrossing rate (t m) estimated at the individual level was not significantly different from unity, ranging in NP from 0.93 to 0.99 and in PT from 0.96 to 0.99 (online Supplementary Tables S2 and S3). The Spearman's rank correlation coefficient between the sample size (n) of progenies and t m was significant and higher than zero (0.678), indicating that lower t m values are a product of progenies with small sample sizes. The lowest estimates (0.93) were detected in two NP progenies (16 and 28) with only four genotyped offspring, indicating that this result is an artefact of sample size. Furthermore, the other indices were also generally lower in these two progenies. All other estimated indices were variable among progenies (1 − t s: NP = 0.03–0.20, PT = 0.04–0.10; r p: NP = 0.06–0.40, PT = 0.04–0.11; N ep : NP = 2.5–17.9, PT = 9.3–25.6; Θ: NP = 0.132–0.235, PT = 0.130–0.139; N e: NP = 1.73–3.29, PT = 3.12–3.40). In 22 progenies of NP (73.3%) and 28 of PT (93.3%), the F m was lower than F o, suggesting selection against inbred individuals between seed/young trees and adult stages. In NP, for 28 progenies (93.3%) the r pw was higher than r pa, showing that the effective number of pollen donors was generally lower within than among fruits. The sample size within progenies (n) was significantly negatively correlated with Θ and positively correlated with N e (Table 3). The N ep was significantly positively correlated with H o and N e and significantly negatively correlated with F o and Θ. The H o was significantly negatively correlated with F o and N e.
*P < 0.05; **P < 0.01.
Discussion
The indiscriminate exploitation of natural forest resources around the world has led to fragmentation, creating unbalanced ecosystems and the possible extinction of small populations. As such, understanding the mating system and pollen dispersal patterns of a species, along with its genetic diversity and population structure, become essential for their conservation. Our study compares offspring from two distinct G. americana PTs, one at 6 months of age (NP) and one at 14 years of age (PT), in order to understand the breeding dynamics and propose effective conservation strategies. However, it is important to highlight that our results for mating system in NP represent the effective mating pattern, as it is based on the analysis of germinated seeds, and in PT the results represent the realized mating pattern, as they are based on juveniles at 14 years of age. Thus, in both cases, our results may differ from the mating patterns that occurred during the respective reproductive events that gave origin to our samples, as some stochastic processes may have occurred, such as random mortality, seed predation and disease, as well as some deterministic processes, including selection against inbred individuals between the zygote stage and when sampling occurred.
Genetic diversity and biparental inbreeding
The genetic differentiation between populations for the offspring pollen gene pool ($G_{{\rm st}}^{\rm ^{\prime}} $ = 0.52) showed that the majority of genetic diversity is distributed between populations and not within populations. This result can be attributed to the distance between the populations (1900 km) and the observed pollen dispersal patterns of isolation by distance, which was reported previously for other G. americana populations (Rabbani et al., Reference Rabbani, Silva-Mann and Ferreira2012; Manoel et al., Reference Manoel, Freitas, Furlani Junior, Alves, Moraes and Sebbenn2017), indicating that genetic differentiation among populations increases with increased geographic distance.
The populations studied herein present high levels of genetic diversity in comparison with the results from a previous study using the same seven loci. Manoel et al. (Reference Manoel, Freitas, Furlani Junior, Alves, Moraes and Sebbenn2017) studying adults, juveniles and open-pollinated seeds from two reproductive events found a lower mean observed (H o = 0.38–0.54) and expected heterozygosity (H e = 0.55–0.77). Another study on three G. americana populations in the Amazon forest using six simple sequence repeat loci (Ruzza et al., Reference Ruzza, Rossi, Bispo, Tiago, Cochev, Rossi and Fernandes2018), three of which were the same as those used herein, also reported lower H o (0.13–0.21) and H e (0.48–0.57) than our study. The differences between studies may be due to the sample size and natural history of the studied populations (bottleneck and founder effects). However, the high levels of heterozygosity detected for both populations assessed herein are significant for in situ conservation of the NP population and ex situ conservation of both populations, as a large number of new genotypic recombinations may occur through mating among individuals of NP and also among individuals of PT. Furthermore, this result suggests that both studied populations have the potential to be used in future breeding programmes and seed collection efforts aimed at the recovery of degraded areas.
Mating system
The outcrossing rate (t m) estimated for NP and PT were similar to that expected for dioecious species (t m = 1.0), where there is no possibility of self-fertilization as all individuals originate from outcrossing. At the progeny level, the estimates of t m were high (0.93–0.99), but lower than unity in some cases. Estimates of t m lower than unity for the species (0.87 and 0.95) were also reported previously and were attributed to mating among related trees, since reproduction in dioecious species occurs only by outcrossing (Manoel et al., Reference Manoel, Freitas, Furlani Junior, Alves, Moraes and Sebbenn2015a). We observed that the t m values at the progeny level were positively correlated with the sample size, where progenies with a small sample size presented the lowest values in NP, indicating that results for these progenies must be interpreted with caution. If we examine the multilocus genotypes of the progenies with the lowest estimates of crossbreeding, we can see that all descendants of all progenies had at least one different allele from the mother tree in at least one locus, indicating that they all originated from outcrossing. Furthermore, this suggests that the low estimates of t m are an artefact of the numerical method of EM to solve the probability equations. To prove this hypothesis, we re-estimated the t m using the Newton–Raphson numerical method and found that none of the estimates were significantly different from unity.
However, mating was not random due to some mating among relatives (1 − t s) and correlated mating (r p), especially in NP (1 − t s = 0.15, r p = 0.34). Mating among relatives is indicative of intrapopulation spatial genetic structure (SGS), which typically results from seed dispersal near to the seed trees (Hardy et al., Reference Hardy, Maggia, Bandou, Breyne, Henri Caron, Chevallier, Doliguez, Dutech, Kremer, Latouche-Halle, Troispoux, Veron and Degen2006; Tarazi et al., Reference Tarazi, Sebbenn, Kageyama and Vencovsky2013; Degen and Sebbenn, Reference Degen, Sebbenn, Pancel and Kölh2014). The presence of SGS in the population used to establish the PT is likely similar to our results for NP. Correlated mating (r p) can be attributed to the behaviour of pollen dispersers, in this case bees, visiting and fertilizing many flowers of the same female tree with pollen from a single or small number of previously visited male trees, as indicated by the effective number of pollen donors (N ep: NP = 2.9; PT = 13). This pollinator behaviour also explains the lower levels of correlated mating among (r pa) than within (r pw) fruits and, consequently, the lower number of effective pollen donors within (N epw = 2.5) than among fruits (N epa = 3.8). Similarly, Manoel et al. (Reference Manoel, Freitas, Furlani Junior, Alves, Moraes and Sebbenn2015a) investigated annual variation in the mating system of a G. americana population and reported lower r pa (0.364–0.714) than r pw (0.553–0.842) and a lower N epw (1.2–1.8) than N epa (1.4–2.7). Furthermore, the same pattern of lower r pa than r pw has been identified for other multi-seed-fruit and insect- and animal-pollinated tree species (Fuchs et al., Reference Fuchs, Lobo and Quesada2003; Tamaki et al., Reference Tamaki, Setsuko and Tomaru2009; Silva et al., Reference Silva, Albuquerque, Ervedosa, Mota, Figueira and Sebbenn2011; Feres et al., Reference Feres, Sebbenn, Guidugli, Mestriner, Moraes and Alzate-Marin2012; Wadt et al., Reference Wadt, Baldoni, Silva, Campos, Martins, Azevedo, Mata, Botin, Hoogerheide, Tonini and Sebbenn2015; Baldoni et al., Reference Baldoni, Wadt, Campos, Silva, Azevedo, Mata, Botin, Mendes, Tardin, Tonini, Hoogerheide and Sebbenn2017; Giustina et al., Reference Giustina, Baldoni, Tonini, Azevedo, Neves, Tardin and Sebbenn2018; Moraes et al., Reference Moraes, Kubota, Rossini, Marino, Freitas, Moraes, Silva, Cambuim and Sebbenn2018), indicating that this may be standard pollinator behaviour for such tree species.
Nevertheless, this fertilization pattern may also be the result of a limited number of flowering neighbour trees due to individual variation in flowering phenology and/or low population density as a result of natural species distribution, logging or forest fragmentation (Moraes et al., Reference Moraes, Kubota, Rossini, Marino, Freitas, Moraes, Silva, Cambuim and Sebbenn2018). Although the results indicate long-distance pollen dispersal in the NP population (mean of 179 m), the pattern was isolation by distance, which indicates more effective pollination events occurring at short distances and less chance of pollen dispersal over long distances. This pattern suggests that the observed mating among relatives (1 − t s) and correlated mating (r p) in NP is mainly the product of near-neighbour mating between related and unrelated male and female trees. G. americana is pollinated by insects (bees) capable of flying long distances (>5 km) and thus long-distance pollen dispersal (Steffan-Dewenter and Kuhn, Reference Steffan-Dewenter and Kuhn2003; Abou-Shaara, Reference Abou-Shaara2014). The result of short-distance pollen dispersal compared to potential long-distance flight by bees could also be related to foraging preferences of some species (De Vere et al., Reference De Vere, Jones, Gilmore, Moscrop, Lowe, Smith, Hegarty, Creer and Ford2017). Similarly, pollen dispersal following a pattern of isolation by distance has been found for other G. americana populations (mean of 166–313 m; Manoel et al., Reference Manoel, Freitas, Furlani Junior, Alves, Moraes and Sebbenn2017) and in studies on other tree species (Lander et al., Reference Lander, Boshier and Harris2010; Braga and Collevatti, Reference Braga and Collevatti2011; Fuchs and Hamrick, Reference Fuchs and Hamrick2011; Ismail et al., Reference Ismail, Ghazoul, Ravikanth, Uma Shaanker, Kushalappa and Kettle2012; Manoel et al., Reference Manoel, Alves, Dourado, Gaino, Freitas, Moraes and Sebbenn2012; Baldauf et al., Reference Baldauf, Ciampi-Guillardi, Aguirra, Corrêa, Santos, Souza and Sebbenn2014; Breed et al., Reference Breed, Ottewell, Gardner, Marklund, Dormontt and Lowe2015; Reim et al., Reference Reim, Proft, Heinz and Lochschmidt2015; Tambarussi et al., Reference Tambarussi, Boshier, Vencovsky, Freitas and Sebbenn2015; Bezemer et al., Reference Bezemer, Krauss, Phillips, Roberts and Hopper2016; Baldoni et al., Reference Baldoni, Wadt, Campos, Silva, Azevedo, Mata, Botin, Mendes, Tardin, Tonini, Hoogerheide and Sebbenn2017; Spoladore et al., Reference Spoladore, Mansano, Lemes, Freitas and Sebbenn2017; Moraes et al., Reference Moraes, Kubota, Rossini, Marino, Freitas, Moraes, Silva, Cambuim and Sebbenn2018; Souza et al., Reference Souza, Kubota, Tambarussi, Freitas, Moraes, Silva, Cambuim and Sebbenn2018; Lander et al., Reference Lander, Harris, Cremona and Boshier2019; Potascheff et al., Reference Potascheff, Oddou-Muratorio, Klein, Figueira, Bressan, Oliveira, Lander and Sebbenn2019; Solís-Hernández and Fuchs, Reference Solís-Hernández and Fuchs2019). This is further supported by the presence of SGS and mating among related trees (1 − t s) in other G. americana populations (Sebbenn et al., Reference Sebbenn, Kageyama and Vencovsky1998; Manoel et al., Reference Manoel, Freitas, Furlani Junior, Alves, Moraes and Sebbenn2017) and can explain the mating among related trees detected within progenies of PT.
The effective number of pollen donors was four times lower in NP than in PT ($N_{\scriptstyle {\rm ep} \atop \scriptstyle } $: NP = 2.9; PT = 13). The difference between populations may be explained by the fact that NP represents the effective mating measured in offspring at 6 months of age and PT represents realized mating at 14 years of age. As previously mentioned, results may differ due to the occurrence of stochastic (random mortality, seed predation and disease) and deterministic processes (selection against inbred individuals between the offspring and adult stages, and nursery selection of more vigorous offspring to establish the PT population).
As noted above, the significant differences for 1 − t s and r p between NP and PT can be attributed to the fact that our results for mating system in NP represent the effective mating pattern, measured in germinated seeds, while the result for PT represent the realized mating pattern, measured in 14-year-old trees. Thus, our results may differ from the actual mating patterns that occurred during the reproductive events due to stochastic and deterministic process. However, the analysis of the PT population included the oldest and only surviving trees 14 years after establishing the PT; therefore, these trees were likely subject to stronger stochastic and deterministic processes than the NP population, particularly inbreeding depression which can cause higher mortality of inbred individuals originated from mating among related trees. The observed lower levels of biparental inbreeding in mothers (F m) than offspring (F o) is a strong indicator of selection against inbred individuals between seed/young trees and adult stages. Furthermore, the observed variation among seed trees within and among populations for mating among related trees and correlated mating can also be attributed to the mentioned stochastic and deterministic processes, as well as the site-specific individual variation in flowering phenology, limited number of flowering neighbour trees or low population density.
Conclusions
PTs for ex situ conservation enable the maintenance of high levels of genetic diversity of a species, since they can preserve alleles from populations that no longer exist, despite sampling a small number of individuals from the original populations. The combined strategy of seed collection from both NP and PT populations is also valuable as it can support increases in genetic diversity levels. Commercial use of wood and fruits must be done with caution to ensure that a balanced sex ratio is preserved in the region. Our results show clear mating system differences between NP (effective) and PT (realized) offspring. Mating was not random in the populations, especially the effective mating (NP), due to the high rates of mating among related trees and correlated mating, resulting in a low effective size (N e). The high frequency of biparental inbred individuals (half-sibs and full-sibs) in the NP population is expected to decrease as the trees transition into adult stages due to inbreeding depression, which will increase the N e and heterozygosity and decrease the genetic differences between the trials. Thus, the effective population size of both populations may reach the minimum level (N e ⩾ 70) for short-term ex situ conservation. As the species is dioecious and inbreeding can only occur through mating among relatives, the trial may be suitable for seed collection for environmental reforestation and breeding if submitted to selective logging, maintaining only male or female trees within each plot to avoid the occurrence of mating among related trees. Furthermore, to maximize the effective population size (N e) of the parental population for seed production, the same number of male (N m) and female (N f) trees of each progeny should be maintained. If a single male and a single female are selected per progeny, 30 males and 30 females will remain in the resulting seedling seed orchard after selective logging, which will result in an effective population size [N e = (4N mN f)/(N m + N f) = (4 × 30 × 30)/(30 + 30) = 60] similar to that indicated for short-term ex situ conservation. However, it is important to note that the gene pool of the populations will be dispersed throughout the reforestation areas, thus creating new populations from the source material. For ex situ conservation and seed collection for reforestation of the NP, our results from the NP offspring population indicate that seeds must be collected from at least 56 trees to obtain samples with an N e of 150.
Supplementary material
The supplementary material for this article can be found at https://doi.org/10.1017/S1479262121000022.
Acknowledgements
The authors are thankful for Sao Paulo Research Foundation (FAPESP; grant 13/19444-6 to MFVM). Research fellowships were provided by the National Council for Scientific and Technological Development (CNPq) for Alexandre M. Sebbenn and Mario L.T. Moraes. We also thank Dr Evelyn Nimmo for editing the English of the manuscript.