I. INTRODUCTION
In crystallography literature, whenever powder diffraction data is of interest, indexing is the first step that a scientist should be involved with. That is, since the 1960s, many researchers have been interested in trying to find methods of calculation, often analytical, to get efficiently through this step.
Among these methods, three have emerged and have been frequently used, namely: ITO (Visser, Reference Visser1969), TREOR (Werner et al., Reference Werner, Eriksson and Westdahl1985) and DICVOL91 (Boultif and Louër, Reference Boultif and Louër1991). Nevertheless, these methods have shown their limits over time in meeting requirements which have not been foreseen or have needed more resources and efforts. For instance, we may cite the zero-point error, the absolute angular error of lines' positions, the presence of impurities, and so on. Therefore, improved versions are found in the literature. That is, in DICVOL04 (Boultif and Louër, Reference Boultif and Louër2004; Louër and Boultif, Reference Louër and Boultif2006), the authors have added the correction of the zero-point error introduced in DICVOL91. In DICVOL06 (Louër and Boultif, Reference Louër and Boultif2007), they have introduced a new strategy to limit the risk to miss a solution as in DICVOL04. On the other hand, in N-TREOR (Altomare et al., Reference Altomare, Giacovazzo, Guagliardi, Moliterni, Rizzi and Werner2000), the authors have improved TREOR90, by making it more exhaustive and capable of refining the selected unit cell automatically. Finally, in N-TREOR09 (Altomare et al., Reference Altomare, Campi, Cuocci, Eriksson, Giacovazzo, Moliterni, Rizzi and Werner2009), the authors have introduced a new figure of merit.
Moreover, the growth of the power of computing machines and the resolution and accuracy of recording devices, have yielded the emergence of other methods; all aiming to increase the probability of obtaining good indexing. To this end, GAIN (Kariuki et al., Reference Kariuki, Belmonte, McMahon, Johnston, Harris and Nelmes1999) used the genetic algorithm. It exploits the Le Bail method (Le Bail et al., Reference Le Bail, Duroy and Fourquet1988) to fit the line profiles and X-Cell (Neumann, Reference Neumann2003) explores the successive dichotomies combined with the search of zero-point error by allowing unindexed lines. Furthermore, McMAILLE (Le Bail, Reference Le Bail2004) makes use of the Monte–Carlo process and CRYSFIRE suite (Shirley, Reference Shirley1999) employs several programs in a complementary way, among them ITO, DICVOL and TREOR. At this stage, it is worth noting that these methods remain insufficient since the solution is not always unique, the figure of merit, exploited for identifying the correct cell, can be inefficient and it is up to the user to select the most suitable one. The number of solutions depends mainly on the quality of the data. It is clear that worse the data quality is, more solutions we have and more likely the user gets confused.
Unlike the existing methods that are computationally complex, here we present a method to solve the problem of indexing in terms of a simple geometrical approach. The multiple cases that we treated numerically, showed the efficiency of the proposed approach and its capability of resolving difficulties associated with the zero-point error, the absolute angular error and so on. Note that the proposed method treats all symmetries, except the rhombohedral one, which is included in the hexagonal symmetry.
The paper is organized as follows. In Sections II, III and IV, we present the theory of the considered approach, describe the implementation of the proposed method and treat the determination of the zero-point errors, respectively. In Section V, we discuss the problem inherent to impurities. In Section VI, we describe the proposed algorithm. Then, in Sections VII and VIII, we treat numerical examples. Finally, in Section IX, we end the paper with some concluding remarks.
II. THE METHOD
A. Principles
To clarify the basic idea used in this work, let us consider three distances p 1, p 2 and p 3. A triangle whose sides are p 1, p 2 and p 3 may then be constructed (see Figure 1).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160712073306-19854-mediumThumb-S088571561200070X_fig1g.jpg?pub-status=live)
Figure 1. Description of a face of the reciprocal cell and the angle β* at the origin.
Let (Oxy) be an orthonormal coordinate system in which p 1 is along the Ox axis. Therefore, point E, the intersection of p 2 and p 3, has coordinates (x, y) such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160309074310039-0880:S088571561200070X_eqn1.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160309074310039-0880:S088571561200070X_eqn2.gif?pub-status=live)
Note that x may have negative values.
On the other hand, it is well known that an angle between two sides may be easily derived geometrically. That is, if β* is the angle between Ox and the line segment OE, then
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160309074310039-0880:S088571561200070X_eqn3.gif?pub-status=live)
For instance, if p 1, p 2 and p 3 are the vectors in the reciprocal lattice corresponding, respectively, to the reflections of indices (100), (001) and (-101), then the triangle of Figure 1, would describe a face of the reciprocal cell and β* the angle at the origin.
To construct the other faces of the cell, the above construction process is repeated for all reciprocal distances, allowing cell determination.
Note that the vector length p i = 1/d hkl, where d hkl is the interplanar spacing.
B. Determination of powder diagram's new lines
We start by exploiting the experimental data at hand, noted set (A), which is used to build sets (B) and (C). These two sets may then contain values that could be the lengths of vectors b* and c*, respectively, and then, while carrying out the building of all possible faces, we determine the missing data values which could be necessary for the construction of the unit cell. In this way, we may build more triangles and choose those whose point E has abscissa x lying in the interval [−3p 1/2, p 1/2]. This interval has been chosen on the basis of the minimum angular error criterion. That is, compared with the reflections lying in the high 2θ region of the powder pattern, the distances that correspond to those that appear in the first part and the middle of the powder diagram, usually, satisfy this criterion. Note that this interval is considered to be sufficiently large such that the determination of the missing values is made possible.
We recall that in high symmetries, the reciprocal unit cell has two rectangular or square faces, whereas the third one could be neither rectangular nor square. Let a*, b* and c* be the three vectors basis of the reciprocal unit cell. The rest of the paper assumes that the length differences of the basis vectors are not too large. Considering the vector a* along Ox, we will first be interested in the construction of the cell's rectangular faces containing a vector b* that is perpendicular to a*. In the following, we assume that the distance p 1 of set (A), could be a multiple of the length of a*.
Then, we begin by looking for triangles such that abscissa x of E′ is equal to 0 or –p 1, Figure 2. If x =0, then the length of OE' could be a multiple of the length of b*. However, if x = –p 1, the ordinate y of E′ could be a multiple of the length of b*. In this manner, we can construct set (B) from values of length of OE′ or from those of the ordinate y of E′.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160712073306-44438-mediumThumb-S088571561200070X_fig2g.jpg?pub-status=live)
Figure 2. Description of a face of the reciprocal cell and the angle γ* = 90° at the origin.
To construct set (C), let us return to Figure 1. We can see that, among the values of angle β*, the one that is the closest to 90° allows the nodes (001) and (00 − 1) to remain in the area bounded by the straight lines x = p 1/2 and x = −p 1/2 (i.e., x ∈ [−p 1/2, + p 1/2]). The identification of these nodes' positions yields to the search of triangles for which point E is situated between these two lines. In this case, we admit that the side OE is a multiple of the length of c*. The periodic arrangement of the nodes in the reciprocal lattice shows that the other nodes of the same lines, i.e., (h01) and (h0−1), along Ox, are distant hp 1, from nodes (001) and (00 − 1), respectively; all within the area bounded by the straight lines x = hp 1 + p 1/2and x = hp 1 − p 1/2. In this way, the knowledge of one's position, determines the position of the other and vice versa. The construction of triangle T2, whose distances are assumed to correspond to the reflections of indices (100), (−101) and (−201), allows us to know the position of node (−101), which in turn gives the position of node (001). This follows from the fact that if (x, y) are the coordinates of node (−101), then those nodes (001) are (x + p 1, y). Then, when x ∈ [−3p1 /2, −p1/2], i.e., h = −1, the side of the reciprocal lattice that could be a multiple of the length of c*, is obtained by the following:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160309074310039-0880:S088571561200070X_eqn4.gif?pub-status=live)
Note that the case x = −p 1/2 is included in the first interval (i.e., [ −p1 /2, + p1 /2]) and therefore excluded from the second because, according to Eq. (4), if x = −p 1/2 then $p^{\prime}=\sqrt {\lpar p_1 /2\rpar ^2+y^2 }={\rm OE}$, which again satisfies the hypothesis of the first interval (i.e., OE is a multiple of the length of c*).
Finally, the selected values are gathered in set (C). In summary, for each value p 1 of set (A), we have built two sets (B) and (C). We should specify that, in the following, in addition to set (A), sets (B) and (C) are built for the high symmetries. However, only set (C) is necessary for triclinic symmetry. Regardless of the original value of p 1 (multiples of a* or not), the procedure is repeated for p 1 divided by 2 and 3.
C. Cell construction
The construction of sets (B) and (C) was done upon the basis that there are two types of symmetries in the crystal system; namely, the one in which the cells have two rectangular or square faces and the other in which the cells possess neither a rectangular nor a square face.
In the first type of symmetry, we determine the triangle that defines the first face. For this, we use two elements p 1 and p 2 of set (A) and an element p 3′ of set (C). This triangle has p 1 as a basis and p 2 and p 3′ as sides that intersect at point E. β* is the angle at the origin. It is obtained from Eqs (1) and (3).
As in this type of symmetry, α* and γ* are equal to 90°, the cell construction is completed once we place perpendicularly to this face, an element p 4″ of set (B), which cuts off p 1 and p 3′ at point O.
In addition, this construction has also been realized using doubles and halves values of p 1 and p 3′ in order to increase the probability of obtaining the cell of concern.
Since the nature of the elements (primitives or multiples) of sets (A), (B) and (C) is unknown, the unit cell is obtained by further dividing by n (n = 1, 2, 3 and 4), p 1, p 4″ and p 3′.
Finally, to fully exploit the experimental data and thus to have more chance of finding the solution, the above procedure is repeated by assigning successively to p 1, the first ten elements of set (A) and to p 2, p 3′ and p 4″, all the elements of sets (A), (C) and (B), respectively.
D. Extension to triclinic cases
In the triclinic symmetry, the three angles are different from 90°. Then, we must determine the three faces of the structure. To do this, we first start by building the first two triangles. Knowing that these triangles must have a common side, we take three elements p 1, p 2 and p 3 of set (A), which we combine with two elements p 2′ and p 5′ of set (C) to obtain two triangles. Note that p 1 intersects with p 2′ and p 5′ at O and it is chosen to be a common side of the two triangles.
Using these triangles, we identify five parameters of the reciprocal cell; namely the three sides p 1, p 2′ and p 5′ and two angles α* and γ*, which are derived similarly to β* by equations such as Eqs (1) and (3). Specifically, we use for α*, in Eq. (1), p 2′ and p 2 in place of p 2 and p 3, respectively, and in Eq. (3), p 2′ in place of p 2, and for γ*, in Eq. (1), p 5′ and p 3 in place of p 2 and p 3, respectively, and in Eq. (3), p 5′ in place of p 2.
Note that, these two triangles in the trihedron could be at the limit, either superposed; i.e., the angle between them is equal to 0°, or completely spread out, i.e., the angle between them is equal to 180°. As shown in Figure 3, according to the above limits, we deduce that the smallest angle between p 2′ and p 5′ is equal to $\vert \alpha ^{\ast} - \gamma ^{\ast} \vert $ (position 1) and the greatest angle is equal to (α* + γ*) (position 2), where |·| designates the absolute value.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160712073306-16839-mediumThumb-S088571561200070X_fig3g.jpg?pub-status=live)
Figure 3. Illustration of positions (1) and (2) as extreme limits of p 2′.
The trihedron is defined completely only when we build the third triangle. This is achieved by using sides p 2′, p 5′ and an element p 4 of set (A) and taking into account that the angle β* at O, between p 2′ and p 5′, must satisfy
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160309074310039-0880:S088571561200070X_eqn5.gif?pub-status=live)
To ensure that the reciprocal cell is elementary, the linear parameters are further divided by a factor n′ (n′ = 1, 2).
Note that by assigning all values of sets (A) and (C) to p 2, p 3 and p 4, and p 2′ and p 5′, respectively, we may construct several trihedra.
III. IMPLEMENTATION OF THE PROPOSED METHOD
To estimate the effectiveness of this method, a general program, named ‘IGC' (indexation by geometrical construction) was written in FORTRAN. This program implements the techniques introduced above. It begins by building sets (B) and (C) [or only set (C), in the triclinic case]. Then, it combines their elements with the experimental values of set (A) to construct a model of the cell which can be a solution to the problem of indexation. When the data are affected by the zero-point error, they are corrected. Therefore, more accurate unit cell parameters are determined (see Section IV). Moreover, the quality factors M20 (de Wolff, Reference De Wolff1968) and F20 (Smith and Snyder, Reference Smith and Snyder1979) are calculated to appreciate the validity of the proposed solution.
Many examples (up to 150) have been treated with this program. The results, similar to those obtained by other programs (e.g., Louër and Vargas, Reference Louër and Vargas1982) showed that the small volume solution, provided by high quality data, is usually the correct solution. It comes out that the strategy of this program is the same as the one adopted by the majority of the existing programs. This requires a volume cut into slices whose volume is 400 Å3. The unit cells whose volumes are less than a slice section belong to the slice in question. Such a strategy can speed up the search process until one (or more) solution(s) of smaller volume(s) is (are) found.
This time limitation is more perceptible when we deal with the case of high symmetry systems. Going from the lowest volume to the highest volume, it is known that solutions usually appear from the lowest symmetry to the highest symmetry. To have a wide field of search for solutions in the monoclinic system, the angle β is set up to 125°. Then, if the solution happens to be in a slice, the search of other solutions in larger volumes will continue simply by attributing to β the values 90° or γ = 120° in the case of a hexagonal symmetry. This would enable solutions in the other symmetries to appear.
In practice, the first 20 lines are sufficient to index a powder diagram. It is recommended that the considered phase be pure and the absolute angular error of line positions be less than 0.03° in 2θ.
The tests were performed on examples taken from the Personal Computer Powder Diffraction File (PCPDF) using a 3 GHz processor. The success rate was 96% and the execution time for most of these examples did not exceed 3 min (except the triclinic symmetry that required more time). The treated examples were cells with linear parameters less than 35 Å and volumes not exceeding 4500 Å3.
IV. ZERO-POINT ERROR
The proposed method also treats problems that may arise when the lines' positions of the powder diffraction diagram are not zero-point error free. This error appears clearly in the harmonic lines series. It is perceptible through a small displacement, in the same direction, of these lines.
In this work and, in a similar manner, by the reflection-pair method (Dong et al., Reference Dong, Wu and Chen1999), we determine the zero-point error by looking for all pairs of the experimental data which verify the following relation:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160309074310039-0880:S088571561200070X_eqn6.gif?pub-status=live)
where p m and p n, elements of set (A), are lines corresponding to the diffraction angles θ m and θ n, respectively. The multipliers m i and n i are integers.
Equation (6) can be set exactly equal to ‘1’ if we simply introduce a real value Zp, called the zero-point error, to yield
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160309074310039-0880:S088571561200070X_eqn7.gif?pub-status=live)
Following this reasoning, several values of Zp are obtained. However, as in 98% of the treated cases, the correction of zero-point error on θ has not exceeded 0.05°, i.e., |Zp| ≤ 0.05°. We kept only those that were less than this limit. Note that since relation (6) holds for p m = p n, then a zero value of the zero-point error is also expected.
V. TREATMENT OF IMPURITIES
To have a reliable indexing, we consider that all the lines of the powder diffraction pattern are important and then hope that the measurements of lines positions are correct, the compound is pure, and that the phase is unique. Otherwise, in order to get around any of these problems, we allow Ni lines to be non-indexed.
In most indexed examples, Ni ≤ 2. However, in some cases, the mixture of phases and the presence of impurities, cause the appearance of a large number of fake lines. Therefore, Ni may be greater than 2. To this effect, many solutions could be obtained. The risk of finding them in a slice of volume less than the volume which contains the correct solution is considerable.
VI. PROPOSED ALGORITHM
As shown in Figure 4, the proposed algorithm works as follows. It starts reading the experimental data (N values) given as distances d hkl or diffraction angles 2θ. Using these data and as explained earlier (Section II.B), it builds sets (B) and (C). Then, it constructs the cell that could be a solution. That is, using this cell, the algorithm calculates the theoretical diffraction angles that are to be compared with the experimental ones (Test (1))
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160309074310039-0880:S088571561200070X_eqn8.gif?pub-status=live)
where Δ(2θ)Global = Δ(2θ) + 2Zp and Δ(2θ) is the absolute angular error. Test (1) assumes that Δ(2θ )Global > Δ(2θ). That is, at this stage, we used only the upper bound value of the zero-point error (Zp = 0.05°). Note that this test has considerably reduced the computation time. Once Test (1) is satisfied, the elements of (A), (B) and (C) which contributed to the construction of the candidate cell are corrected by adding Zp [calculated from Eq. (7)] to their respective diffraction angles. As long as values of Zp are not used up, new cell parameters and new diffraction angles are calculated and compared with the experimental angles (Test (2))
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160309074310039-0880:S088571561200070X_eqn9.gif?pub-status=live)
Once Test (2) is satisfied, Test (3) is performed according to the well-known M20 and F20 figures of merit. Now, if Test (3) is satisfied, which means that M20 and F20 are, in general, greater than 10, then there exist at least one solution. This procedure is repeated until all the elements of sets (A), (B) and (C) are used up. Note that, if there is no solution in the high symmetries, the algorithm looks for solutions in triclinic symmetry. Finally, it ends either when there is no solution in both high symmetries and the triclinic one or when there is at least one solution in either one of them.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160712073306-65665-mediumThumb-S088571561200070X_fig4g.jpg?pub-status=live)
Figure 4. Flow chart of the proposed algorithm.
VII. EXPERIMENTAL RESULTS
In order to show the efficiency of the proposed algorithm, this section is devoted to examples of different symmetries that may occur in the construction of a powder pattern cell. That is, most of the treated cases were chosen according to the difficulties that may be encountered in the process of diagram indexation such as the presence of zero-point error, the absence of lines of indices (0k0) or (00 l) or both, the absolute angular error of lines' positions, and the cell volume. In the latter case, when we have a big cell volume, the powder pattern contains, generally, several lines, therefore, the space between these lines is very narrow, which makes the zero-point error and the absolute angular error of lines' positions very annoying. These conditions often permit the solutions of low volume to appear, and to nearly miss the solution that should appear in a slice of higher volume.
The aim of these examples is to find solutions that are closest, in terms of cell parameters and volume, to those given by the Joint Committee on Powder Diffraction Standards (JCPDS) file. Among these examples, there is one the processing of which has revealed all the potentials of the proposed procedure. The example is C26H31GeO5P (Diethyl 1 – (triphenylgermanyl)-3-Keto-2-oxapentylphophinate). It is known that this compound should be indexed in monoclinic symmetry. Using λ = 1.5406 Å, the powder diffraction did not show any reflections of indices (0k0). The program has to look for the reflections of indices (0k0) and those of indices (00 l) before starting any cell construction. Among several theoretical values of p 4″ and p 3′, calculated in order to be associated with the experimental line p 1 (1/p 1 = 13.929 Å) indexed by (100), two values have led to the cell that constitutes a good approximation of the desired solution. The first value was 1/p 3′ = 5.7084 Å which is less than half of the experimental value (11.475 Å) indexed by (002), and the second was 1/p 4″ = 4.057 67 Å whose experimental line is totally absent. The unit cell parameters were a c = 14.248 Å, b c = 8.101 Å, c c = 23.444 Å, β c= 103.725° and V c= 2628.50 Å3. Note that the value 5.7084 has been retained in cell construction while the value 11.475 has not.
Using this cell, 20 lines were indexed over the 20 initially used lines. The tolerated absolute angular error ∆(2θ) = 0.045°. The uniqueness of the solution and the values of the figures of merit (M20 = 7 and F20 = 18 (0.017, 63)) show that the solution is acceptable. Note the zero-point error Zp = 0.020 23 degrees in θ.
The solution found in the JCPDS file is given by a = 14.30 Å, b = 8.115 Å, c = 23.46 Å, β = 103.68° and V = 2646.55 Å3. Although small, the disparities between these parameters and ours are, generally, owing to the absolute angular error of the lines' positions. Note that these disparities have an effect on the theoretical reflection lines calculated using our unit cell parameters. This could favor, for certain experimental lines, one indexing over another. For instance, this appears in lines 4.0153 and 3.9728 indexed, respectively, by (310) and (302), instead of being indexed, respectively, by (213) and (015), as given in the JCPDS file.
For comparison purposes, the above example has also been treated by programs, which are considered as benchmarks in the crystallography literature, such as DICVOL06 and N-TREOR. Their results are reported in Table I. Although the result given by our approach is closer to the true solution in terms of volume and cell parameters, we agree that, if we confine ourselves to the given example and not others, the results given by DICVOL06 and N-TREOR are better if we look at their figures of merit, which are higher compared with those obtained by our approach. However, if we compare the time and effort that have been used so far to set these programs, we may conclude that our approach needs to be improved and affined to attain such results.
TABLE I. Unit cell parameters of C26H31GeO5P, obtained by applying, DICVOL06, N-TREOR and our method.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160712073306-50663-mediumThumb-S088571561200070X_tab1.jpg?pub-status=live)
We may observe that in this approach a lot of experimental data are directly or indirectly involved in the construction of the cell. Thus, the effectiveness of the proposed method is proven to be closely related to the accuracy of the measurements.
To extend the study to more interesting examples, Table II lists some of the results obtained from other compounds found in the JCPDS files, which could be considered as a sample of all the examples we have treated so far. The choices were made on the basis of the difficulties encountered with diagram indexation. Referring to Table II, the compounds were indexed; some completely and others partially, in a maximum time of 60 s. The number of non-indexed lines Ni ≤ 2. The used lines are 20, the absolute angular error Δ(2θ) = 0.045° or 0.06°, β (max) = 125° and the zero-point error |Zp| ≤ 0.05°. The symbols (c), (q), (H), (O) and (M) designate cubic, tetragonal, hexagonal, orthorhombic and monoclinic symmetries, respectively.
TABLE II. Examples of unit cell parameters obtained by applying the proposed method.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160712073306-21302-mediumThumb-S088571561200070X_tab2.jpg?pub-status=live)
In Table II, we show the best results according to their M20 and F20. These solutions are not always unique. Although, they do not often appear, at the head of the list of the results, they are usually among the top five. Note that we did not adopt any criterion to rank the solutions.
For each compound of monoclinic symmetry, the number of solutions that appear is less than six. These solutions are not very different neither in dimensions of the cell parameters nor in values of merit factors M20 and F20. The gap between their merit factors is less than 3, except in the case of compound C4BaO9Ti-4H2O, where the difference is considerable between the merit factors of the cell (a = 14.05 Å, b = 13.81 Å, c = 13.39 Å, β = 88.45°, V = 2598.60 Å3, M20 = 48, F20 = 160 (0.003,32)), considered to be the best and which appears in the second place in the list of solutions and the solution that has the lowest values of M20 and F20 (a = 14.1 Å, b = 13.86 Å, c = 13.44 Å, β = 91.69°, V = 2624.62 Å3, M20 = 17, F20 = 57 (0,011, 33)) and which appears in the first position. This shows that the program has used different lines to determine the first and second solutions.
As, going from the lowest volume to the highest volume, the solutions usually appear from the lowest symmetry to the highest symmetry. Consequently, the strategy followed in our program is to let the solutions appear in their natural order. Therefore, the solutions of compounds of high symmetries (except the monoclinic symmetries), usually appear alone in their volume section; making them easy to detect.
Finally, we should point out that the execution time is acceptable. It could be further shortened if we reduce the number of experimental values of p 1. In effect, to increase the probability of having one of the lines (h00), the first ten values of set (A) are assigned to p 1. Knowing that, in the high symmetry case, one of the (h00) lines may be present in the first five experimental lines, then, except for the monoclinic and triclinic symmetries, the execution time could be divided by 2.
VIII. BETHANECHOL CHLORIDE BENCHMARKS
To test the robustness and the limits of our program, we investigated more interesting operational cases through the use of the ‘Bethanechol Chloride (C7H17ClN2O2) benchmarks’ (Bergmann et al., Reference Bergmann, Le Bail, Shirley and Zlokazov2004). The two entries ICDD PDF 43-1748 and 46-1964 were the subject of the following tests. These tests are presented in the form of 10 different lists of which each one contains 20 lines. They are listed as follows:
A. Indexing the raw data. A(1) for ICDD PDF entry 43-1748 and A(2) for 46-1964.
B. Indexing the data with I ≥ 5% (I/I max). B(1) and B(2) as above.
C. Indexing data corrected for zeroshift. C(1) and C(2) as above.
D. Indexing data corrected for zeroshift and having I ≥ 5% (I/I max). D(1), D(2) as above.
To these 2 × 4 tests, are added two further and considerably easier tests:
E. Indexing new laboratory X-ray data.
F. Indexing synchrotron data.
These conditions correspond probably to more than 50% of the crystal structures stored in the ICSD and CSD database. The indexing programs should be applied to the following series of 10 data sets in automatic and manual modes. According to these benchmarks, the results of some of these indexing programs are reported in Table III as follows: ‘1’ point means that the correct cell was found in the first FoM position among the proposals; ‘0’ means that the correct cell is mixed with the incorrect ones, not at the head of the list, but listed among the first ten; The ‘ −1’ means that the correct cell was not found at all, or at a position larger than 10 in the lists.
TABLE III. Results for the bethanechol chloride benchmarks.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160712073306-53229-mediumThumb-S088571561200070X_tab3.jpg?pub-status=live)
(1) ICDD PDF entry 43-1748, λ = 1.5418 Å.
(2) ICDD PDF entry 46-1964, λ = 1.5418 Å.
(3) Conventional X-ray data, λ = 1.540 56 Å.
(4) Synchrotron data, λ = 0.6995 Å.
na, not applicable, there is only a manual mode (taken as −1 when calculating the totals).
0n means that it was the nth cell proposal.
DICVOL04a: fast default and manual tests made by A. Le Bail.
DICVOL04b: manual tests made by D. Louër.
These tests show that our approach gives results that are comparable to those given by other routines. The examples in which our routine has failed are examples A(1), A(2) and C(1). In A(1), the number of impurity lines was 9. Thus, the number of lines of the phase that we wanted to index (11) was not sufficient to compute the missing lines such as (0k0) and (00 l); in A(2) and C(1), the real solutions were found but appeared after the tenth. The merit factors of the other examples obtained by our method were, sometimes, smaller than those found by others. These drawbacks will be taken into consideration in future works to improve the proposed approach.
The global note in the last column in Table III is obtained by adding the 20 results (1, 0 or −1). Note that ‘na’ is taken as −1 when computing the totals.
For the easiest example F(4) given in Table IV, we remark that, as far as the cell parameters obtained by the different approaches are concerned, the difference is not noticeable. Nevertheless, it appears important in terms of figures of merit, but not to the point to doubt about one method or another.
TABLE IV. Unit cell parameters of example F(4) obtained by different programs.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary-alt:20160712073306-65160-mediumThumb-S088571561200070X_tab4.jpg?pub-status=live)
DICVOL04a: fast default and manual tests made by A. Le Bail.
DICVOL04b: manual tests made by D. Louër.
IX. CONCLUSION
In this study, we have presented a technique to index powder diffraction diagrams. Unlike the existing methods that are computationally complex, we have introduced a method to solve the problem of indexing in terms of a simple geometrical approach. To show the effectiveness of the proposed algorithm, several examples have been treated. We have shown that the first example has treated the problems inherent to the absence of lines (0k0) and (00 l) and the zero-point error, by introducing two new calculated values and correcting the zero-point error. Regarding the simplicity of the proposed algorithm, we should point out that the obtained results are comparable to those given in the JCPDS files with an acceptable execution time. Finally, we think that our method can be improved if some crystallographic aspects and other mathematical tools are further exploited.
ACKNOWLEDGEMENTS
The authors would like to thank Pr. Armel Le Bail from University of Maine and Dr Toufik Laroussi from University Mentouri at Constantine, for their valuable assistance.