I. INTRODUCTION
Tezacaftor, in combination with ivacaftor, is sold as Symdeko, and used to treat cystic fibrosis. Its function is to correct the positioning of the cystic fibrosis transmembrane regulator (CFTR) protein on a mutated cell surface to permit proper channel formation and improved flow of water and salts across the cell membrane. The IUPAC name (CAS Registry number 1152311-62-0) is 1-(2,2-difluoro-1,3-benzodioxol-5-yl)-N-[1-[(2R)-2,3-dihydroxypropyl]-6-fluoro-2-(1-hydroxy-2-methylpropan-2-yl)indol-5-yl]cyclopropane-1-carboxamide. The molecular structure of tezacaftor is illustrated in Figure 1.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_fig1.png?pub-status=live)
Figure 1. The molecular structure of tezacaftor.
The synthetic routes and crystalline forms of tezacaftor have been reviewed by Hughes (Reference Hughes2019). X-ray powder diffraction data have been reported in EP2563778B1 (Keshavarz-Shokri et al., Reference Keshavarz-Shokri, Zhang and Alcacio2016) and many equivalent patents. A single-crystal study was carried out, but no atom coordinates are reported in the patent. A powder pattern for amorphous tezacaftor is also reported.
This work was carried out as part of a project (Kaduk et al., Reference Kaduk, Crowder, Zhong, Fawcett and Suchomel2014) to determine the crystal structures of large-volume commercial pharmaceuticals and include high-quality powder diffraction data for these pharmaceuticals in the Powder Diffraction File (Gates-Rector and Blanton, Reference Gates-Rector and Blanton2019).
II. EXPERIMENTAL
The sample was a commercial reagent, purchased from Sigma (Lot #10611), and was used as-received. The white powder was packed into a 1.5-mm diameter Kapton capillary and rotated during the measurement at ~50 Hz. The powder pattern was measured at 295 K at beam line 11-BM (Lee et al., Reference Lee, Shu, Ramanathan, Preissner, Wang, Beno, Von Dreele, Ribaud, Kurtz, Antao, Jiao and Toby2008; Wang et al., Reference Wang, Toby, Lee, Ribaud, Antao, Kurtz, Ramanathan, Von Dreele and Beno2008) of the Advanced Photon Source at Argonne National Laboratory using a wavelength of 0.457899 Å from 0.5° to 50° 2θ with a step size of 0.001° and a counting time of 0.1 s step−1.
The pattern was indexed on a C-centered monoclinic unit cell with a = 21.05637, b = 6.61006, c = 17.75860 Å, β = 95.82°, V = 2458.92 Å3, and Z = 4 using Jade Pro 7.8 (MDI, 2020). The cell volume indicated Z = 4, and since tezacaftor is a chiral molecule, the space group was assumed to be C2, which was confirmed by successful solution and refinement of the structure. A reduced cell search in the Cambridge Structural Database (Groom et al., Reference Groom, Bruno, Lightfoot and Ward2016) yielded three hits, but no structures for tezacaftor derivatives.
A tezacaftor molecule was downloaded from PubChem as Conformer3D_CID_46199646.sdf. It was converted to a .mol2 file using Materials Studio (Dassault, 2020) and converted into a Fenske-Hall Z-matrix using OpenBabel (O'Boyle et al., Reference O'Boyle, Banck, James, Morley, Vandermeersch and Hutchison2011). The structure was solved by Monte Carlo simulated annealing techniques as implemented in DASH (David et al., Reference David, Shankland, van de Streek, Pidcock, Motherwell and Cole2006). A 010 preferred orientation coefficient was also included, but no Mogul Distribution Bias was applied. The success rate was ~40% of the 100 runs.
Rietveld refinement was carried out using GSAS-II (Toby and Von Dreele, Reference Toby and Von Dreele2013). Only the 1.0–25.0° portion of the pattern was included in the refinement (d min = 1.057 Å), with an excluded region 1.60–2.0° to ignore a peak from the Kapton capillary. All non-H bond distances and angles were subjected to restraints, based on a Mercury/Mogul Geometry Check (Bruno et al., Reference Bruno, Cole, Kessler, Luo, Motherwell, Purkis, Smith, Taylor, Cooper, Harris and Orpen2004; Sykes et al., Reference Sykes, McCabe, Allen, Battle, Bruno and Wood2011) of the model. The Mogul average and standard deviation for each quantity were used as the restraint parameters. The restraints contributed 8.5% to the final χ 2. The y-coordinate of F1 was fixed to define the origin. The hydrogen atoms were included in calculated positions, which were recalculated during the refinement using Materials Studio (Dassault, 2020). The U iso were grouped by chemical similarity. The U iso of the hydrogen atoms were constrained to be 1.3× that of the heavy atoms to which there are attached. The background was modeled using a 6-term shifted Chebyshev polynomial, along with a peak at 1.71° and 5.45° to model the scattering from the Kapton capillary and any amorphous component of the sample. The peak profiles were described using the generalized microstrain model, and a second-order spherical harmonic preferred orientation model was included. A few additional peaks indicated the presence of a second crystalline phase, which was identified as NaCl. Its concentration refined to 0.44(1) wt%.
The final refinement of 144 variables using 23 337 observations and 106 restraints yielded the residuals R wp = 0.0696 and GOF = 1.45. The largest peak (1.43 Å from F3) and hole (2.04 Å from C12) in the difference Fourier map were 0.26 and −0.21(5) eÅ−3, respectively. The largest errors in the fit (Figure 2) are in the positions and shapes of some of the low-angle peaks, and may indicate subtle changes in the beam during the measurement.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_fig2.png?pub-status=live)
Figure 2. The Rietveld plot for the refinement of tezacaftor Form A. The blue crosses represent the observed data points, and the green line is the calculated pattern. The cyan curve is the normalized error plot. The vertical scale is the square root of the intensity.
A density functional geometry optimization (fixed experimental unit cell) was carried using CRYSTAL14 (Dovesi et al., Reference Dovesi, Orlando, Erba, Zicovich-Wilson, Civalleri, Casassa, Maschio, Ferrabone, De La Pierre, D-Arco, Noël, Causà and Kirtman2014). The basis sets for the H, C, N, and O atoms were those of Gatti et al. (Reference Gatti, Saunders and Roetti1994), and that for F was that of Peintinger et al. (Reference Peintinger, Vilela Oliveira and Bredow2013). The calculation was run on eight 2.1 GHz Xeon cores (each with 6 GB RAM) of a 304-core Dell Linux cluster at IIT, using 8 k-points and the B3LYP functional, and took ~73 h.
III. RESULTS AND DISCUSSION
Hughes notes that only one crystalline form (Form A) of tezacaftor has been reported. The synchrotron pattern from this current study (Figure 3) matches well with both the observed and calculated patterns reported by Keshavarz-Shokri et al. (Reference Keshavarz-Shokri, Zhang and Alcacio2016; Vertex), confirming that we have studied the same material. The lattice parameters of the two determinations (Table I) differ slightly. Those from this synchrotron powder study are more precise than those from the single-crystal study. The current lattice parameters are slightly smaller, probably reflecting differences in preparations.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_fig3.png?pub-status=live)
Figure 3. Comparison of the synchrotron pattern of tezacaftor Form A to the patterns reported by Keshavarz-Shokri et al. (Reference Keshavarz-Shokri, Zhang and Alcacio2016). The experimental pattern from Figure 5 of the patent is presented in red, and the pattern of Figure 4 of the patent (calculated from the single-crystal structure) is given in green. The published patterns were digitized using UN-SCAN-IT (Silk Scientific, 2013) and scaled to the synchrotron wavelength of 0.457899 Å using MDI JADE Pro (MDI, 2020).
TABLE I. Lattice parameters (space group C2) of tezacaftor.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_tab1.png?pub-status=live)
The refined atom coordinates of tezacaftor Form A and the coordinates from the density functional theory (DFT) optimization are reported in the CIFs deposited with ICDD. The root-mean-square (rms) Cartesian displacement of the non-hydrogen atoms in the Rietveld-refined and DFT-optimized structures is 0.201 Å (Figure 4), within the normal range for correct structures (van de Streek and Neumann, Reference van de Streek and Neumann2014). The maximum difference is 0.508 Å, at O7, and reflects a difference in the conformation of that branch of the molecule. This discussion concentrates on the DFT-optimized structure, as we believe it is more reliable. The asymmetric unit (with atom numbering) is illustrated in Figure 5, and the crystal structure is presented in Figure 6.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_fig4.png?pub-status=live)
Figure 4. Comparison of the Rietveld-refined (red) and VASP-optimized (blue) structures of tezacaftor Form A. The rms Cartesian displacement is 0.201 Å.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_fig5.png?pub-status=live)
Figure 5. The asymmetric unit of tezacaftor Form A, with the atom numbering. The atoms are represented by 50% probability spheroids.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_fig6.png?pub-status=live)
Figure 6. The crystal structure of tezacaftor Form A, viewed down the b-axis.
The crystal structure (Figure 6) is apparently dominated by van der Waals interactions. The O–H⋯O hydrogen bonds link the molecules in C1,1(10) chains (Etter, Reference Etter1990; Bernstein et al., Reference Bernstein, Davis, Shimoni and Chang1995; Shields et al., Reference Shields, Raithby, Allen and Motherwell2000) along the b-axis. Visually, there are π⋯π stacking interactions roughly parallel to the 1-11 plane, but the analysis of the Mulliken overlap populations does not pinpoint them.
Almost all of the bond distances, angles, and torsion angles fall within the normal ranges indicated by a Mercury/Mogul Geometry check (Macrae et al., Reference Macrae, Sovago, Cottrell, Galek, McCabe, Pidcock, Platings, Shields, Stevens, Towler and Wood2020). The F3–C36 bond distance of 1.385 Å (average = 1.325(16) Å, Z-score = 3.6) is flagged as unusual. The equivalent F2–C36 bond has a Z-score of only 1.9. The C25–N11–C22 angle of 108.2° is slightly unusual (average = 111.2(9)°, Z-score = 3.1), but the uncertainty on the average is exceptionally low, making the Z-score artificially high. The C27–C20–N10–C16 and C30–C20–N10–C16 torsion angles are flagged as unusual. These both occur in tails of distributions and represent rotation around the C20–N10 amide bond. They represent the orientation of two parts of the molecule, which seems to be unusual. The C16–C12–C15–C17 and C16–C12–C15–C18 torsions are also noted as unusual, but these are part of a broad distribution with relatively few hits.
Quantum chemical geometry optimization of the isolate tezacaftor molecule (DFT/B3LYP/6-31G*/water) using Spartan ‘18 (Wavefunction, 2020) indicated that the observed solid-state conformation is 7.0 kcal mol−1 higher in energy than the local minimum (Figure 7), and that the conformations differ significantly. The rms Cartesian displacement is 1.793 Å. In particular, the C30–C20–N10–C16 torsion angle is −177.0° in the local minimum structure, consistent with the expectations from the Mogul geometry analysis. The local minimum conformation also differs from the global minimum energy conformation from a molecular mechanics (force field) analysis (Figure 8). Apparently both inter- and intramolecular interactions contribute significantly to determining the observed solid-state conformation.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_fig7.png?pub-status=live)
Figure 7. Comparison of the local minimum-energy (green) conformation and the DFT-optimized conformation (orange) in tezacaftor Form A.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_fig8.png?pub-status=live)
Figure 8. Comparison of the local minimum-energy conformation (green) and the global minimum-energy conformation (purple) of tezacaftor.
Analysis of the contributions to the total crystal energy using the Forcite module of Materials Studio (Dassault, 2020) suggests that angle deformation terms are the dominant contributions to the intramolecular deformation energy, as expected for a molecule, which contains fused ring systems and a cyclopropane ring. The intermolecular energy is dominated by electrostatic attractions, which in this force-field-based analysis include hydrogen bonds. The hydrogen bonds are better analyzed using the results of the DFT calculation.
Although there are hydrogen bonds in the structure (Table II), they do not seem especially important to the crystal energy. Surprisingly, only the hydroxyl group O7–H62 acts as a hydrogen bond donor, while the other hydroxyl groups O8 and O9 act as acceptors in multiple C–H⋯O hydrogen bonds, many of which are intramolecular. The energy of the O–H⋯O hydrogen bond was calculated using the correlation of Rammohan and Kaduk (Reference Rammohan and Kaduk2018). The carbonyl group O4 acts as an acceptor in two C–H⋯O hydrogen bonds, one of which is intramolecular. There is an intramolecular N–H⋯C hydrogen bond between the amide nitrogen N10 and C15 in the cycylopropane ring. Examining both the Mulliken overlap populations and the short intermolecular contact suggest the possibility of a variety of weaker interactions.
TABLE II. Hydrogen bonds (CRYSTAL14) in tezacaftor.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_tab2.png?pub-status=live)
a Intramolecular.
The volume enclosed by the Hirshfeld surface (Figure 9; Hirshfeld, Reference Hirshfeld1977; Turner et al., Reference Turner, McKinnon, Wolff, Grimwood, Spackman, Jayatilaka and Spackman2017) is 605.65 Å3, 98.55% of 1/4 the unit cell volume. The molecules are, thus, not tightly packed. All of the significant close contacts (red in Figure 9) involve the hydrogen bonds. The volume/non-hydrogen atom is 16.6 Å3.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210318124955036-0543:S0885715621000051:S0885715621000051_fig9.png?pub-status=live)
Figure 9. The Hirshfeld surface of tezacaftor Form A. Intermolecular contacts longer than the sums of the van der Waals radii are colored blue, and contacts shorter than the sums of the radii are colored red. Contacts equal to the sums of radii are white.
The Bravais–Friedel–Donnay–Harker (Bravais, Reference Bravais1866; Friedel, Reference Friedel1907; Donnay and Harker, Reference Donnay and Harker1937) morphology suggests that we might expect platy morphology for tezacaftor Form A, with {001} as the principal faces. A second-order spherical harmonic model for preferred orientation was incorporated into the refinement. The texture index was only 1.001(0), indicating that preferred orientation was slight in this rotated capillary specimen. The powder pattern of tezacaftor Form A from this synchrotron data set has been submitted to ICDD for inclusion in the Powder Diffraction File.
IV. DEPOSITED DATA
The Crystallographic Information Framework (CIF) files containing the results of the Rietveld refinement (including the raw data) and the DFT geometry optimization were deposited with the ICDD. The data can be requested at info@icdd.com.
ACKNOWLEDGEMENTS
Use of the Advanced Photon Source at Argonne National Laboratory was supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, under Contract No. DE-AC02-06CH11357. This work was partially supported by the International Centre for Diffraction Data. We thank Lynn Ribaud and Saul Lapidus for their assistance in the data collection, and Andrey Rogachev for the use of computing resources at IIT.
CONFLICTS OF INTEREST
The authors have no conflicts of interest to declare.