Introduction
The method of moments (MoM) [Reference Harrington1] solutions of surface integral equations have been demonstrated to be very effective in analyzing the electromagnetic (EM) radiation and scattering problems. The electric field integral equation (EFIE), the magnetic field integral equation (MFIE), and the combined field integral equation (CFIE) have been popular choices for an arbitrary three-dimensional (3-D) perfect electric conducting (PEC) object. However, conventional MoM solutions suffer from O(N 2) storage for matrix fill-in and O(N 2) operations per iteration for matrix vector multiplication (MVM) with an iterative matrix solver. Both the EFIE and the MFIE suffer from the interior resonance problem. To overcome these difficulties, this paper proposes an integral equation-fast Fourier transform (IE-FFT) algorithm with CFIE as the extension of the IE-FFT algorithm [Reference Seo, Wang and Lee2] without the help of a preconditioner. The conventional IE-FFT with EFIE should have a preconditioner for electrically large structures.
One of the fastest and most efficient algorithms for the MoM solutions is the multilevel fast multipole method [Reference Song and Chew3], which reduces the computational complexity from O(N 2) to O(Nlog N). Unfortunately, the strong dependence of multipole-based methods on integral kernels makes the methods inadequate for general kernel-independent techniques. To avoid the kernel-dependence, algebraically mathematical algorithms such as integral equation-QR factorization [Reference Seo and Lee4] and adaptive cross approximation (ACA) [Reference Zhao, Vouvakis and Lee5] have been proposed. Nevertheless, the numerical complexity of these algorithms may not be clear whether it is O(N 1.5) or not.
There is another physical class of fast IE methods based on grid-based approaches such as the adaptive integral method (AIM) [Reference Bleszynski, Bleszynski and Jaroszewicz6], the precorrected-FFT (p-FFT) [Reference Phillips and White7, Reference Nie, Li and Yuan8], and the IE-FFT. The AIM and the p-FFT algorithms need unknown “equivalent” source approximation on a Cartesian grid. Unlike these algorithms, the IE-FFT applies known Green's function interpolation on a Cartesian grid. The IE-FFT algorithm uses one global FFT for field interaction terms [Reference Seo, Wang and Lee2, Reference Seo and Lee9]. However, the near-field interaction terms do not take care of the singularity of the Green's function and are adequately corrected by the traditional MoM. Recently, a couple of the IE-FFT papers with CFIE [Reference An and Lü10, Reference Xie, Zhou, Li and Hong11] have been published. One IE-FFT with CFIE [Reference An and Lü10] is to use the gradient of Rao–Wilton–Glisson (RWG) basis function. The weakness of the paper should have the second order of the RWG basis function to obtain the same order of accuracy with the conventional MoM. To overcome this problem, another IE-FFT with CFIE [Reference Xie, Zhou, Li and Hong11] is proposed. However, the paper uses two coefficient matrices of Green's function, which is the most dominant numerical complexity in the IE-FFT. The proposed IE-FFT algorithm with CFIE follows the basic guidelines from the original IE-FFT paper [Reference Seo, Wang and Lee2]. The uniqueness of the IE-FFT algorithm lies on the simplicity and the rigorous error control. The Lagrange interpolation of Green's function simplifies the formulation and enables an efficient algorithmic implementation. The proposed IE-FFT algorithm with CFIE not only has better convergence rate but also removes the interior resonance. The IE-FFT with CFIE does not always need to have an appropriate preconditioner. Also, it works algebraically simple Lagrange polynomials on a 3-D uniform Cartesian grid. The proposed algorithm definitely guarantees O(N 1.5) complexity for memory requirement and O(N 1.5log N) complexity for MVM.
CFIE formulation
The CFIE formulation is well known for an arbitrarily shaped 3-D PEC objects. The resulting matrix equation can be written as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn1.png?pub-status=live)
where Z CFIE is the impedance matrix, J is the unknown surface current vector, and V CFIE is right-hand side (RHS) vector. The impedance matrix is rewritten as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn2.png?pub-status=live)
where ρ is the weighting constant of the CFIE and η 0 is the intrinsic impedance of free space. The entries of the impedance matrices Z EFIE and Z MFIE for the EFIE and the MFIE are, respectively, given by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn3.png?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn4.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn5.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn6.png?pub-status=live)
and
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn7.png?pub-status=live)
N is the number of unknowns. Note that supp() indicates the support of non-boundary edge. Here, ${\bf A}_{i\comma j}$ and ${\bf D}_{i\comma j}$
are the impedance entries of vector and scalar potentials from the discrete Galerkin statement of the EFIE. ${\bf C}_{i\comma j}$
and ${\bf P}_{i\comma j}$
are singular and coupling entries of the impedance matrix from the discrete Galerkin statement of the MFIE. In the paper, the free-space Green's function $g\lpar {\vec{r}\semicolon \;{\vec{{r}}^{\prime}}} \rpar = e^{{-}jk_0\vert {\vec{r}-{\vec{{r}}^{\prime}}} \vert }/\vert {\vec{r}-{\vec{{r}}^{\prime}}} \vert$
is considered. The free-space wave number is denoted by k 0. $\vec{\alpha }_i\lpar {\vec{r}} \rpar$
stands for surface div-conforming vector RWG basis function [Reference Rao, Wilton and Glisson12]. The RHS is given by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn8.png?pub-status=live)
The entries of the RHS vector are written as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn9.png?pub-status=live)
and
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn10.png?pub-status=live)
where $\vec{E}^{inc}\lpar {\vec{r}} \rpar$ and $\vec{H}^{inc}\lpar {\vec{r}} \rpar$
are the incident electric and magnetic field intensities, respectively.
The IE-FFT algorithm with CFIE
The IE-FFT algorithm with CFIE constructs a 3-D hexahedron bounding box that encloses the entire geometry. Figure 1 shows two discretizations: a triangular mesh for unknown RWG basis functions and a uniform Cartesian grid for known free-space Green's function.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_fig1.png?pub-status=live)
Fig. 1. Two discretizations for unknown surface current density and known free-space Green's function.
Note that C NF is a constant used to define the near-field correction region, and λ 0 is the wavelength in free space. Note r L is the sampling resolution. For example, L is the size of the third-order Cartesian element, which is the 3-D tensor product form of 1-D piecewise Lagrange polynomials on a Cartesian grid. The free-space Green's function can be written as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn11.png?pub-status=live)
where p is the order of Lagrange polynomials and N g is the number of 3-D Cartesian grid points. $\beta _n^p \lpar {\vec{r}} \rpar$ is the p th-order Lagrange interpolation basis function and $g_{n\comma {n}^{\prime}}$
is the “known” coefficient of the free-space Green's function. The vector and matrix form of the free-space Green's function can be expressed as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn12.png?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn13.png?pub-status=live)
and
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn14.png?pub-status=live)
Equation (3) is combined with equation (11) as follows:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn15.png?pub-status=live)
where
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn16.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn17.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn18.png?pub-status=live)
and
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn19.png?pub-status=live)
The IE-FFT algorithm with CFIE has three vector and one scalar ${\bf \Pi }$ matrices. One vector and one scalar matrix come from the EFIE and the other two vectors from the MFIE. When an iterative solver is used, the near- and far-field computations of the IE-FFT algorithm with CFIE are described below. d indicates the distance between basis and testing functions.
Near-field computation (d ≤ CNFλ 0)
From equation (1), the MVM is computed by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn20.png?pub-status=live)
The matrix Z CFIE is obtained by the traditional MoM approach.
Far-field computation (d > CNFλ 0)
From equation (15), the MVM is performed by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn21.png?pub-status=live)
where Z corr is a correction matrix. In Fig. 1, the coupling between near-interaction terms separated at least by C NFλ 0 should be sufficiently corrected to assure the accuracy. The coefficient of Green's function has singularity when a source and an observation point are close to each other. In this paper, C NF is chosen to be 0.3. The correction entries are given by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_eqn22.png?pub-status=live)
where 0 ≤ i < N, j ∈ C neig, and C neig is the set of the near-field interaction elements within C NFλ 0. The near-field terms between RWG basis functions i and j should be substituted by the entries of the traditional MoM techniques. The complexity of Z corr and ${\bf \Pi }$ matrices is O(N). However, the dense G matrix for the coefficients of Green's function leads to O(N 3) storage and MVM per iteration for a Cartesian grid. By virtue of the FFT [Reference Vico, Greengard and Ferrando13], the memory requirement reduces to O(N 1.5) and the MVM could be speeded up to O(N 1.5log N).
Numerical results
To demonstrate the accuracy and efficiency of the proposed algorithm, a PEC sphere of 1 m is considered. The discretization of the triangular mesh is 7 elements per wavelength. All numerical simulations are carried out on a 2GB RAM Intel(R) Pentium(R) M processor 1.6 GHz. All computations have been done in single precision arithmetic. As a matrix solver, a generalized minimal residual (GMRES) method [Reference Golub and Van Loan14] is used without the help of any preconditioner. The residual tolerance is set to 1.0e − 3. In the simulation, the constant ρ of the CFIE is chosen to be 0.5. Third-order Lagrange polynomials are used to interpolate free-space Green's function. Figure 2 shows the results of bistatic RCS at a frequency of 1200 MHz.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_fig2.png?pub-status=live)
Fig. 2. Bistatic RCS results for 1 m PEC sphere at a frequency of 1200 MHz. (a) E-plane, (b) H-plane.
The IE-FFT with CFIE solution is compared with the analytic Mie series solution. Also, it is compared with the EFIE and the MFIE solutions accelerated by IE-FFT. The results of the IE-FFT with CFIE have very good agreements with those of Mie series at both E-plane and H-plane. The results of the EFIE also have good agreements but bad convergence rate. Despite good convergence, the results of the MFIE have some discrepancies when compared with those of the Mie series. As we have already known, the inaccuracy of the MFIE comes from the hyper-singular integration. Therefore, the EFIE has more accurate results than the CFIE, which has the weighting portion of the MFIE. The results of bistatic RCS at a frequency of 2400 MHz are shown in Fig. 3.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_fig3.png?pub-status=live)
Fig. 3. Bistatic RCS results for 1 m PEC sphere at a frequency of 2400 MHz. (a) E-plane, (b) H-plane.
The IE-FFT of the EFIE does not converge within the number of iterations exceeded to a maximum of 2000. However, the IE-FFT with CFIE has a very good convergence rate and also accurate results at both E- and H-planes. Table 1 summarizes the performance of the IE-FFT algorithm with CFIE for third-order Lagrange polynomials in terms of memory. All units are megabytes (MB). The complexity of Z corr and Π matrices is clearly shown as O(N). However, the complexity of G matrix is O(N 1.5).
Table 1. Memory requirement of IE-FFT algorithm with CFIE for scattering from a PEC sphere with a radius of 1 m
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_tab1.png?pub-status=live)
Table 2 summarizes the CPU time and the number of iterations of the IE-FFT algorithm with CFIE. All units are seconds (sec). The number of iterations for the EFIE, MFIE, and the CFIE is tabularized in order. In a frequency of 2400 MHz, the EFIE does not converge within the number of iterations exceeded to a maximum of 2000. The convergence rate of the IE-FFT with CFIE bounds on a small number of iterations.
Table 2. CPU time and the number of iterations of IE-FFT algorithm with CFIE for scattering from a PEC sphere with a radius of 1 m
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_tab2.png?pub-status=live)
A realistic example considered here is a generic battleship. When the incident wave at θ = 90° and ϕ = 0° is considered, the bistatic RCS of θθ-polarization is computed by the IE-FFT algorithm with CFIE without the help of any preconditioner. The discretization of the triangular mesh is 5 elements per wavelength. Third-order Lagrange polynomials are used to interpolate free-space Green's function. The results of bistatic RCS at a frequency of 30 MHz are shown in Fig. 4. The results of the IE-FFT with CFIE are compared with those of the EFIE by the traditional MoM. Both results have very good agreements.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_fig4.png?pub-status=live)
Fig. 4. Comparison of the bistatic RCS results for the battleship at 30 MHz (θθ-polarization) using the IE-FFT with CFIE and the traditional MoM.
Figure 5 shows the results of bistatic RCS at a frequency of 60 MHz. Due to the prohibitive complexity of the conventional MoM, we compare the results of the CFIE for the IE-FFT with those of the EFIE for the ACA algorithm [Reference Zhao, Vouvakis and Lee5]. Both results have very good agreements.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_fig5.png?pub-status=live)
Fig. 5. Comparison of the bistatic RCS results for the battleship at 60 MHz (θθ-polarization) using the IE-FFT with CFIE and the ACA algorithms.
The results of bistatic RCS at a frequency of 120 MHz are shown in Fig. 6. The results between the IE-FFT of the EFIE and the IE-FFT with CFIE are compared. Both results have reasonable agreements.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_fig6.png?pub-status=live)
Fig. 6. Comparison of the bistatic RCS results for the battleship at 120 MHz (θθ-polarization) using the IE-FFT with CFIE and EFIE.
All the numerical complexities of the performed simulations are tabularized. First, the IE-FFT algorithm with CFIE is compared with the traditional MoM with EFIE in Table 3.
Table 3. Numerical complexities between the IE-FFT with CFIE and the traditional MoM of the EFIE for the battleship at 30 MHz (The number of unknowns is 12 972)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_tab3.png?pub-status=live)
The number of unknown is 12 972 and the operating frequency is 30 MHz. The CPU time of the MVM per iteration and the number of iterations are shown in the first and second columns. The memory of Z corr and ${\bf \Pi }$ matrices and G matrix is compared with that of the traditional MoM matrix in the third column. The traditional MoM requires an extra memory of geo-neighboring preconditioner [Reference Lee, Lee and Burkholder15] in the fourth column. It is undoubtedly proven that the IE-FFT with CFIE is more efficient than the traditional MoM. Second, the IE-FFT with CFIE is compared with the ACA of the EFIE in Table 4.
Table 4. Numerical complexities between the CFIE-FFT and the ACA of the EFIE for the battleship at 60 MHz (the number of unknowns is 48 363)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_tab4.png?pub-status=live)
The number of unknowns is 48 363 and the operating frequency is 60 MHz. Because of the memory limit of the current processor, the simulation cannot be operated by the conventional MoM. Even though the matrix of the IE-FFT with CFIE is non-symmetric, the memory of the IE-FFT with CFIE is approximately three times smaller than that of the ACA. The MVM per iteration of the IE-FFT with CFIE is twice worse than that of ACA. However, the ACA does not converge without the geo-neighboring preconditioner. Finally, the IE-FFT with CFIE is compared with the IE-FFT of the EFIE in Table 5.
Table 5. Numerical complexities between the IE-FFT with CFIE and the IE-FFT with EFIE for the battleship at 120 MHz (the number of unknowns is 189 606)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_tab5.png?pub-status=live)
Due to the storage limit of the processor in the simulation, the ACA cannot be performed. The number of unknowns is 189 606 and the operating frequency is 120 MHz. The total CPU time of both methods is approximately similar. The total storage of the IE-FFT looks better than that of the IE-FFT with CFIE. Nevertheless, the memory of the preconditioner is the only final result except for extra memory of reordering, factorization, etc. The realistic memory of the preconditioner is much bigger than the final written result. In the simulation, the IE-FFT of the EFIE does not converge without the geo-neighboring preconditioner. From the tables, the memory of Z corr and ${\bf \Pi }$ matrices follows O(N) complexity. The coefficient matrix of the free-space Green's function G is O(N 1.5) complexity. Therefore, the coefficient matrix will be more dominant as the electrical length is larger. Table 6 shows the number of iterations of the GMRES method according to the weight constant ρ of the IE-FFT with CFIE at a frequency of 60 MHz.
Table 6. The number of iterations with GMRES method according to the weighting constant of the IE-FFT with CFIE for the battleship at 60 MHz (the number of unknowns is 48 363)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_tab6.png?pub-status=live)
Both the EFIE and MFIE formulations accelerated by the IE-FFT do not converge within the number of iterations exceeded to a maximum of 5000. Both formulations may have internal resonance for this specific case. However, the IE-FFT with CFIE is free of internal resonance and has reliable convergence rate and solution.
In Fig. 7, the study of the error control of the IE-FFT with CFIE is performed indirectly in terms of the RMS errors of the bistatic RCS calculations [Reference Seo, Wang and Lee2]. The numerical simulations from the sampling segments per wavelength are carried out for 1 m PEC sphere at a frequency of 600 MHz. The average RMS errors are plotted. The dashed line indicates the RMS error of the conventional MoM with EFIE. The solid line with circular markers and dash-dotted line with square markers represent the RMS errors of the IE-FFT for the second- and third-order Lagrange polynomials, respectively. The RMS errors of the IE-FFT with CFIE converge to those of the conventional MoM as the sampling segments increase.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210203033426171-0638:S1759078720000653:S1759078720000653_fig7.png?pub-status=live)
Fig. 7. The RMS errors of the bistatic RCS calculations versus the sampling segments per wavelength (p = 2, 3).
Conclusion
The IE-FFT algorithm with CFIE has been implemented and demonstrated for arbitrary-shaped 3-D PEC objects. In the PEC sphere example, the IE-FFT algorithm with CFIE is shown to achieve O(N 1.5) and O(N 1.5log N) complexities for memory and MVM per iteration, respectively. The realistic example of the generic battleship shows the reliability of the IE-FFT algorithm with CFIE. The IE-FFT with CFIE is free of internal resonance and has reliable convergence rate and solution. The IE-FFT algorithm with CFIE can handle up to approximately 0.3 million unknowns with only 2GB memory.
Seung Mo Seo received the B.S. degree from Hong-Ik University in 1998 and the M. S. and Ph.D. degrees from The Ohio State University in 2001 and 2006, respectively, all in Electrical Engineering. During 2007–2010, he stayed in DMC R&D Center, Samsung Electronics. He is now with Agency for Defense Development.