Using GPU Technologies To Drastically Accelerate FDTD Simulations
Using GPU Technologies To Drastically Accelerate FDTD Simulations
FDTD Simulations
Introduction
Wireless technologies play a significant part of the world we live in and have
quickly developed from obscure and mysterious to openly accepted and often
demanded. Cell phones, once a status commodity, are now common world-wide.
GPS devices, communicating with satellites over 10,000 miles away traveling at
thousands of miles per hour, are found in numerable devices giving users real-
time, precise location. WIFI stations communicate with a host of devices,
providing untold conveniences. Doctors gain precise detail about the inner
workings of patients for diagnosis and treatment. Uncounted devices operating
simultaneously and in close proximity necessitates precision and intelligence to
ensure correct and safe functionality. However, time to market for high tech
devices directly affects competitiveness and profitability. Is it possible to be
accurate and still get to market quickly? While the answer may be complex,
many may benefit from using a GPU accelerated Finite Difference Time Domain
method as described in this paper.
REMCOM INC. | 315 South Allen Street, Suite 222 | State College, PA 16801 USA
Tel: +1.814.861.1299 | Fax: +1.814.861.1308 | www.remcom.com
Overview of Finite Difference Time Domain Method
The Finite Difference Time Domain (FDTD) method has been utilized over the
last several decades and become increasingly more prevalent in scientific
research and technical industry. The origin of the FDTD method is generally
attributed to Kane Yee who, in a paper published in 1966 [1], described a method
for computing Maxwell’s Equations discretely in a time-stepping manor. This
technique was later expanded and named by Allen Taflove[2]-[4]. The FDTD
method is virtually unique to EM simulation methods because it directly
implements Maxwell's Curl Equations, which models electromagnetic fields at the
most elementary level. Since the method is fundamentally sound, FDTD is often
used to verify results originating from faster, assumption based techniques. The
method has been applied to problems ranging from kilohertz to visible light. While
accurate, the FDTD method has inherent obstacles that have kept it from being
universally used. For example, the entire computation space must be evaluated
at each time step and the grid dimensions must be sufficiently small to accurately
model the signal propagation. For effectively large project spaces, FDTD-based
codes may become memory intensive and relatively slow.
Figure 1: A CAD representation of an F/A-22 Raptor rendered in XFdtd version 7.0.3. CAD
images may be imported or created in geometry space and easily converted into FDTD grid.
Remcom’s XFdtd® is an EM solver based on the FDTD method and has been
used significantly to model structures that require a high level of fidelity. For the
FDTD results of this paper, Remcom’s XFdtd version 7.0.3 was utilized.
Now, we consider the actual timing comparisons for some examples. The
combination of these examples begins to showcase the benefits of coupling the
GPU technology with the accuracy of FDTD simulations.
Rotman Lens
The Rotman Lens can be
costly to simulate since the
device is electrically large
and contains a relatively
complex geometry along
one plane. The lens shown
in Figure 4 is resolved in a
geometry that requires
about 1 GB in RAM. A
broadband source was used
and only S-Parameters were
requested during simulation
time.
GPU simulations were run Figure 4: Rotman lens as generated by Remcom’s RLD
software and imported into XFdtd.
using one and two NVIDIA
Tesla C1060 cards. This
produced a performance
increase of about 49x and 75x, respectively. Due to its nature, the Rotman Lens
may run for multiple days to resolve a single device. With a 75x speedup, a
simulation requiring three days to complete would complete in just less than one
hour.
When running this on the Tesla cards, performance speedups of 43x and 54x
were achieved for one and two cards, respectively. This was a significantly large
project and a noticeable amount of data was requested, but we were still able to
realize more than a 50x speedup. This could be the difference between two
weeks or six and one quarter hours.
Cell Phone
The simulation of a cell phone represents an
interesting challenge for EM simulation tools. The
modeling of internal conductors and dielectric
components for most handheld devices requires a
high degree of fidelity. A typical simulation may
require the calculation of SAR information, which
carries a significant amount of data transfers.
90
80
70
60
50
40
30
20
10
0
Quadro FX 2x Tesla
Tesla C870 Tesla C1060
5600 C1060
8x8 Patch Array 17.81 13.74 46.79 45.83
Rotman Lens 48.94 74.45
Vivaldi Quad Array 43.11 54.19
Cell Phone 29.19 13.17 54.26 87.99
[6] E. Thiele and A. Taflove “FD-TD Analysis of Vivaldi Flared Horn Antennas
and Arrays”. IEEE Trans. Antennas Prop., AP-42, 1994, pp. 633-641.