Optimal Design of Efficient Acoustic Antenna Arrays
Optimal Design of Efficient Acoustic Antenna Arrays
North-Holland
O P T I M A L D E S I G N OF E F F I C I E N T A C O U S T I C A N T E N N A
ARRAYS
B. B U E H L E R
Naval Underwater Systems Center, Code 3292, New London, CT 06320, USA
A.D. WAREN
Department of Computer and lnJormation Science, Cleveland State University, Cleveland, OH 44115,
USA
Minimax optimal design of sonar transducer arrays can be formulated as a nonlinear program
with many convex quadratic constraints and a nonconvex quadratic efficiency constraint. The
variables of this problem are a scaling and phase shill applied to the output of each sensor.
This problem is solved by applying Lagrangian relaxation to the convex quadratic constraints.
Extensive computational experience shows that this approach can efficiently find near-optimal
solutions of problems with up to 391 variables and 579 constraints.
Key words: Acoustic arrays, optimal design, Lagrangian relaxation, nonlinear programming,
computer implementation.
1. Introduction
2. Problem formulation
Consider the sensor shown in Fig. 1, whose position is specified by the vector
r = (x, y, z). Acoustic plane waves of wavelength A, incident in a direction specified
by the unit vector u = (cq/3, 3') impinge upon this sensor. The quantities a,/3, y are
direction cosines, specified by
Prop or~ Of
agatibn, u
where 0 and q5 are the spherical coordinate angles. It is also convenient to define
the "wave number"
k = (2~/A)u
Let the incoming signal have unit amplitude and zero phase at the origin. The
sensors output, os, can be represented by the complex number
os = B ( k ) exp(ikTr)
where B is a (possibly complex) response function, characteristic of the sensor,
giving its response to waves of different wavelengths incident from different direc-
tions. The phase term k T r arises because the plane wave reaches the sensor r seconds
before it reaches the origin, where
= d/v = d/fA
The variable d is the additional distance traveled, shown in Fig. 1, and f is the
wave's frequency. From the figure it is evident that
d = rVu
so the added phase (in radians) is
2~f~" = (2~/A)rTu = rVk.
Associated with each sensor are 2 adjustable parameters, a (complex) shading
coefficient, w, and a steering phase rTks. The shading w represents the effect of
applying an amplitude scaling and a phase shift to the sensor's output, prior to
summing over all outputs to obtain the array output. This is illustrated in Fig. 2.
The steering phase arises from a time delay, used to insure that signals from all
sensors are in phase when the incoming wave arrives from a specified steering
direction. After these operations are applied to its output, os, the sensor response is
rs = w . B ( k ) 9 exp(irV(k - ks)). (2.1)
134 L.S. Lasdon et al. / Optimal design
J DELAY ~ ] J
"[[ ~E,AY
"FI ~ al I] i
E
DELAY
L e t j be the sensor index. The farfield array response, also called the beam pattern,
is obtained by summing (2.1) over all sensors:
tl
a(k, ks, w) = Y vvsBj(k ) exp(irT(k-- ks)). (2.2)
.j= 1
In the above, we redefine w to be the vector o f all shadings ( w ~ , . . . , w,,). These are
the design variables in the remainder of this paper. The sensor positions ri are
assumed fixed.
Minimax formulation
If ks and w are fixed, and a(k, ks, w) is evaluated in some plane o f interest, then
10 log,01a[-', the magnitude o f the pattern in decibels, may be plotted as a function
o f an angle within the plane. Such a plot is shown in Fig. 3, where the steering
angle is 90 ~ The shadings w have been adjusted so that lal = 1 at the steering angle,
90 ~. The region about the steering angle, out to the first null, is called the main lobe,
while the remainder o f the pattern on either side o f the main lobe is called the
sidelobe region. A frequent objective o f array design is to achieve sidelobe levels
which are as low as possible, given a m a x i m u m acceptable width for the main lobe.
Of course, in the general case, we deal with three-dimensional array responses, so
the main and sidelobe regions must be specified in 3-space.
This statement lends itself naturally to a minimax formulation. First we choose
a finite set o f sidelobe points at which the pattern is to be evaluated. The mth such
point is specified by direction cosines u,,, = (a,,,, fl .... 3',,), m = 1. . . . , M. It is con-
venient to let the steering direction be the zero'th point, i.e.
k,,, = (2~r/A)um
and define
o
o
i I I i I i i
O. O0 20,0 40.0 6r).O 80.0 100.0 120,0 140.0 160.0 180.0
ao(w) = 1. (2.5)
subject to
and
ao(w) = 1 (2.8)
Unfortunately, solving (2.6)-(2.8) usually leads to beam patterns which are both
extremely sensitive to r a n d o m errors and have low signal to noise gain for spatially
136 L.S. Lasdon et aL / Optimaldesign
uncorrelated noise. Random errors can occur either in the amplitude or phase
response of the sensors, or may be due to deviation of the wave front from a plane
wave [1]. As the sensitivity and signal to noise properties of an array improve, its
ability to achieve uniformly low sidelobes decreases. In [1], Cox and Thurston show
that, by imposing the constraint
Ne = wj lwil2~> N~ (2.9)
j=
and choosing the level -Ne appropriately, these problems can be resolved. The
constraint is derived assuming omnidirectional elements, but is an acceptable
approximation for the problem considered here. The quantity ATe is called the
effective number of elements; and (2.9) is called the efficiency constraint. It is shown
in [1] that
Ne<~n
and typical values for Ne may range from 0.4n to 0.8n.
am(w)= ~ (rwj+ilwj)(rsi,,+iIsi,,)
j--I
Hence, the pattern magnitude squared at each farfield point (a,,,,/3m, y,,,) is a convex
quadratic function of the design variables x. This is an important structural feature
of this problem, and is exploited by the algorithms used.
The number of effective elements, Are in (2.9), can be expressed in terms of x by
Ne + xo+ ) (2.20)
bTox = 0, (2.24)
and the dual feasible region is the set of nonnegative A for which this infimum is
finite. Since z is unconstrained, the infimum is finite if and only if the coefficient
of z vanishes. Hence, the Lagrangian dual problem (LD) is
Problem LD
maximize h(A) (3.1)
subject to
M
where
Since the objective of the above minimization problem is the weighted LI norm of
the vector of squared pattern magnitudes, we call the problem LI(A) and rewrite
it here:
Problem LI(A)
M
subject to
a ~ x = 1, (3.5)
b-~x = O, (3.6)
- + x , , + k ) ~> O. (3.7)
\k=l k 1 k=l
2. If the dual subproblem LI(A) has a unique solution x(A), then h is differentiable
at A and its gradient is
Vh(A)=g(x(A))
a ,~ (x ~ - ~) = 0
where
2= max gm(x ~
I~m<~M
is the peak sidelobe level and x ~ is an optimal solution to SD. This implies that
3,~ for most sidelobe points, since most are below the peak level. This fact is
used in the dual ascent algorithm, where we attempt to set several A,,,'s to zero
during each linesearch.
Our Lagrangian relaxation algorithm for SD has the following major steps:
0. set A = ( l / M , . . . , 1 / M ) , i t e r = 0 and tol and itmax to user provided values
(default: tol = 2, itmax = 4).
1. solve LI(A), obtaining an optimal solution x(A). During this iterative process,
each time a new feasible point x is obtained compute the peak sidelobe level in
decibels as
Let ubd be the smallest value of peak obtained thus far. ubd is an upper bound on
the solution of SD.
2. Compute
The expression, along with expressions for the constraint gradients, are coded for
GRG2 in a subroutine PARSH. The objective and constraints are evaluated in a
T
separate subroutine called GCOMP. Since the terms (a,,x) and ( b T,,x ) are common
to the objective and its gradient, they are computed only once, in G C O M P , and
are transferred to PARSH in a C O M M O N block. This makes the objective gradient
much faster to compute than the objective itself.
Since the efficiency constraint (2.25) is nonlinear and is almost always active,
some Newton iterations are needed by G R G 2 to restore feasibility each time the
nonbasic variables are changed. Since there are at most three basic variables, and
two of the three active constraints are linear, the average number of Newton iterations
is small--less than one in all cases observed thus far. Further, the Newton algorithm
does not require the value of the objective of L1 ()~). Hence, in G C O M P , the objective
computation is skipped in phase 1, or if the nonlinear constraint is not satisfied to
within the required tolerance. The elimination of such superfluous objective evalu-
ations has yielded time savings of from 17 to 36 percent on test problems ranging
from 15 to 31 sensors, using complex shadings.
L.S. Lasdon et al. / Optimal design 141
In step 5 (the dual ascent step), the conditions &/> 0 have been replaced by
A~/> h min
where Amin = 10 -7 currently. Early tests indicated that, if weights were set to zero,
pattern values corresponding to such weights sometimes took on peak or near peak
values in subsequent subproblems. At the start of step 5, we evaluate the sets S and
N of superbasic and nonbasic weights respectively (see [7]) as
S = { / [ h i > hmin},
N={/[Ai=hmin}.
The objective gradient, V h, is projected into the intersection of the active constraints,
i.e. into the set
One way to reduce the size of these optimization problems is to decompose the
sonar array into subarrays. Let subarray elements be indexed consecutively, with ip
and ./~, the indices of the initial and final elements in subarray p. A decomposition
occurs if the shading coefficient for sensor j, wj, is factored as
(5.2) becomes
P
a(k, ks, w") = ~, w~ rp(k, ks). (5.4)
p=l
Since the complex number rp can be computed in advance for all specified values
of k and ks, (5.4) has the same form as the original array response expression (2.2),
with r~, playing the role of the sensor response. Hence, the problem can be expressed
in terms of real quantities as in (2.21)-(2.25), where x now contains the real and
L.S. Lasdon et aL / Optimal design 143
imaginary parts of the subarray weights wp, and RS and IS in (2.14)-(2.15) are
replaced by the real and imaginary parts of re .
A subarray capability based on these ideas has been included in our computer
implementation. Some modifications to the expression for the effective number of
elements (2.9) are required. Substituting (5.1) into (2.9) yields
where
.r
% = Z wjp, (5.6)
.i- i~,
S;
~ip: Z Iwj.l -~. (5.7)
.]- io
Defining
% = r% +iI%,
a = a 9 a
Wp rwe+llWe,
Num= ~ (r%xp-l%xp.,.p) + ~. ( I % x p + r % x e + p) ,
p=l .3 I_e=l
P
D e n o m = )~ [ 3 e ( x 2 + x ~ + Q .
p=I
6. Computer implementation
D a t a structures
Most o f the time required to solve the weighted L1 subproblem LI(A) (over 90%
in large problems) is used to evaluate its objective, rewritten here as
M
This requires evaluation of 2 M scalar products a ,v, , x and bV,,,x, defined by (2.12)-
(2.19), rewritten here as
n
a ~ x = ~ rs;,,,x j - E Isj,,x,,+, (6.2)
j--I j=l
Storage is reserved for each scalar product a ~ x , bV,,,x, m = 1 . . . . , M , since they are
used in the computation of V f Denoting aV,,,x by a t x ( m ) and bV,,x by b t x ( m ) , these
quantities are computed via the following logic:
for rn = 1, M do
axl =0, bxl : 0
f o r j = 1, n do
k:=(j-1)n+m
ax l = ax l + RS( k )x(j)
bxl = bxl + RS(k)x(n +j)
end
atx( m ) = ax l
btx(m) = bxl
end
for m = 1, M do
a x 2 = O, b x 2 = 0
forj = 1, n do
k=(j-1)n+m
ax2 = ax2 + IS( k )x( n + j)
bx2 = bx2 + IS(k)x(j)
end
atx(m) = atx(m) - ax2
btx( m ) = btx( m ) + bx2
end
This scheme computes two of the partial sums in a single pass through the RS
array, followed by a pass through the IS array to compute the other two. This has
three advantages:
L.S. Lasdon et aL / Optimal design 145
Storage allocation
Most of the arrays used in the optimization program have dimensions which
depend on problem size (number of sensors, number of farfield points, or number
of subarrays). A pseudo-dynamic storage allocation scheme is employed, which
partitions all working array space out of blank C O M M O N . This reduces the need
to redimension and recompile when problem size changes, and allows storage used
to be more easily matched to what is required. It also assures that all arrays used
in solving the subproblems Ll()t ) are contiguous, which helps reduce paging activity.
Algorithm modifications
The objective gradient V f in (4.1) is a sum of M terms. In problems with M = 284
and n =42, the G R G 2 optimizer encountered difficulties which stem from inac-
curacies in V f Several subproblems terminated when no progress was made in a
search along the negative reduced gradient direction. At that point, the norm of
this gradient was not small, and only a few iterations had been done. We hypothesized
that this was due to roundolt error resulting from cancellation of terms in V f of
approximately equal magnitudes and opposite signs. These problems were elimi-
nated by accumulating separately positive and negative terms (for each component
of Vf) in the sum (4.1), adding them together only after all M terms had been
processed.
System design
7. Computational results
Initial implementations of this approach did not include the efficiency constraint
(2.9). Characteristics of six problems solved without this constraint are shown in
Table 1. All have omnidirectional sensors, i.e. B(k) = 1 for all k, and all but HA4
are line arrays. HA4 has a partial hoop geometry. All were solved allowing the
Table 1
Problem characteristics
Size of k l Problem
Sidelobe
Problem Points Sensors Variables Constraints
STI 30 7 14 2
ST2 60 7 14 2
ST3 30 15 30 2
ST4 60 15 30 2
HA4 182 31 62 2
LINE 172 49 98 2
L.S. Lasdon et aL / Optimal design 147
Table 2
Results for the problems of Table I
linear, and a conjugate gradient method with exact linesearch is employed. Computa-
tion times increase rapidly with problem size, reflecting the increased time required
to evaluate the pattern at all sidelobe points.
The second set of results, given in Table 3, shows the effects of imposing the
efficiency constraint. All problems have omnidirectional sensors and a "stacked
h o o p " geometry, with the sensors mounted on one, two, or four identical hoops,
each containing 42 sensors, spaced along the x axis. The " E F F ( % ) " column gives
the required percent efficiency, IVe/n. All problems were run on a VAX 11/780 at
the Naval Underwater Systems Center in New London, Connecticut. Note the rapid
increase of the peak sidelobe level as the percent efficiency requirement is increased.
When no efficiency constraint is imposed, the final number of effective elements is
unacceptably small, and tends to decrease with problem size. There is obviously a
strong tradeoff between efficiency and peak sidelobe level. The final distance between
upper and lower bounds is small, and tends to decrease as the efficiency specification
increases.
Table 3
Runs of stacked hoop problems with different percent efficiency levels
The effects of varying the width of the main lobe are shown in Fig. 4. These
patterns, all within approximately 2 db of minimax optimality, are for the 31 sensor
problem HA4 of Table 1. The sensors are arranged on a partial hoop in the half of
the yz plane with y positive, and both farfield points and the patterns shown lie in
that half plane. Farfield point spacing is 1 degree, all sensors are onmi-directional,
the frequency is 300 hz, required efficiency is 50%, and shadings are constrained to
L.S. Lasdon et al. / Optimal design 149
I+
i =- T--T---~--~
';-_;_ ~,,
I
i ~
i ~--17-+-~i
i I / ~-L~
+
t
i
!
-
i i I t
i i i i- i i i -
i i i !~___l__.{ i
i ~ i :l i II I i '
I ; l I# : I+ 1 ;
400 : ~ I [
I
' '' i
7' i!i I!~h- - - - + . . . - - + ~ . , - - + . ~ . . - - 4 . ' ' ' '
beamwidth
",,,'......
- I , I ',---1---- /i +-:~
i ~ I i i I
, ~ , , , _~ . . . . I .i ' ; i i i
0. , , , ! , I , I , , , l
i i i i . I ; ; ; i i i ;
"i~i-i
V \!/\!~
t :i ,I iI f
ii Ji ii iiA!l iAIAi
\i/ ",4
'" ! 'v' \i/'\~ I I i -i i i/-'+t-!,l-'f I
.,.I i 't/ I i I i i i i i J. tf l !
1 x ~ ; i i i ; it ; i i t.... [l l i i
~.i
i
i
i
1 ,\(k i i i i I li I i i [\ll ; i[ iI
, , i" i
,w i~ T
, i
, i i" i
, i, "
,i I
, I
, /l v! i
i i !
i i ~ ~ :1 ', ! 3 I : |I , 1 I l; I : ', ;
SL'
. . ~. . I~. . l . . !
'
II
~
I
'
!
*
I
!
'
~
!~ I
!' !
~
1 i
,
4.
,
---4-
I
,
---L-'
;
I '
--4------4~
l
, I
,
I l 1 I ~ ' ~ !
, I I i I I i I I I 1 i l i i l
llO. I I .I.-- I I --~ 4 1 L l ~ ~ I I I I ~
-Illl~. -110. -lit0. -120. -ill. 4G. .-+tlEI. .Ill. *-IQ. 0. 1O. till. ~lli. IO. l~II. 120. llll0* lli0. lllg.
a. ' ~ .... ~ I l , ! I ! !
I i i iN I i Ii I i -- I I I
ll. i i i i --~
' i I
20 o u. +---- ; , : ,; i i
beamwidth i I i i i i i
u.i i- i ,i---' i
, i ! 1 !
Ii. ~ I ! I '. ! --~ 1 '.
.. + - - - - + - - . + -' 1 1
. I - - 1 - . . ._l. I I, - - - ~ .!- - - + - - + - - - - ~ - - - - + - - - I I
-ii~. -iio. -ll~, -l~l~. -ioil -i10. .-I~, .-II. .,~i, 0,. IG. ii0. 6Q. III. to0, i~.0. Ill. iP+o. 111o.
I intervalfor optimization--~
Ik
' i I I
, I ~I
~ ' !
' ~
' ~
' ~' ~
' ~I ''
k.[~, , !V , i ! ! ! , !
__i__L__L_] i ! i Ii
, , ! , t t !~ ! ! i i
tO~ dth
beamwi ! ~ i ! i i *,* i i
I
I
' 1 i i 1 i I
Fig. 4 (continued).
Q" ~ i 1 M I t I I ! !
' I
""
S~ i
! i i I I i I '.
i I i i i l
o1~. -isn. -t~n. -lzo. -1oo. --~, ..~. -~o. --~. o. zo...~..- ~, |~. (~. llm. l~e_ l~o. 1~.
(1+cos0)2/4, 0~<90 ~
B(k)= 0, 0>90 ~
Table 4
Data for two 391 sensor problems
Case 1 Case 2
_I "/I
: - 7
~ J
i ' i
5 "+"---": .... , .. ! . . . . -.a . . . . . . . . . . . .
i " ' i i J
I I! I ~ #
i
!
,
I /
9 1
I
i
!
,
!
1 ; i i 1
.... j
~
1~ ii 1i
_.__~L____-I.
i i~......... _ j
; ; i ] i
Horizontal Plane
O. i i
I , i0
i i , i V ~
-l-~
in u
....... vi ....... ~-i ..... +i ....... / ~~ i
i
i
~IA~
Ill l;
i
~
i
;
i /!
~ l ~
i
I
Vertical Plane
Fig. 6. O p t i m i z e d p a t t e r n s in h o r i z o n t a l a n d v e r t i c a l p l a n e s for C a s e I.
algorithm. In case 1, subproblem 1 (see the row with ltn = 0 ) yields a pattern with
peak of -27.98 db in the 28th of its 138 iterations. Subproblem 2 improves this to
-29.39 db in the sixth of 120 iterations. The remaining 3 subproblems improve the
lower bound, but not the pattern peak. This behavior is even more pronounced in
case 2. Subproblem 1 finds a pattern with peak of -34.86 db in the 23rd of 113
iterations, and further subproblems fail to improve on this value. Clearly, early
termination of each subproblem, based on behavior of the pattern peak, could
produce dramatic time savings. This would invalidate the lower bound computation,
but a final run with normal G R G 2 termination criteria would re-establish this bound.
Such a strategy would allow more subproblems to be solved, and would permit the
weights A to be adjusted by the dual ascent step more often. Experiments with these
ideas are currently under way.
L.S. Lasdon et al. / Optimal design 153
I i
18b
Horizontal Plane
i |
15.,
i
T - -
j,- ] - - - -
J
i I I
I
I J i
30. "' ~ l " - - " - - ~ i
! i i i
i
1 \ t
i
30 -G 18b
Vertical Plane
Fig. 7. O p t i m i z e d p a t t e r n s in h o r i z o n t a l a n d v e r t i c a l p l a n e s f o r C a s e 2.
8. Conclusions
Table 5
Iteration summary for 391 element problems
Subprob. Subprob.
Itn when Solution
Pattern Lower Subprob. Best Peak Time
Problem Itn max (db) Bound Diff. ltns Found (hrs.)
Acknowledgement
T h e a u t h o r s w i s h to t h a n k M s . P a t M a c i e j e w s k i a n d M r . M o h a m m e d Ahmed of
the Naval Underwater S y s t e m s C e n t e r f o r t h e i r a i d in t h e c o m p u t a t i o n a l phases of
t h i s w o r k , a n d Dr. N e a l G l a s s m a n o f t h e Office o f N a v a l R e s e a r c h f o r h e l p i n g to
initiate the research.
References
[ 1] H. Cox and J. Thurston, "'Advanced conformal submarine acoustic sensor beamforming engineer-
ing,'" Report No. 5258, Bolt Beranek and Newman, Inc., Suite 400, 1300 N. 17th St., Arlington,
VA 22209, (March 1983).
[2] D.J. Edelblute, J.M. Fisk and G.L Kinnison, "Criteria for optimum signal detection theory for
arrays," Journal q( the Acoustical Society o f America 41 (1967) 199-205.
[3] L.S. Lasdon, A.D. Waren and D. Suchman, "Optimal design of acoustic sonar transducer arrays,"
Proceedings oI'IEEE International Co~!ference on Engineering in the Ocean Environment 1 (1974) 3-10.
L.S. Lasdon et aL / Optimal design 155
[4] L.S. Lasdon and A.D. Waren, "Generalized reduced gradient software for linearly and nonlinearly
constrained problems," in: H. Greenberg, ed,, Design and hnplementation of Optimization S@ware
(Sijthoff and Noordhoff, 1978) 363-397.
[5] W. Mossberg, "'Quiet defender,'" The Wall Street Journal 1 (May 19, 1983) 22.
[6] B.A. Murtagh and M.A. Saunders, "Large-scale linearly constrained optimization,'" Mathematical
Programming 14 (1978) 41-72.
[7] J. Plummet, L.S. Lasdon and M. Ahmed, "'Solving a large nonlinear programming problem on a
vector processing computer,'" Working Paper 85/86-3-1, Department of Management Science and
Information Systems, School of Business Administration, University of Texas, Austin, TX 78712
(1985).
[8] D.F. Shanno, "'Conjugate gradient methods with inexact searches," Mathematics of Operations
Research 3 (1977) 241-254.
[9] R.L. Streit and A.H. Nuttall, "'A general chebyshev complex function approximation procedure
and an application to beamforming," Journal of the Acoustical Society of America 72 (1982) 181-190.
[ 10] A.D. Waren, L.S. Lasdon and D. Suchman, "'Optimization in engineering design," Proceedings of
the IEEE 55 (1967) 1185-1196.